org.apache.tika.parser.external
Class ExternalParsersConfigReader

java.lang.Object
  extended by org.apache.tika.parser.external.ExternalParsersConfigReader
All Implemented Interfaces:
ExternalParsersConfigReaderMetKeys

public final class ExternalParsersConfigReader
extends java.lang.Object
implements ExternalParsersConfigReaderMetKeys

Builds up ExternalParser instances based on XML file(s) which define what to run, for what, and how to process any output metadata. Typically used to configure up a series of external programs (like catdoc or pdf2txt) to extract text content from documents.

  TODO XML DTD Here
 


Field Summary
 
Fields inherited from interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
CHECK_TAG, COMMAND_TAG, ERROR_CODES_TAG, EXTERNAL_PARSERS_TAG, METADATA_KEY_ATTR, METADATA_MATCH_TAG, METADATA_TAG, MIMETYPE_TAG, MIMETYPES_TAG, PARSER_TAG
 
Constructor Summary
ExternalParsersConfigReader()
           
 
Method Summary
static java.util.List<ExternalParser> read(org.w3c.dom.Document document)
           
static java.util.List<ExternalParser> read(org.w3c.dom.Element element)
           
static java.util.List<ExternalParser> read(java.io.InputStream stream)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ExternalParsersConfigReader

public ExternalParsersConfigReader()
Method Detail

read

public static java.util.List<ExternalParser> read(java.io.InputStream stream)
                                           throws TikaException,
                                                  java.io.IOException
Throws:
TikaException
java.io.IOException

read

public static java.util.List<ExternalParser> read(org.w3c.dom.Document document)
                                           throws TikaException,
                                                  java.io.IOException
Throws:
TikaException
java.io.IOException

read

public static java.util.List<ExternalParser> read(org.w3c.dom.Element element)
                                           throws TikaException,
                                                  java.io.IOException
Throws:
TikaException
java.io.IOException


Copyright © 2007-2011 The Apache Software Foundation. All Rights Reserved.