java.lang.Object
  org.apache.tika.parser.AbstractParser
    org.apache.tika.parser.ExternalParser
Deprecated. Use ExternalParser instead.
@Deprecated public class ExternalParser extends AbstractParser
Very basic parser that uses an external program (like catdoc or pdf2txt) to extract text content from a given document.
Constructor Summary | |
---|---|
ExternalParser() | Deprecated. |
Method Summary | |
---|---|
java.lang.String | getCommand() Deprecated. |
java.util.Set<MediaType> | getSupportedTypes() Deprecated. |
java.util.Set<MediaType> | getSupportedTypes(ParseContext context) Deprecated. Returns the set of media types supported by this parser when used with the given parse context. |
void | parse(java.io.InputStream stream, org.xml.sax.ContentHandler handler, Metadata metadata, ParseContext context) Deprecated. Executes the configured external command and passes the given document stream as a simple XHTML document to the given SAX content handler. |
void | setCommand(java.lang.String command) Deprecated. |
void | setSupportedTypes(java.util.Set<MediaType> supportedTypes) Deprecated. |
Methods inherited from class org.apache.tika.parser.AbstractParser |
---|
parse |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public ExternalParser()
Method Detail |
---|
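public java.util.Set<MediaType> getSupportedTypes(ParseContext context)
Returns the set of media types supported by this parser when used with the given parse context.
Specified by: getSupportedTypes in interface Parser
Parameters:
context - parse context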
public java.util.Set<MediaType> getSupportedTypes()
public void setSupportedTypes(java.util.Set<MediaType> supportedTypes)
public java.lang.String getCommand()
public void setCommand(java.lang.String command)
public void parse(java.io.InputStream stream, org.xml.sax.ContentHandler handler, Metadata metadata, ParseContext context) throws java.io.IOException, org.xml.sax.SAXException, TikaException
Parameters:
stream - the document stream (input)
handler - handler for the XHTML SAX events (output)
metadata - document metadata (input and output)
context - parse context
Throws:
java.io.IOException - if the document stream could not be read
org.xml.sax.SAXException - if the SAX events could not be processed
TikaException - if the document could not be parsed
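
The general technique behind a parser of this kind (hand a document to an external program and collect the text it prints) can be sketched with the standard library alone. The following is an illustration under stated assumptions, not Tika's implementation: the class name ExternalTextSketch and the method extractText are hypothetical, the command is assumed to read the document on standard input and write plain text to standard output, and the output is assumed to be UTF-8. Converting the text into XHTML SAX events, as parse does, is left out.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.util.List;

// Hypothetical helper for illustration only; it is not part of Tika.
class ExternalTextSketch {

    // Runs the command, pipes the document to its stdin, and returns its stdout.
    // Assumption: the command reads stdin and writes UTF-8 text to stdout.
    static String extractText(List<String> command, InputStream document)
            throws IOException, InterruptedException {
        ProcessBuilder builder = new ProcessBuilder(command);
        // Discard stderr so an unread error pipe cannot block the child process.
        builder.redirectError(ProcessBuilder.Redirect.DISCARD);
        Process process = builder.start();

        // Feed the document on a separate thread: writing and reading on the
        // same thread can deadlock once either pipe buffer fills up.
        Thread feeder = new Thread(() -> {
            try (OutputStream stdin = process.getOutputStream()) {
                document.transferTo(stdin);
            } catch (IOException e) {
                // The command may exit before consuming all input;
                // this sketch relies on the exit status check below instead.
            }
        });
        feeder.start();

        String text;
        try (InputStream stdout = process.getInputStream()) {
            text = new String(stdout.readAllBytes(), StandardCharsets.UTF_8);
        }
        feeder.join();

        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("Command exited with status " + exitCode);
        }
        return text;
    }

    // Usage: java ExternalTextSketch.java <command> [args...] < document
    public static void main(String[] args) throws Exception {
        if (args.length == 0) {
            System.err.println("usage: ExternalTextSketch <command> [args...]");
            return;
        }
        System.out.print(extractText(List.of(args), System.in));
    }
}
```

Passing the command as a list of arguments rather than a single string avoids shell quoting issues. Whether a given tool (such as catdoc) accepts a document on standard input depends on that tool and should be checked against its own documentation.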