java.lang.Object
  org.apache.tika.parser.CompositeParser
public class CompositeParser
Composite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document. A fallback parser is defined for cases where a parser for the given content type is not available.
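The delegation described above can be illustrated with a minimal, self-contained sketch. It does not use Tika's classes: `MiniParser`, `MiniCompositeParser`, the `"Content-Type"` metadata key and the plain `Map` used for metadata are hypothetical stand-ins chosen so the example runs with the JDK alone. It only shows the general idea of looking up a component parser by declared content type and using a fallback when no entry matches.

```java
import java.util.HashMap;
import java.util.Map;

class DelegationSketch {

    /** Hypothetical stand-in for a parser; NOT Tika's Parser interface. */
    interface MiniParser {
        String parse(String document, Map<String, String> metadata);
    }

    /** Picks a component parser by declared content type, else the fallback. */
    static class MiniCompositeParser implements MiniParser {
        private Map<String, MiniParser> parsers = new HashMap<>();
        private MiniParser fallback = (doc, meta) -> "";

        void setParsers(Map<String, MiniParser> parsers) {
            this.parsers = parsers;
        }

        void setFallback(MiniParser fallback) {
            this.fallback = fallback;
        }

        /** Exact lookup on the declared type; assumed key name "Content-Type". */
        MiniParser getParser(Map<String, String> metadata) {
            String type = metadata.get("Content-Type");
            MiniParser parser = (type == null) ? null : parsers.get(type);
            return (parser != null) ? parser : fallback;
        }

        @Override
        public String parse(String document, Map<String, String> metadata) {
            // Delegate the whole call to whichever parser was resolved.
            return getParser(metadata).parse(document, metadata);
        }
    }

    /** Builds a small composite and parses "hello" with the given declared type. */
    static String demo(String contentType) {
        Map<String, MiniParser> parsers = new HashMap<>();
        parsers.put("text/plain", (doc, meta) -> "plain:" + doc);
        parsers.put("text/html", (doc, meta) -> "html:" + doc);

        MiniCompositeParser composite = new MiniCompositeParser();
        composite.setParsers(parsers);
        composite.setFallback((doc, meta) -> "fallback:" + doc);

        Map<String, String> metadata = new HashMap<>();
        if (contentType != null) {
            metadata.put("Content-Type", contentType);
        }
        return composite.parse("hello", metadata);
    }

    public static void main(String[] args) {
        System.out.println(demo("text/plain"));      // plain:hello
        System.out.println(demo("application/pdf")); // fallback:hello
    }
}
```

In this sketch a document without a declared type, or with a type that has no registered parser, is handled by the fallback.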
Constructor Summary |
---|
CompositeParser() |
Method Summary | |
---|---|
Parser | getFallback() Returns the fallback parser. |
protected Parser | getParser(Metadata metadata) Returns the parser that best matches the given metadata. |
java.util.Map<java.lang.String,Parser> | getParsers() Returns the component parsers. |
void | parse(java.io.InputStream stream, org.xml.sax.ContentHandler handler, Metadata metadata) Deprecated. This method will be removed in Apache Tika 1.0. |
void | parse(java.io.InputStream stream, org.xml.sax.ContentHandler handler, Metadata metadata, ParseContext context) Delegates the call to the matching component parser. |
void | setFallback(Parser fallback) Sets the fallback parser. |
void | setParsers(java.util.Map<java.lang.String,Parser> parsers) Sets the component parsers. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public CompositeParser()
Method Detail |
---|
public java.util.Map<java.lang.String,Parser> getParsers()
Returns the component parsers.

public void setParsers(java.util.Map<java.lang.String,Parser> parsers)
Sets the component parsers.
Parameters:
parsers - component parsers, keyed by media type

public Parser getFallback()
Returns the fallback parser.

public void setFallback(Parser fallback)
Sets the fallback parser.
Parameters:
fallback - fallback parser

protected Parser getParser(Metadata metadata)
Returns the parser that best matches the given metadata. Subclasses can override this method to provide more accurate parser resolution.
Parameters:
metadata - document metadata
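As an illustration of what "more accurate parser resolution" in an overriding subclass might look like, the sketch below normalises the declared media type (drops parameters such as `charset` and lowercases the name) before performing the lookup. All types here are hypothetical stand-ins rather than Tika classes: `BaseResolver` plays the role of the class being extended, metadata is a plain `Map`, and the `"Content-Type"` key is an assumption made for the example.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

class ResolutionSketch {

    /** Hypothetical stand-in for a component parser; NOT Tika's Parser. */
    interface MiniParser {
        String name();
    }

    /** Base resolver: exact lookup on the declared content type, else fallback. */
    static class BaseResolver {
        protected final Map<String, MiniParser> parsers = new HashMap<>();
        protected MiniParser fallback = () -> "fallback";

        protected MiniParser getParser(Map<String, String> metadata) {
            MiniParser parser = parsers.get(metadata.get("Content-Type"));
            return (parser != null) ? parser : fallback;
        }
    }

    /** Subclass that normalises the media type before the base lookup runs. */
    static class NormalizingResolver extends BaseResolver {
        @Override
        protected MiniParser getParser(Map<String, String> metadata) {
            String type = metadata.get("Content-Type");
            if (type == null) {
                return fallback;
            }
            // Drop parameters such as "; charset=UTF-8".
            int semicolon = type.indexOf(';');
            if (semicolon >= 0) {
                type = type.substring(0, semicolon);
            }
            // Look up a copy so the caller's metadata is left untouched.
            Map<String, String> normalized = new HashMap<>(metadata);
            normalized.put("Content-Type", type.trim().toLowerCase(Locale.ROOT));
            return super.getParser(normalized);
        }
    }

    /** Registers one parser for text/html and resolves the given declared type. */
    static String resolve(BaseResolver resolver, String contentType) {
        resolver.parsers.put("text/html", () -> "html");
        Map<String, String> metadata = new HashMap<>();
        metadata.put("Content-Type", contentType);
        return resolver.getParser(metadata).name();
    }

    public static void main(String[] args) {
        String declared = "Text/HTML; charset=UTF-8";
        System.out.println(resolve(new BaseResolver(), declared));        // fallback
        System.out.println(resolve(new NormalizingResolver(), declared)); // html
    }
}
```

The base lookup in this sketch misses the entry because the declared string is not an exact key, while the overriding version resolves it after normalisation.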
public void parse(java.io.InputStream stream, org.xml.sax.ContentHandler handler, Metadata metadata, ParseContext context) throws java.io.IOException, org.xml.sax.SAXException, TikaException
Delegates the call to the matching component parser. Potential RuntimeExceptions, IOExceptions and SAXExceptions unrelated to the given input stream and content handler are automatically wrapped into TikaExceptions to better honor the Parser contract.
Specified by:
parse in interface Parser
Parameters:
stream - the document stream (input)
handler - handler for the XHTML SAX events (output)
metadata - document metadata (input and output)
context - parse context
Throws:
java.io.IOException - if the document stream could not be read
org.xml.sax.SAXException - if the SAX events could not be processed
TikaException - if the document could not be parsed

public void parse(java.io.InputStream stream, org.xml.sax.ContentHandler handler, Metadata metadata) throws java.io.IOException, org.xml.sax.SAXException, TikaException
Deprecated. This method will be removed in Apache Tika 1.0. Use the Parser#parse(InputStream, ContentHandler, Metadata, ParseContext) method instead in new code. Calls to this backwards compatibility method are forwarded to the new parse() method with an empty parse context.
Specified by:
parse in interface Parser
Throws:
java.io.IOException
org.xml.sax.SAXException
TikaException
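The exception handling described for the four-argument parse method (wrapping failures that are unrelated to the caller's stream, while letting genuine stream errors through) can be sketched as follows. This is one possible approach under stated assumptions, not a description of how this class is implemented: `ParseFailedException`, `TaggedIOException`, `TaggingStream` and `MiniParser` are hypothetical names, and only the IOException and RuntimeException cases are shown.

```java
import java.io.ByteArrayInputStream;
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;

class WrappingSketch {

    /** Hypothetical checked exception standing in for a parse failure. */
    static class ParseFailedException extends Exception {
        ParseFailedException(String message, Throwable cause) {
            super(message, cause);
        }
    }

    /** Marks an IOException as having come from a particular wrapped stream. */
    static class TaggedIOException extends IOException {
        final Object tag;

        TaggedIOException(IOException cause, Object tag) {
            super(cause.getMessage(), cause);
            this.tag = tag;
        }
    }

    /** Wraps a stream so that its read failures carry this wrapper as a tag. */
    static class TaggingStream extends FilterInputStream {
        TaggingStream(InputStream in) {
            super(in);
        }

        @Override
        public int read() throws IOException {
            try {
                return in.read();
            } catch (IOException e) {
                throw new TaggedIOException(e, this);
            }
        }

        @Override
        public int read(byte[] b, int off, int len) throws IOException {
            try {
                return in.read(b, off, len);
            } catch (IOException e) {
                throw new TaggedIOException(e, this);
            }
        }
    }

    /** Hypothetical stand-in for a component parser. */
    interface MiniParser {
        void parse(InputStream stream) throws IOException;
    }

    /** Delegates to the component and wraps failures unrelated to the stream. */
    static void parse(MiniParser component, InputStream stream)
            throws IOException, ParseFailedException {
        TaggingStream tagged = new TaggingStream(stream);
        try {
            component.parse(tagged);
        } catch (RuntimeException e) {
            throw new ParseFailedException("Unexpected RuntimeException from parser", e);
        } catch (TaggedIOException e) {
            if (e.tag == tagged) {
                throw (IOException) e.getCause(); // genuine stream failure
            }
            throw new ParseFailedException("IOException from another stream", e);
        } catch (IOException e) {
            throw new ParseFailedException("IOException not caused by the input stream", e);
        }
    }

    /** Reports how a call ended: "ok", "wrapped" or "io". */
    static String outcome(MiniParser component, InputStream stream) {
        try {
            parse(component, stream);
            return "ok";
        } catch (ParseFailedException e) {
            return "wrapped";
        } catch (IOException e) {
            return "io";
        }
    }

    /** Runs one of four named scenarios and returns its outcome label. */
    static String demo(String scenario) {
        InputStream healthy = new ByteArrayInputStream(new byte[] {1, 2, 3});
        InputStream broken = new InputStream() {
            @Override
            public int read() throws IOException {
                throw new IOException("disk error");
            }
        };
        switch (scenario) {
            case "clean":
                return outcome(s -> s.read(), healthy);
            case "parser-bug":
                return outcome(s -> { throw new IllegalStateException("bug"); }, healthy);
            case "parser-io":
                return outcome(s -> { throw new IOException("temp file missing"); }, healthy);
            case "stream-io":
                return outcome(s -> s.read(), broken);
            default:
                throw new IllegalArgumentException(scenario);
        }
    }

    public static void main(String[] args) {
        for (String s : new String[] {"clean", "parser-bug", "parser-io", "stream-io"}) {
            System.out.println(s + " -> " + demo(s));
        }
    }
}
```

In this sketch only the "stream-io" scenario surfaces as a plain IOException; the runtime failure and the unrelated IOException both arrive wrapped, which mirrors the contract stated in the method description.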