Package org.apache.tika.parser
Class ParserPostProcessor
java.lang.Object
org.apache.tika.parser.AbstractParser
org.apache.tika.parser.ParserDecorator
org.apache.tika.parser.ParserPostProcessor
- All Implemented Interfaces:
Serializable
,Parser
Parser decorator that post-processes the results from a decorated parser.
The post-processing takes care of filling in the "fulltext", "summary",
and "outlinks" metadata entries based on the full text content returned by
the decorated parser.
- See Also:
-
Constructor Summary
ConstructorDescriptionParserPostProcessor
(Parser parser) Creates a post-processing decorator for the given parser. -
Method Summary
Modifier and TypeMethodDescriptionvoid
parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Forwards the call to the delegated parser and post-processes the results as described above.Methods inherited from class org.apache.tika.parser.ParserDecorator
getDecorationName, getSupportedTypes, getWrappedParser, withFallbacks, withoutTypes, withTypes
Methods inherited from class org.apache.tika.parser.AbstractParser
parse
-
Constructor Details
-
ParserPostProcessor
Creates a post-processing decorator for the given parser.- Parameters:
parser
- the parser to be decorated
-
-
Method Details
-
parse
public void parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) throws IOException, SAXException, TikaException Forwards the call to the delegated parser and post-processes the results as described above.- Specified by:
parse
in interfaceParser
- Overrides:
parse
in classParserDecorator
- Parameters:
stream
- the document stream (input)handler
- handler for the XHTML SAX events (output)metadata
- document metadata (input and output)context
- parse context- Throws:
IOException
- if the document stream could not be readSAXException
- if the SAX events could not be processedTikaException
- if the document could not be parsed
-