Package org.apache.tika.parser
Class ParserPostProcessor
- java.lang.Object
-
- org.apache.tika.parser.ParserDecorator
-
- org.apache.tika.parser.ParserPostProcessor
-
- All Implemented Interfaces:
Serializable,Parser
public class ParserPostProcessor extends ParserDecorator
Parser decorator that post-processes the results from a decorated parser. The post-processing takes care of filling in the "fulltext", "summary", and "outlinks" metadata entries based on the full text content returned by the decorated parser.- See Also:
- Serialized Form
-
-
Constructor Summary
Constructors Constructor Description ParserPostProcessor(Parser parser)Creates a post-processing decorator for the given parser.
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description voidparse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context)Forwards the call to the delegated parser and post-processes the results as described above.-
Methods inherited from class org.apache.tika.parser.ParserDecorator
getDecorationName, getSupportedTypes, getWrappedParser, withFallbacks, withoutTypes, withTypes
-
-
-
-
Constructor Detail
-
ParserPostProcessor
public ParserPostProcessor(Parser parser)
Creates a post-processing decorator for the given parser.- Parameters:
parser- the parser to be decorated
-
-
Method Detail
-
parse
public void parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) throws IOException, SAXException, TikaException
Forwards the call to the delegated parser and post-processes the results as described above.- Specified by:
parsein interfaceParser- Overrides:
parsein classParserDecorator- Parameters:
stream- the document stream (input)handler- handler for the XHTML SAX events (output)metadata- document metadata (input and output)context- parse context- Throws:
IOException- if the document stream could not be readSAXException- if the SAX events could not be processedTikaException- if the document could not be parsed
-
-