Package org.apache.tika.parser
Class ParserPostProcessor
- java.lang.Object
  - org.apache.tika.parser.ParserDecorator
    - org.apache.tika.parser.ParserPostProcessor

- All Implemented Interfaces:
- Serializable, Parser
 
public class ParserPostProcessor extends ParserDecorator

Parser decorator that post-processes the results from a decorated parser. The post-processing takes care of filling in the "fulltext", "summary", and "outlinks" metadata entries based on the full text content returned by the decorated parser.

- See Also:
- Serialized Form
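
The post-processing step described above can be pictured with a small JDK-only sketch. It is an illustration, not the class's actual implementation: the class name `PostProcessingSketch`, the 500-character summary cap, and the link pattern are all assumptions made for this example, and a plain `Map` stands in for Tika's `Metadata`.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class PostProcessingSketch {
    // Assumed cap on the summary length; the documentation does not state one.
    static final int SUMMARY_LENGTH = 500;

    // Simplified, hypothetical pattern for http, https, and ftp links.
    static final Pattern LINK = Pattern.compile("\\b(?:https?|ftp)://[^\\s<>\"]+");

    // Derives the three metadata entries from the extracted full text.
    static Map<String, List<String>> postProcess(String fullText) {
        Map<String, List<String>> metadata = new LinkedHashMap<>();

        // "fulltext": the complete text content.
        metadata.put("fulltext", Collections.singletonList(fullText));

        // "summary": a leading excerpt of the full text.
        int end = Math.min(fullText.length(), SUMMARY_LENGTH);
        metadata.put("summary", Collections.singletonList(fullText.substring(0, end)));

        // "outlinks": every link found in the full text, in document order.
        List<String> links = new ArrayList<>();
        Matcher matcher = LINK.matcher(fullText);
        while (matcher.find()) {
            links.add(matcher.group());
        }
        metadata.put("outlinks", links);

        return metadata;
    }

    public static void main(String[] args) {
        Map<String, List<String>> metadata =
                postProcess("See https://tika.apache.org/ for details.");
        System.out.println(metadata.get("outlinks"));
    }
}
```

Note that "outlinks" is multi-valued in this sketch, since a document may contain any number of links, while "fulltext" and "summary" each hold a single value.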
 
Constructor Summary
- ParserPostProcessor(Parser parser)
  Creates a post-processing decorator for the given parser.

Method Summary
- void parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context)
  Forwards the call to the delegated parser and post-processes the results as described above.

Methods inherited from class org.apache.tika.parser.ParserDecorator
- getDecorationName, getSupportedTypes, getWrappedParser, withFallbacks, withoutTypes, withTypes
 
Constructor Detail

ParserPostProcessor
public ParserPostProcessor(Parser parser)
Creates a post-processing decorator for the given parser.
- Parameters:
- parser - the parser to be decorated
 
 
Method Detail

parse
public void parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) throws IOException, SAXException, TikaException
Forwards the call to the delegated parser and post-processes the results as described above.
- Specified by:
- parse in interface Parser
- Overrides:
- parse in class ParserDecorator
- Parameters:
- stream - the document stream (input)
- handler - handler for the XHTML SAX events (output)
- metadata - document metadata (input and output)
- context - parse context
- Throws:
- IOException - if the document stream could not be read
- SAXException - if the SAX events could not be processed
- TikaException - if the document could not be parsed
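
As a usage illustration, the following sketch wraps a parser in the decorator and reads the entries it fills in. It assumes a Tika release that ships this class, with the Tika core library and a parser for the input type on the classpath; the class name `ParserPostProcessorExample`, the helper `parseWithPostProcessing`, and the choice of `AutoDetectParser` as the decorated parser are choices made for this example only.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.AutoDetectParser;
import org.apache.tika.parser.ParseContext;
import org.apache.tika.parser.Parser;
import org.apache.tika.parser.ParserPostProcessor;
import org.apache.tika.sax.BodyContentHandler;

public class ParserPostProcessorExample {
    // Hypothetical helper: parses the bytes through the decorator and
    // returns the metadata object that the decorator has filled in.
    static Metadata parseWithPostProcessing(byte[] document) throws Exception {
        // Decorate any Parser; AutoDetectParser is just one possible choice.
        Parser parser = new ParserPostProcessor(new AutoDetectParser());
        Metadata metadata = new Metadata();
        try (InputStream stream = new ByteArrayInputStream(document)) {
            // The handler still receives the XHTML SAX events as usual.
            parser.parse(stream, new BodyContentHandler(), metadata, new ParseContext());
        }
        return metadata;
    }

    public static void main(String[] args) throws Exception {
        byte[] text = "Apache Tika lives at https://tika.apache.org/"
                .getBytes(StandardCharsets.UTF_8);
        Metadata metadata = parseWithPostProcessing(text);

        // Entries named in the class description.
        System.out.println("summary: " + metadata.get("summary"));
        for (String link : metadata.getValues("outlinks")) {
            System.out.println("outlink: " + link);
        }
    }
}
```

Because `metadata` is both input and output, the same object passed to `parse` is the one to inspect afterwards.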
 
 