Class Summary |
AutoDetectParser |
|
CompositeParser |
Composite parser that delegates parsing tasks to a component parser
based on the declared content type of the incoming document. |
DelegatingParser |
Base class for parser implementations that want to delegate parts of the
task of parsing an input document to another parser. |
EmptyParser |
Dummy parser that always produces an empty XHTML document without even
attempting to parse the given document stream. |
ErrorParser |
Dummy parser that always throws a TikaException without even
attempting to parse the given document stream. |
ExternalParser |
Parser that uses an external program (like catdoc or pdf2txt) to extract
text content from a given document. |
ParseContext |
Parse context. Used to pass context information to Tika parsers. |
ParserDecorator |
Decorator base class for the Parser interface. |
ParserPostProcessor |
Parser decorator that post-processes the results from a decorated parser. |
ParsingReader |
Reader for the text content from a given binary stream. |
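
The delegation and decoration patterns summarized in the table above can be illustrated with a small, self-contained sketch. It does not use Tika's actual API: `SimpleParser`, `Composite`, `WhitespaceCollapsing`, `EMPTY` and `ERROR` are hypothetical stand-ins, and parsing is reduced to mapping an input string to an output string. The real classes work on input streams, SAX content handlers and metadata objects, so treat this only as an outline of how the roles of CompositeParser, EmptyParser, ErrorParser and ParserPostProcessor relate to each other.

```java
import java.util.HashMap;
import java.util.Map;

public class ParserPatternsSketch {

    /** Hypothetical, simplified parser contract (not Tika's Parser interface). */
    interface SimpleParser {
        String parse(String input, String contentType) throws Exception;
    }

    /** Always yields empty output without looking at the input. */
    static final SimpleParser EMPTY = (input, contentType) -> "";

    /** Always fails without looking at the input. */
    static final SimpleParser ERROR = (input, contentType) -> {
        throw new Exception("Parse error");
    };

    /** Delegates to a component parser chosen by the declared content type. */
    static class Composite implements SimpleParser {
        private final Map<String, SimpleParser> parsers = new HashMap<>();
        private final SimpleParser fallback;

        Composite(SimpleParser fallback) {
            this.fallback = fallback;
        }

        void register(String contentType, SimpleParser parser) {
            parsers.put(contentType, parser);
        }

        @Override
        public String parse(String input, String contentType) throws Exception {
            // Unregistered content types go to the fallback parser.
            return parsers.getOrDefault(contentType, fallback).parse(input, contentType);
        }
    }

    /** Decorator that post-processes the output of the wrapped parser. */
    static class WhitespaceCollapsing implements SimpleParser {
        private final SimpleParser delegate;

        WhitespaceCollapsing(SimpleParser delegate) {
            this.delegate = delegate;
        }

        @Override
        public String parse(String input, String contentType) throws Exception {
            // Run the decorated parser first, then normalize its result.
            return delegate.parse(input, contentType).trim().replaceAll("\\s+", " ");
        }
    }

    public static void main(String[] args) throws Exception {
        Composite composite = new Composite(EMPTY);
        composite.register("text/plain", (input, type) -> input);
        composite.register("application/x-broken", ERROR);

        SimpleParser parser = new WhitespaceCollapsing(composite);
        System.out.println(parser.parse("  hello   world ", "text/plain"));
        System.out.println(parser.parse("ignored", "application/unknown").isEmpty());
    }
}
```

In this sketch the fallback is the empty parser, so an unregistered content type yields empty output instead of an error. Passing `ERROR` as the fallback would make unknown types fail loudly.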