Package org.apache.tika.parser
@Version("1.0.0")
package org.apache.tika.parser
Tika parsers.
Class descriptions:
Abstract base class for parsers that use the AutoDetectReader and need to use an EncodingDetector.
Abstract base class for parsers that call external processes.
Deprecated. For removal in 4.x.
Configuration for AutoDetectParser behavior.
Composite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document.
Decrypts the incoming document stream and delegates further parsing to another parser instance.
A composite parser based on all the Parser implementations available through the service provider mechanism.
Base class for parser implementations that want to delegate parts of the task of parsing an input document to another parser.
Dummy parser that always produces an empty XHTML document without even attempting to parse the given document stream.
Dummy parser that always throws a TikaException without even attempting to parse the given document stream.
Parse context.
Tika parser interface.
Decorator base class for the Parser interface.
A ParserDecorator that filters supported mime types.
Use this class to store exceptions, warnings and other information during the parse.
Parser decorator that post-processes the results from a decorated parser.
Marker class to indicate parsing intent in ParseContext.
Reader for the text content from a given binary stream.
Interface for providing a password to a Parser for handling Encrypted and Password Protected Documents.
This is a helper class that wraps a parser in a recursive handler.
Configuration for RegexCaptureParser.
A simple PasswordProvider that returns a configured password for all documents.
The RecursiveParserWrapper wraps the parser sent into the ParseContext and then uses that parser to store state (among many other things).
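
The composite parser described above delegates to a component parser chosen by the declared content type. The following is a minimal sketch of that delegation pattern only, using the JDK alone. It is not the Tika API: every name in it (SketchParser, SketchEmptyParser, SketchCompositeParser, CompositeSketchDemo) is hypothetical, and the simplified parse(contentType, input) signature is an assumption made for illustration. The fallback stands in for the "dummy parser that always produces an empty document" idea listed above.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a parser; NOT the Tika Parser interface.
interface SketchParser {
    String parse(String contentType, String input);
}

// Hypothetical dummy parser: always returns empty output without reading the input.
final class SketchEmptyParser implements SketchParser {
    @Override
    public String parse(String contentType, String input) {
        return "";
    }
}

// Hypothetical composite: looks up a component parser by declared content type
// and falls back to a default parser when no component is registered.
final class SketchCompositeParser implements SketchParser {
    private final Map<String, SketchParser> components = new HashMap<>();
    private final SketchParser fallback;

    SketchCompositeParser(SketchParser fallback) {
        this.fallback = fallback;
    }

    SketchCompositeParser register(String contentType, SketchParser parser) {
        components.put(contentType, parser);
        return this;
    }

    @Override
    public String parse(String contentType, String input) {
        SketchParser chosen = components.getOrDefault(contentType, fallback);
        return chosen.parse(contentType, input);
    }
}

final class CompositeSketchDemo {
    // Builds a composite with two toy component parsers (both illustrative).
    static SketchParser build() {
        return new SketchCompositeParser(new SketchEmptyParser())
                .register("text/plain", (type, in) -> in.trim())
                .register("text/csv", (type, in) -> in.replace(',', ' '));
    }

    public static void main(String[] args) {
        SketchParser parser = build();
        System.out.println(parser.parse("text/plain", "  hello  "));
        System.out.println(parser.parse("text/csv", "a,b,c"));
        // Unregistered type: the fallback returns an empty string.
        System.out.println(parser.parse("application/pdf", "%PDF").isEmpty());
    }
}
```

Keeping the fallback as an explicit constructor argument means an unknown content type is handled by a parser object rather than by a null check, which mirrors the role the dummy parsers play in the list above.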