Package org.apache.tika.parser
Tika parsers.
-
Interface Summary Interface Description DigestingParser.Digester Interface for digester.DigestingParser.Encoder Encodes byte array from a MessageDigest to StringParser Tika parser interface.PasswordProvider Interface for providing a password to a Parser for handling Encrypted and Password Protected Documents. -
Class Summary Class Description AbstractEncodingDetectorParser Abstract base class for parsers that use the AutoDetectReader and need to use theEncodingDetector
configured byTikaConfig
AbstractParser Abstract base class for new parsers.AutoDetectParser AutoDetectParserFactory Factory for an AutoDetectParserCompositeParser Composite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document.CryptoParser Decrypts the incoming document stream and delegates further parsing to another parser instance.DefaultParser A composite parser based on all theParser
implementations available through theservice provider mechanism
.DelegatingParser Base class for parser implementations that want to delegate parts of the task of parsing an input document to another parser.DigestingParser EmptyParser Dummy parser that always produces an empty XHTML document without even attempting to parse the given document stream.ErrorParser Dummy parser that always throws aTikaException
without even attempting to parse the given document stream.NetworkParser ParseContext Parse context.ParserDecorator Decorator base class for theParser
interface.ParserFactory ParserPostProcessor Parser decorator that post-processes the results from a decorated parser.ParsingReader Reader for the text content from a given binary stream.RecursiveParserWrapper This is a helper class that wraps a parser in a recursive handler.