@Version("1.0.0")
Package org.apache.tika.parser
Tika parsers.
Interface Summary

DigestingParser.Digester
    Interface for digester.
DigestingParser.DigesterFactory
    This is used in AutoDetectParserConfig to (optionally) wrap the parser in a digesting parser.
DigestingParser.Encoder
    Encodes byte array from a MessageDigest to String.
Parser
    Tika parser interface.
PasswordProvider
    Interface for providing a password to a Parser for handling Encrypted and Password Protected Documents.
RenderingParser
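
The DigestingParser.Encoder entry describes turning the bytes produced by a MessageDigest into a String. A minimal sketch of that idea, using only the JDK, might look like the following. The names DigestEncoderSketch and HexDigestSketch are hypothetical stand-ins invented for illustration and are not Tika's own interfaces; the choice of lowercase hex is likewise an assumption, since this page does not say which encodings Tika provides.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical stand-in, NOT Tika's DigestingParser.Encoder. It only mirrors
// the idea of converting MessageDigest output into a String.
interface DigestEncoderSketch {
    String encode(byte[] digestBytes);
}

class HexDigestSketch {
    // Assumed encoding for this sketch: lowercase hex, two characters per byte.
    static final DigestEncoderSketch HEX = bytes -> {
        StringBuilder sb = new StringBuilder(bytes.length * 2);
        for (byte b : bytes) {
            // Mask to 0..255 so negative bytes are rendered as two hex digits.
            sb.append(String.format("%02x", b & 0xff));
        }
        return sb.toString();
    };

    // Digests the data with the named JDK algorithm and encodes the result.
    static String digest(String algorithm, byte[] data, DigestEncoderSketch encoder) {
        try {
            MessageDigest md = MessageDigest.getInstance(algorithm);
            return encoder.encode(md.digest(data));
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalArgumentException("Unknown digest algorithm: " + algorithm, e);
        }
    }

    public static void main(String[] args) {
        byte[] data = "hello".getBytes(StandardCharsets.UTF_8);
        System.out.println(digest("SHA-256", data, HEX));
    }
}
```

Keeping the encoder separate from the digest computation means the same digest bytes could be rendered differently (for example as Base64) by swapping in another implementation of the stand-in interface.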
Class Summary

AbstractEncodingDetectorParser
    Abstract base class for parsers that use the AutoDetectReader and need to use the EncodingDetector configured by TikaConfig.
AbstractExternalProcessParser
    Abstract base class for parsers that call external processes.
AbstractParser
    Deprecated, for removal in 4.x.
AutoDetectParser
AutoDetectParserConfig
    This config object can be used to tune how conservative we want to be when parsing data that is extremely compressible and resembles a ZIP bomb.
AutoDetectParserFactory
    Factory for an AutoDetectParser.
CompositeParser
    Composite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document.
CryptoParser
    Decrypts the incoming document stream and delegates further parsing to another parser instance.
DefaultParser
    A composite parser based on all the Parser implementations available through the service provider mechanism.
DelegatingParser
    Base class for parser implementations that want to delegate parts of the task of parsing an input document to another parser.
DigestingParser
EmptyParser
    Dummy parser that always produces an empty XHTML document without even attempting to parse the given document stream.
ErrorParser
    Dummy parser that always throws a TikaException without even attempting to parse the given document stream.
NetworkParser
ParseContext
    Parse context.
ParserDecorator
    Decorator base class for the Parser interface.
ParseRecord
    Use this class to store exceptions, warnings and other information during the parse.
ParserFactory
ParserPostProcessor
    Parser decorator that post-processes the results from a decorated parser.
ParsingReader
    Reader for the text content from a given binary stream.
RecursiveParserWrapper
    This is a helper class that wraps a parser in a recursive handler.
RegexCaptureParser
StatefulParser
    The RecursiveParserWrapper wraps the parser sent into the parsecontext and then uses that parser to store state (among many other things).
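
The CompositeParser description above (delegating to a component parser based on the declared content type) can be illustrated with a small, dependency-free sketch. MiniParser, MiniCompositeParser and MiniCompositeDemo are hypothetical names invented here; they do not reproduce the signatures of org.apache.tika.parser.Parser or CompositeParser. The fallback that returns an empty result is an assumption of this sketch, loosely echoing the EmptyParser entry.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a parser; NOT the org.apache.tika.parser.Parser interface.
interface MiniParser {
    String parse(String contentType, String content);
}

// Delegates to whichever component parser is registered for the declared content type.
class MiniCompositeParser implements MiniParser {
    private final Map<String, MiniParser> components = new HashMap<>();

    // Assumption of this sketch: unknown types yield an empty result.
    private final MiniParser fallback = (type, content) -> "";

    MiniCompositeParser register(String contentType, MiniParser parser) {
        components.put(contentType, parser);
        return this;
    }

    @Override
    public String parse(String contentType, String content) {
        MiniParser delegate = components.getOrDefault(contentType, fallback);
        return delegate.parse(contentType, content);
    }
}

class MiniCompositeDemo {
    // Builds a composite with two toy component parsers; the media types
    // are arbitrary examples chosen for the demo.
    static MiniCompositeParser build() {
        return new MiniCompositeParser()
                .register("text/plain", (type, content) -> content.trim())
                .register("text/x-upper", (type, content) -> content.toUpperCase());
    }

    public static void main(String[] args) {
        MiniCompositeParser parser = build();
        System.out.println(parser.parse("text/plain", "  hello  "));
        System.out.println(parser.parse("text/x-upper", "hello"));
        System.out.println(parser.parse("application/unknown", "hello").isEmpty());
    }
}
```

Because the composite implements the same stand-in interface as its components, a composite can itself be registered inside another composite, which is the usual benefit of this pattern.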