Package org.apache.tika.parser
@Version("1.0.0")
package org.apache.tika.parser
Tika parsers.
Class descriptions:
- Abstract base class for parsers that use the AutoDetectReader and need to use the EncodingDetector configured by TikaConfig
- Abstract base class for parsers that call external processes.
- Deprecated. For removal in 4.x.
- This config object can be used to tune how conservative we want to be when parsing data that is extremely compressible and resembles a ZIP bomb.
- Factory for an AutoDetectParser
- Composite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document.
- Decrypts the incoming document stream and delegates further parsing to another parser instance.
- A composite parser based on all the Parser implementations available through the service provider mechanism.
- Base class for parser implementations that want to delegate parts of the task of parsing an input document to another parser.
- Interface for digester.
- This is used in AutoDetectParserConfig to (optionally) wrap the parser in a digesting parser.
- Encodes byte array from a MessageDigest to String
- Dummy parser that always produces an empty XHTML document without even attempting to parse the given document stream.
- Dummy parser that always throws a TikaException without even attempting to parse the given document stream.
- Parse context.
- Tika parser interface.
- Decorator base class for the Parser interface.
- Use this class to store exceptions, warnings and other information during the parse.
- Parser decorator that post-processes the results from a decorated parser.
- Reader for the text content from a given binary stream.
- Interface for providing a password to a Parser for handling Encrypted and Password Protected Documents.
- This is a helper class that wraps a parser in a recursive handler.
- The RecursiveParserWrapper wraps the parser sent into the parsecontext and then uses that parser to store state (among many other things).
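
The composite and dummy parsers described above follow a common delegation pattern: look up a component parser by the declared content type, and fall back to a parser that does nothing. The sketch below illustrates that idea only. It does not use the Tika API: `SimpleParser`, `SimpleCompositeParser`, `EMPTY` and `demoParser` are hypothetical names invented for illustration, and the real `Parser` interface has a different signature.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

public class CompositeSketch {

    /** Hypothetical stand-in for a parser; NOT the real Tika Parser signature. */
    interface SimpleParser {
        String parse(byte[] data);
    }

    /** Mirrors the "dummy parser" idea: returns empty output without reading the input. */
    static final SimpleParser EMPTY = data -> "";

    /** Delegates to a component parser chosen by the declared content type. */
    static final class SimpleCompositeParser {
        private final Map<String, SimpleParser> byType = new LinkedHashMap<>();
        private final SimpleParser fallback;

        SimpleCompositeParser(SimpleParser fallback) {
            this.fallback = fallback;
        }

        /** Registers a component parser for one content type; returns this for chaining. */
        SimpleCompositeParser register(String contentType, SimpleParser parser) {
            byType.put(contentType, parser);
            return this;
        }

        /** Uses the registered parser for the declared type, or the fallback if none matches. */
        String parse(String declaredType, byte[] data) {
            return byType.getOrDefault(declaredType, fallback).parse(data);
        }
    }

    /** Builds a small composite with two illustrative component parsers. */
    static SimpleCompositeParser demoParser() {
        return new SimpleCompositeParser(EMPTY)
            .register("text/plain", d -> new String(d, StandardCharsets.UTF_8))
            .register("text/x-upper",
                d -> new String(d, StandardCharsets.UTF_8).toUpperCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        SimpleCompositeParser parser = demoParser();
        byte[] bytes = "hello".getBytes(StandardCharsets.UTF_8);
        System.out.println(parser.parse("text/plain", bytes));
        System.out.println(parser.parse("text/x-upper", bytes));
        System.out.println(parser.parse("application/unknown", bytes).isEmpty());
    }
}
```

Keeping the fallback explicit means an unrecognized content type yields empty output rather than an error; a fallback that throws would instead mirror the exception-throwing dummy parser described above.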