|
||||||||||
PREV NEXT | FRAMES NO FRAMES |
Uses of Parser in org.apache.tika |
---|
Constructors in org.apache.tika with parameters of type Parser | |
---|---|
Tika(Detector detector,
Parser parser)
Creates a Tika facade using the given detector and parser instances. |
Uses of Parser in org.apache.tika.config |
---|
Methods in org.apache.tika.config that return Parser | |
---|---|
Parser |
TikaConfig.getParser()
Returns the configured parser instance. |
Parser |
TikaConfig.getParser(MediaType mimeType)
Deprecated. Use the TikaConfig.getParser() method instead |
Methods in org.apache.tika.config that return types with arguments of type Parser | |
---|---|
java.util.Map<MediaType,Parser> |
TikaConfig.getParsers()
Deprecated. Use the TikaConfig.getParser() method instead |
Methods in org.apache.tika.config with parameters of type Parser | |
---|---|
static TikaConfig |
TikaConfig.getDefaultConfig(Parser delegate)
Deprecated. This method will be removed in Apache Tika 1.0 |
Constructors in org.apache.tika.config with parameters of type Parser | |
---|---|
TikaConfig(org.w3c.dom.Document document,
Parser delegate)
Deprecated. This method will be removed in Apache Tika 1.0 |
|
TikaConfig(org.w3c.dom.Element element,
Parser delegate)
Deprecated. This method will be removed in Apache Tika 1.0 |
|
TikaConfig(java.io.InputStream stream,
Parser delegate)
Deprecated. This method will be removed in Apache Tika 1.0 |
Uses of Parser in org.apache.tika.extractor |
---|
Constructors in org.apache.tika.extractor with parameters of type Parser | |
---|---|
ParserContainerExtractor(Parser parser,
Detector detector)
|
Uses of Parser in org.apache.tika.fork |
---|
Classes in org.apache.tika.fork that implement Parser | |
---|---|
class |
ForkParser
|
Uses of Parser in org.apache.tika.gui |
---|
Constructors in org.apache.tika.gui with parameters of type Parser | |
---|---|
TikaGUI(Parser parser)
|
Uses of Parser in org.apache.tika.parser |
---|
Classes in org.apache.tika.parser that implement Parser | |
---|---|
class |
AutoDetectParser
|
class |
CompositeParser
Composite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document. |
class |
DefaultParser
A composite parser based on all the Parser implementations
available through the service provider mechanism . |
class |
DelegatingParser
Base class for parser implementations that want to delegate parts of the task of parsing an input document to another parser. |
class |
EmptyParser
Dummy parser that always produces an empty XHTML document without even attempting to parse the given document stream. |
class |
ErrorParser
Dummy parser that always throws a TikaException without even
attempting to parse the given document stream. |
class |
ExternalParser
Parser that uses an external program (like catdoc or pdf2txt) to extract text content from a given document. |
class |
ParserDecorator
Decorator base class for the Parser interface. |
class |
ParserPostProcessor
Parser decorator that post-processes the results from a decorated parser. |
Methods in org.apache.tika.parser that return Parser | |
---|---|
protected Parser |
DelegatingParser.getDelegateParser(ParseContext context)
Returns the parser instance to which parsing tasks should be delegated. |
Parser |
CompositeParser.getFallback()
Returns the fallback parser. |
protected Parser |
CompositeParser.getParser(Metadata metadata)
Returns the parser that best matches the given metadata. |
protected Parser |
CompositeParser.getParser(Metadata metadata,
ParseContext context)
|
Parser |
ParserDecorator.getWrappedParser()
Gets the parser wrapped by this ParserDecorator |
static Parser |
ParserDecorator.withTypes(Parser parser,
java.util.Set<MediaType> types)
Decorates the given parser so that it always claims to support parsing of the given media types. |
Methods in org.apache.tika.parser that return types with arguments of type Parser | |
---|---|
java.util.Map<MediaType,Parser> |
CompositeParser.getParsers()
Returns the component parsers. |
java.util.Map<MediaType,Parser> |
CompositeParser.getParsers(ParseContext context)
|
Methods in org.apache.tika.parser with parameters of type Parser | |
---|---|
void |
CompositeParser.setFallback(Parser fallback)
Sets the fallback parser. |
static Parser |
ParserDecorator.withTypes(Parser parser,
java.util.Set<MediaType> types)
Decorates the given parser so that it always claims to support parsing of the given media types. |
Method parameters in org.apache.tika.parser with type arguments of type Parser | |
---|---|
void |
CompositeParser.setParsers(java.util.Map<MediaType,Parser> parsers)
Sets the component parsers. |
Constructors in org.apache.tika.parser with parameters of type Parser | |
---|---|
AutoDetectParser(Detector detector,
Parser... parsers)
|
|
AutoDetectParser(Parser... parsers)
Creates an auto-detecting parser instance using the specified set of parser. |
|
CompositeParser(MediaTypeRegistry registry,
Parser... parsers)
|
|
ParserDecorator(Parser parser)
Creates a decorator for the given parser. |
|
ParserPostProcessor(Parser parser)
Creates a post-processing decorator for the given parser. |
|
ParsingReader(Parser parser,
java.io.InputStream stream,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0 |
|
ParsingReader(Parser parser,
java.io.InputStream stream,
Metadata metadata,
java.util.concurrent.Executor executor)
Deprecated. This method will be removed in Apache Tika 1.0 |
|
ParsingReader(Parser parser,
java.io.InputStream stream,
Metadata metadata,
ParseContext context)
Creates a reader for the text content of the given binary stream with the given document metadata. |
|
ParsingReader(Parser parser,
java.io.InputStream stream,
Metadata metadata,
ParseContext context,
java.util.concurrent.Executor executor)
Creates a reader for the text content of the given binary stream with the given document metadata. |
Constructor parameters in org.apache.tika.parser with type arguments of type Parser | |
---|---|
CompositeParser(MediaTypeRegistry registry,
java.util.List<Parser> parsers)
|
Uses of Parser in org.apache.tika.parser.asm |
---|
Classes in org.apache.tika.parser.asm that implement Parser | |
---|---|
class |
ClassParser
Parser for Java .class files. |
Uses of Parser in org.apache.tika.parser.audio |
---|
Classes in org.apache.tika.parser.audio that implement Parser | |
---|---|
class |
AudioParser
|
class |
MidiParser
|
Uses of Parser in org.apache.tika.parser.dwg |
---|
Classes in org.apache.tika.parser.dwg that implement Parser | |
---|---|
class |
DWGParser
DWG (CAD Drawing) parser. |
Uses of Parser in org.apache.tika.parser.epub |
---|
Classes in org.apache.tika.parser.epub that implement Parser | |
---|---|
class |
EpubContentParser
Parser for EPUB OPS *.html files. |
class |
EpubParser
Epub parser |
Methods in org.apache.tika.parser.epub that return Parser | |
---|---|
Parser |
EpubParser.getContentParser()
|
Parser |
EpubParser.getMetaParser()
|
Methods in org.apache.tika.parser.epub with parameters of type Parser | |
---|---|
void |
EpubParser.setContentParser(Parser content)
|
void |
EpubParser.setMetaParser(Parser meta)
|
Uses of Parser in org.apache.tika.parser.feed |
---|
Classes in org.apache.tika.parser.feed that implement Parser | |
---|---|
class |
FeedParser
Feed parser. |
Uses of Parser in org.apache.tika.parser.font |
---|
Classes in org.apache.tika.parser.font that implement Parser | |
---|---|
class |
TrueTypeParser
Parser for TrueType font files (TTF). |
Uses of Parser in org.apache.tika.parser.hdf |
---|
Classes in org.apache.tika.parser.hdf that implement Parser | |
---|---|
class |
HDFParser
Since the NetCDFParser depends on the NetCDF-Java API,
we are able to use it to parse HDF files as well. |
Uses of Parser in org.apache.tika.parser.html |
---|
Classes in org.apache.tika.parser.html that implement Parser | |
---|---|
class |
HtmlParser
HTML parser. |
Uses of Parser in org.apache.tika.parser.image |
---|
Classes in org.apache.tika.parser.image that implement Parser | |
---|---|
class |
ImageParser
|
class |
TiffParser
|
Uses of Parser in org.apache.tika.parser.iwork |
---|
Classes in org.apache.tika.parser.iwork that implement Parser | |
---|---|
class |
IWorkPackageParser
A parser for the IWork container files. |
class |
IWorkParser
A parser for the IWork formats. |
Uses of Parser in org.apache.tika.parser.jpeg |
---|
Classes in org.apache.tika.parser.jpeg that implement Parser | |
---|---|
class |
JpegParser
|
Uses of Parser in org.apache.tika.parser.mbox |
---|
Classes in org.apache.tika.parser.mbox that implement Parser | |
---|---|
class |
MboxParser
Mbox (mailbox) parser. |
Uses of Parser in org.apache.tika.parser.microsoft |
---|
Classes in org.apache.tika.parser.microsoft that implement Parser | |
---|---|
class |
OfficeParser
Defines a Microsoft document content extractor. |
Uses of Parser in org.apache.tika.parser.microsoft.ooxml |
---|
Classes in org.apache.tika.parser.microsoft.ooxml that implement Parser | |
---|---|
class |
OOXMLParser
Office Open XML (OOXML) parser. |
Uses of Parser in org.apache.tika.parser.mp3 |
---|
Classes in org.apache.tika.parser.mp3 that implement Parser | |
---|---|
class |
Mp3Parser
The Mp3Parser is used to parse ID3 Version 1 Tag information
from an MP3 file, if available. |
Uses of Parser in org.apache.tika.parser.netcdf |
---|
Classes in org.apache.tika.parser.netcdf that implement Parser | |
---|---|
class |
NetCDFParser
A Parser for NetCDF
files using the UCAR, MIT-licensed NetCDF for Java
API. |
Uses of Parser in org.apache.tika.parser.odf |
---|
Classes in org.apache.tika.parser.odf that implement Parser | |
---|---|
class |
OpenDocumentContentParser
Parser for ODF content.xml files. |
class |
OpenDocumentMetaParser
Parser for OpenDocument meta.xml files. |
class |
OpenDocumentParser
OpenOffice parser |
Methods in org.apache.tika.parser.odf that return Parser | |
---|---|
Parser |
OpenDocumentParser.getContentParser()
|
Parser |
OpenDocumentParser.getMetaParser()
|
Methods in org.apache.tika.parser.odf with parameters of type Parser | |
---|---|
void |
OpenDocumentParser.setContentParser(Parser content)
|
void |
OpenDocumentParser.setMetaParser(Parser meta)
|
Uses of Parser in org.apache.tika.parser.opendocument |
---|
Classes in org.apache.tika.parser.opendocument that implement Parser | |
---|---|
class |
OpenOfficeParser
Deprecated. Use the OpenDocumentParser class instead.
This class will be removed in Apache Tika 1.0. |
Uses of Parser in org.apache.tika.parser.pdf |
---|
Classes in org.apache.tika.parser.pdf that implement Parser | |
---|---|
class |
PDFParser
PDF parser. |
Uses of Parser in org.apache.tika.parser.pkg |
---|
Classes in org.apache.tika.parser.pkg that implement Parser | |
---|---|
class |
PackageParser
Parser for various packaging and compression formats. |
Uses of Parser in org.apache.tika.parser.rtf |
---|
Classes in org.apache.tika.parser.rtf that implement Parser | |
---|---|
class |
RTFParser
RTF parser |
Uses of Parser in org.apache.tika.parser.txt |
---|
Classes in org.apache.tika.parser.txt that implement Parser | |
---|---|
class |
TXTParser
Plain text parser. |
Uses of Parser in org.apache.tika.parser.video |
---|
Classes in org.apache.tika.parser.video that implement Parser | |
---|---|
class |
FLVParser
Parser for metadata contained in Flash Videos (.flv). |
Uses of Parser in org.apache.tika.parser.xml |
---|
Classes in org.apache.tika.parser.xml that implement Parser | |
---|---|
class |
DcXMLParser
Dublin Core metadata parser |
class |
XMLParser
XML parser. |
Uses of Parser in org.apache.tika.utils |
---|
Methods in org.apache.tika.utils that return Parser | |
---|---|
static Parser |
ParseUtils.getParser(java.io.File documentFile,
TikaConfig config)
Returns a parser that can handle the specified MIME type, and is set to receive input from a stream opened from the specified URL. |
static Parser |
ParseUtils.getParser(java.lang.String mimeType,
TikaConfig config)
Returns a parser that can handle the specified MIME type, and is set to receive input from a stream opened from the specified URL. |
static Parser |
ParseUtils.getParser(java.net.URL documentUrl,
TikaConfig config)
Returns a parser that can handle the specified MIME type, and is set to receive input from a stream opened from the specified URL. |
|
||||||||||
PREV NEXT | FRAMES NO FRAMES |