| Modifier and Type | Method and Description | 
|---|---|
| Parser | Tika. getParser()Returns the parser instance used by this facade. | 
| Constructor and Description | 
|---|
| Tika(Detector detector,
    Parser parser)Creates a Tika facade using the given detector and parser instances, but the default
 Translator. | 
| Tika(Detector detector,
    Parser parser,
    Translator translator)Creates a Tika facade using the given detector, parser, and translator instances. | 
| Modifier and Type | Method and Description | 
|---|---|
| Parser | DigestingAutoDetectParserFactory. getParser(TikaConfig config) | 
| abstract Parser | ParserFactory. getParser(TikaConfig config) | 
| Parser | AutoDetectParserFactory. getParser(TikaConfig config) | 
| Modifier and Type | Method and Description | 
|---|---|
| protected void | FileResourceConsumer. parse(String resourceId,
     Parser parser,
     InputStream is,
     ContentHandler handler,
     Metadata m,
     ParseContext parseContext)Utility method to handle logging equivalently among all
 implementing classes. | 
| Constructor and Description | 
|---|
| BasicTikaFSConsumer(ArrayBlockingQueue<FileResource> queue,
                   Parser parser,
                   ContentHandlerFactory contentHandlerFactory,
                   OutputStreamFactory fsOSFactory) | 
| RecursiveParserWrapperFSConsumer(ArrayBlockingQueue<FileResource> queue,
                                Parser parser,
                                ContentHandlerFactory contentHandlerFactory,
                                OutputStreamFactory fsOSFactory,
                                MetadataFilter metadataFilter) | 
| StreamOutRPWFSConsumer(ArrayBlockingQueue<FileResource> queue,
                      Parser parser,
                      ContentHandlerFactory contentHandlerFactory,
                      OutputStreamFactory fsOSFactory,
                      MetadataFilter metadataFilter) | 
| Modifier and Type | Method and Description | 
|---|---|
| Parser | TikaConfig. getParser()Returns the configured parser instance. | 
| Parser | TikaConfig. getParser(MediaType mimeType)Deprecated. 
 Use the  TikaConfig.getParser()method instead | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | DirListParserParses the output of /bin/ls and counts the number of files and the number of
 executables using Tika. | 
| class  | EncryptedPrescriptionParser | 
| class  | LanguageDetectingParser | 
| class  | PickBestTextEncodingParserDeprecated. 
 Currently not suitable for real use, more a demo / prototype! | 
| class  | PrescriptionParser | 
| Modifier and Type | Method and Description | 
|---|---|
| protected boolean | PickBestTextEncodingParser. parserCompleted(Parser parser,
               Metadata metadata,
               ContentHandler handler,
               ParseContext context,
               Exception exception)Deprecated.  | 
| protected void | PickBestTextEncodingParser. parserPrepare(Parser parser,
             Metadata metadata,
             ParseContext context)Deprecated.  | 
| Modifier and Type | Method and Description | 
|---|---|
| Parser | ParsingEmbeddedDocumentExtractor. getDelegatingParser() | 
| static Parser | EmbeddedDocumentUtil. getStatelessParser(ParseContext context)Utility function to get the Parser that was sent in to the
 ParseContext to handle embedded documents. | 
| static Parser | EmbeddedDocumentUtil. tryToFindExistingLeafParser(Class clazz,
                           ParseContext context)Tries to find an existing parser within the ParseContext. | 
| Constructor and Description | 
|---|
| ParserContainerExtractor(Parser parser,
                        Detector detector) | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | ForkParser | 
| Constructor and Description | 
|---|
| ForkParser(ClassLoader loader,
          Parser parser) | 
| Constructor and Description | 
|---|
| TikaGUI(Parser parser) | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AbstractEncodingDetectorParserAbstract base class for parsers that use the AutoDetectReader and need
 to use the  EncodingDetectorconfigured byTikaConfig | 
| class  | AbstractExternalProcessParserAbstract base class for parsers that call external processes. | 
| class  | AbstractParserAbstract base class for new parsers. | 
| class  | AutoDetectParser | 
| class  | CompositeParserComposite parser that delegates parsing tasks to a component parser
 based on the declared content type of the incoming document. | 
| class  | CryptoParserDecrypts the incoming document stream and delegates further parsing to
 another parser instance. | 
| class  | DefaultParserA composite parser based on all the  Parserimplementations
 available through theservice provider mechanism. | 
| class  | DelegatingParserBase class for parser implementations that want to delegate parts of the
 task of parsing an input document to another parser. | 
| class  | DigestingParser | 
| class  | EmptyParserDummy parser that always produces an empty XHTML document without even
 attempting to parse the given document stream. | 
| class  | ErrorParserDummy parser that always throws a  TikaExceptionwithout even
 attempting to parse the given document stream. | 
| class  | NetworkParser | 
| class  | ParserDecoratorDecorator base class for the  Parserinterface. | 
| class  | ParserPostProcessorParser decorator that post-processes the results from a decorated parser. | 
| class  | RecursiveParserWrapperThis is a helper class that wraps a parser in a recursive handler. | 
| class  | RegexCaptureParser | 
| class  | StatefulParserThe RecursiveParserWrapper wraps the parser sent
 into the parsecontext and then uses that parser
 to store state (among many other things). | 
| Modifier and Type | Method and Description | 
|---|---|
| abstract Parser | ParserFactory. build() | 
| Parser | AutoDetectParserFactory. build() | 
| protected Parser | DelegatingParser. getDelegateParser(ParseContext context)Returns the parser instance to which parsing tasks should be delegated. | 
| Parser | CompositeParser. getFallback()Returns the fallback parser. | 
| protected Parser | CompositeParser. getParser(Metadata metadata)Returns the parser that best matches the given metadata. | 
| protected Parser | CompositeParser. getParser(Metadata metadata,
         ParseContext context) | 
| Parser | ParserDecorator. getWrappedParser()Gets the parser wrapped by this ParserDecorator | 
| static Parser | ParserDecorator. withFallbacks(Collection<? extends Parser> parsers,
             Set<MediaType> types)Deprecated. 
 This has been replaced by  FallbackParser | 
| static Parser | ParserDecorator. withoutTypes(Parser parser,
            Set<MediaType> excludeTypes)Decorates the given parser so that it never claims to support
 parsing of the given media types, but will work for all others. | 
| static Parser | ParserDecorator. withTypes(Parser parser,
         Set<MediaType> types)Decorates the given parser so that it always claims to support
 parsing of the given media types. | 
| Modifier and Type | Method and Description | 
|---|---|
| Map<MediaType,List<Parser>> | CompositeParser. findDuplicateParsers(ParseContext context)Utility method that goes through all the component parsers and finds
 all media types for which more than one parser declares support. | 
| List<Parser> | CompositeParser. getAllComponentParsers()Returns all parsers registered with the Composite Parser,
 including ones which may not currently be active. | 
| List<Parser> | DefaultParser. getAllComponentParsers() | 
| Map<MediaType,Parser> | CompositeParser. getParsers()Returns the component parsers. | 
| Map<MediaType,Parser> | CompositeParser. getParsers(ParseContext context) | 
| Map<MediaType,Parser> | DefaultParser. getParsers(ParseContext context) | 
| Modifier and Type | Method and Description | 
|---|---|
| void | CompositeParser. setFallback(Parser fallback)Sets the fallback parser. | 
| static Parser | ParserDecorator. withoutTypes(Parser parser,
            Set<MediaType> excludeTypes)Decorates the given parser so that it never claims to support
 parsing of the given media types, but will work for all others. | 
| static Parser | ParserDecorator. withTypes(Parser parser,
         Set<MediaType> types)Decorates the given parser so that it always claims to support
 parsing of the given media types. | 
| Modifier and Type | Method and Description | 
|---|---|
| void | CompositeParser. setParsers(Map<MediaType,Parser> parsers)Sets the component parsers. | 
| static Parser | ParserDecorator. withFallbacks(Collection<? extends Parser> parsers,
             Set<MediaType> types)Deprecated. 
 This has been replaced by  FallbackParser | 
| Constructor and Description | 
|---|
| AutoDetectParser(Detector detector,
                Parser... parsers) | 
| AutoDetectParser(Parser... parsers)Creates an auto-detecting parser instance using the specified set of parser. | 
| CompositeParser(MediaTypeRegistry registry,
               Parser... parsers) | 
| DigestingParser(Parser parser,
               DigestingParser.Digester digester)Creates a decorator for the given parser. | 
| ParserDecorator(Parser parser)Creates a decorator for the given parser. | 
| ParserPostProcessor(Parser parser)Creates a post-processing decorator for the given parser. | 
| ParsingReader(Parser parser,
             InputStream stream,
             Metadata metadata,
             ParseContext context)Creates a reader for the text content of the given binary stream
 with the given document metadata. | 
| ParsingReader(Parser parser,
             InputStream stream,
             Metadata metadata,
             ParseContext context,
             Executor executor)Creates a reader for the text content of the given binary stream
 with the given document metadata. | 
| RecursiveParserWrapper(Parser wrappedParser)Initialize the wrapper with  RecursiveParserWrapper.catchEmbeddedExceptionsset
 totrueas default. | 
| RecursiveParserWrapper(Parser wrappedParser,
                      boolean catchEmbeddedExceptions) | 
| StatefulParser(Parser parser)Creates a decorator for the given parser. | 
| Constructor and Description | 
|---|
| CompositeParser(MediaTypeRegistry registry,
               List<Parser> parsers) | 
| CompositeParser(MediaTypeRegistry registry,
               List<Parser> parsers,
               Collection<Class<? extends Parser>> excludeParsers) | 
| CompositeParser(MediaTypeRegistry registry,
               List<Parser> parsers,
               Collection<Class<? extends Parser>> excludeParsers) | 
| DefaultParser(MediaTypeRegistry registry,
             ServiceLoader loader,
             Collection<Class<? extends Parser>> excludeParsers) | 
| DefaultParser(MediaTypeRegistry registry,
             ServiceLoader loader,
             Collection<Class<? extends Parser>> excludeParsers,
             EncodingDetector encodingDetector,
             Renderer renderer) | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AppleSingleFileParserParser that strips the header off of AppleSingle and AppleDouble
 files. | 
| class  | PListParserParser for Apple's plist and bplist. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | ClassParserParser for Java .class files. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AudioParser | 
| class  | MidiParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | SourceCodeParserGeneric Source code parser for Java, Groovy, C++. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | Pkcs7ParserBasic parser for PKCS7 data. | 
| class  | TSDParserTika parser for Time Stamped Data Envelope (application/timestamped-data) | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | TextAndCSVParserUnless the  TikaCoreProperties.CONTENT_TYPE_USER_OVERRIDEis set,
 this parser tries to assess whether the file is a text file, csv or tsv. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | CTAKESParserCTAKESParser decorates a  Parserand leverages onCTAKESContentHandlerto extract biomedical information from
 clinical text using Apache cTAKES. | 
| Constructor and Description | 
|---|
| CTAKESParser(Parser parser)Wraps the specified Parser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | DBFParserThis is a Tika wrapper around the DBFReader. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | DGN8ParserThis is a VERY LIMITED parser. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | DIFParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | DWGParserDWG (CAD Drawing) parser. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | EnviHeaderParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | EpubContentParserParser for EPUB OPS  *.htmlfiles. | 
| class  | EpubParserEpub parser | 
| Modifier and Type | Method and Description | 
|---|---|
| Parser | EpubParser. getContentParser() | 
| Parser | EpubParser. getMetaParser() | 
| Modifier and Type | Method and Description | 
|---|---|
| void | EpubParser. setContentParser(Parser content) | 
| void | EpubParser. setMetaParser(Parser meta) | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | ExecutableParserParser for executable files. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | CompositeExternalParserA Composite Parser that wraps up all the available External Parsers,
 and provides an easy way to access them. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | ExternalParserThis is a next generation external parser that uses some of the more
 recent additions to Tika. | 
| Modifier and Type | Method and Description | 
|---|---|
| Parser | ExternalParser. getOutputParser() | 
| Modifier and Type | Method and Description | 
|---|---|
| void | ExternalParser. setOutputParser(Parser parser)This parser is called on the output of the process. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | FeedParserFeed parser. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AdobeFontMetricParserParser for AFM Font Files | 
| class  | TrueTypeParserParser for TrueType font files (TTF). | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | GDALParserWraps execution of the Geospatial Data Abstraction
 Library (GDAL)  gdalinfotool used to extract geospatial
 information out of hundreds of geo file formats. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | GeoParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | GeographicInformationParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | GribParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | HDFParserSince the  NetCDFParserdepends on the NetCDF-Java API,
 we are able to use it to parse HDF files as well. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | HtmlParserHTML parser. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | HttpParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | HwpV5Parser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AbstractImageParser | 
| class  | BPGParserParser for the Better Portable Graphics (BPG) File Format. | 
| class  | HeifParser | 
| class  | ICNSParserA basic parser class for Apple ICNS icon files | 
| class  | ImageParser | 
| class  | JpegParser | 
| class  | JXLParserTries to scrape XMP out of JXL | 
| class  | PSDParserParser for the Adobe Photoshop PSD File Format. | 
| class  | TiffParser | 
| class  | WebPParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | IDMLParserAdobe InDesign IDML Parser. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | IptcAnpaParserParser for IPTC ANPA New Wire Feeds | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | ISArchiveParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | IWorkPackageParserA parser for the IWork container files. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | IWork13PackageParser | 
| class  | IWork18PackageParserFor now, this parser isn't even registered. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AbstractDBParserAbstract class that handles iterating through tables within a database. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | JournalParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | RFC822ParserUses apache-mime4j to parse emails. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | MatParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | MboxParserMbox (mailbox) parser. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AbstractOfficeParserIntermediate layer to set  OfficeParserConfiguniformly. | 
| class  | EMFParserExtracts files embedded in EMF and offers a
 very rough capability to extract text if there
 is text stored in the EMF. | 
| class  | JackcessParserParser that handles Microsoft Access files via
 Jackcess | 
| class  | MSOwnerFileParserParser for temporary MSOFfice files. | 
| class  | OfficeParserDefines a Microsoft document content extractor. | 
| class  | OldExcelParserA POI-powered Tika Parser for very old versions of Excel, from
 pre-OLE2 days, such as Excel 4. | 
| class  | TNEFParserA POI-powered Tika Parser for TNEF (Transport Neutral
 Encoding Format) messages, aka winmail.dat | 
| class  | WMFParserThis parser offers a very rough capability to extract text if there
 is text stored in the WMF files. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | ChmParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | OneNoteParserOneNote tika parser capable of parsing Microsoft OneNote files. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | OOXMLParserOffice Open XML (OOXML) parser. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | Word2006MLParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | OutlookPSTParserParser for MS Outlook PST email storage files | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | RTFParserRTF parser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AbstractXML2003Parser | 
| class  | SpreadsheetMLParserParses wordml 2003 format Excel files. | 
| class  | WordMLParserParses wordml 2003 format word files. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | MIFParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | Mp3ParserThe  Mp3Parseris used to parse ID3 Version 1 Tag information
 from an MP3 file, if available. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | MP4ParserParser for the MP4 media container format, as well as the older
 QuickTime format that MP4 is based on. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AbstractMultipleParserAbstract base class for parser wrappers which may / will
 process a given stream multiple times, merging the results
 of the various parsers used. | 
| class  | FallbackParserTries multiple parsers in turn, until one succeeds. | 
| class  | SupplementingParserRuns the input stream through all available parsers,
 merging the metadata from them based on the
  AbstractMultipleParser.MetadataPolicychosen. | 
| Modifier and Type | Method and Description | 
|---|---|
| List<Parser> | AbstractMultipleParser. getAllParsers() | 
| Modifier and Type | Method and Description | 
|---|---|
| protected boolean | FallbackParser. parserCompleted(Parser parser,
               Metadata metadata,
               ContentHandler handler,
               ParseContext context,
               Exception exception) | 
| protected abstract boolean | AbstractMultipleParser. parserCompleted(Parser parser,
               Metadata metadata,
               ContentHandler handler,
               ParseContext context,
               Exception exception)Used to notify implementations that a Parser has Finished
 or Failed, and to allow them to decide to continue or
 abort further parsing | 
| protected boolean | SupplementingParser. parserCompleted(Parser parser,
               Metadata metadata,
               ContentHandler handler,
               ParseContext context,
               Exception exception) | 
| protected void | AbstractMultipleParser. parserPrepare(Parser parser,
             Metadata metadata,
             ParseContext context)Used to allow implementations to prepare or change things
 before parsing occurs | 
| Constructor and Description | 
|---|
| AbstractMultipleParser(MediaTypeRegistry registry,
                      AbstractMultipleParser.MetadataPolicy policy,
                      Parser... parsers) | 
| FallbackParser(MediaTypeRegistry registry,
              AbstractMultipleParser.MetadataPolicy policy,
              Parser... parsers) | 
| SupplementingParser(MediaTypeRegistry registry,
                   AbstractMultipleParser.MetadataPolicy policy,
                   Parser... parsers) | 
| Constructor and Description | 
|---|
| AbstractMultipleParser(MediaTypeRegistry registry,
                      AbstractMultipleParser.MetadataPolicy policy,
                      Collection<? extends Parser> parsers) | 
| AbstractMultipleParser(MediaTypeRegistry registry,
                      Collection<? extends Parser> parsers,
                      Map<String,Param> params) | 
| FallbackParser(MediaTypeRegistry registry,
              AbstractMultipleParser.MetadataPolicy policy,
              Collection<? extends Parser> parsers) | 
| FallbackParser(MediaTypeRegistry registry,
              Collection<? extends Parser> parsers,
              Map<String,Param> params) | 
| SupplementingParser(MediaTypeRegistry registry,
                   AbstractMultipleParser.MetadataPolicy policy,
                   Collection<? extends Parser> parsers) | 
| SupplementingParser(MediaTypeRegistry registry,
                   Collection<? extends Parser> parsers,
                   Map<String,Param> params) | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | NamedEntityParserThis implementation of  Parserextracts
 entity names from text content and adds it to the metadata. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | NetCDFParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | TesseractOCRParserTesseractOCRParser powered by tesseract-ocr engine. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | FlatOpenDocumentParser | 
| class  | OpenDocumentContentParserParser for ODF  content.xmlfiles. | 
| class  | OpenDocumentMetaParserParser for OpenDocument  meta.xmlfiles. | 
| class  | OpenDocumentParserOpenOffice parser | 
| Modifier and Type | Method and Description | 
|---|---|
| Parser | OpenDocumentParser. getContentParser() | 
| Parser | OpenDocumentParser. getMetaParser() | 
| Modifier and Type | Method and Description | 
|---|---|
| void | OpenDocumentParser. setContentParser(Parser content) | 
| void | OpenDocumentParser. setMetaParser(Parser meta) | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | PDFParserPDF parser. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | CompressorParserParser for various compression formats. | 
| class  | PackageParserParser for various packaging formats. | 
| class  | RarParserParser for Rar files. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | PooledTimeSeriesParserUses the Pooled Time Series algorithm + command line tool, to
 generate a numeric representation of the video suitable for
 similarity searches. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | PRTParserA basic text extracting parser for the CADKey PRT (CAD Drawing)
 format. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AgeRecogniserParser for extracting features from text. | 
| class  | ObjectRecognitionParserThis parser recognises objects from Images. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | TensorflowImageRecParserThis is an implementation of  ObjectRecogniserpowered by
  Tensorflow 
 convolutional neural network (CNN). | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | SAS7BDATParserProcesses the SAS7BDAT data columnar database file used by SAS and
 other similar languages. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | SentimentAnalysisParserThis parser classifies documents based on the sentiment of document. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | SQLite3ParserThis is the main class for parsing SQLite3 files. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | Latin1StringsParserParser to extract printable Latin1 strings from arbitrary files with pure java
 without running any external process. | 
| class  | StringsParserParser that uses the "strings" (or strings-alternative) command to find the
 printable strings in a object, or other binary, file
 (application/octet-stream). | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | TMXParserParser for Translation Memory eXchange (TMX) files. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AmazonTranscribeAmazon Transcribe
 implementation. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | TXTParserPlain text parser. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | FLVParser
 Parser for metadata contained in Flash Videos (.flv). | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | WACZParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | WARCParser | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | QuattroProParserParser for Corel QuattroPro documents (part of Corel WordPerfect
 Office Suite). | 
| class  | WordPerfectParserParser for Corel WordPerfect documents. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | XLIFF12ParserParser for XLIFF 1.2 files. | 
| class  | XLZParserParser for XLZ Archives. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | DcXMLParserDublin Core metadata parser | 
| class  | FictionBookParser | 
| class  | TextAndAttributeXMLParser | 
| class  | XMLParserXML parser. | 
| class  | XMLProfiler | 
| Modifier and Type | Method and Description | 
|---|---|
| static Parser | TikaResource. createParser() | 
| Modifier and Type | Method and Description | 
|---|---|
| static void | TikaResource. fillMetadata(Parser parser,
            Metadata metadata,
            javax.ws.rs.core.MultivaluedMap<String,String> httpHeaders) | 
| static void | TikaResource. parse(Parser parser,
     org.slf4j.Logger logger,
     String path,
     InputStream inputStream,
     ContentHandler handler,
     Metadata metadata,
     ParseContext parseContext)Use this to call a parser and unify exception handling. | 
| Modifier and Type | Method and Description | 
|---|---|
| static String | ParserUtils. getParserClassname(Parser parser)Identifies the real class name of the  Parser, unwrapping
 anyParserDecoratordecorations on top of it. | 
| static void | ParserUtils. recordParserDetails(Parser parser,
                   Metadata metadata) | 
| static void | ParserUtils. recordParserFailure(Parser parser,
                   Throwable failure,
                   Metadata metadata) | 
Copyright © 2007–2022 The Apache Software Foundation. All rights reserved.