Uses of Interface
org.apache.tika.config.SelfConfiguring
Packages that use SelfConfiguring
Package
Description
Media type detection.
Media type information.
Tika parsers.
-
Uses of SelfConfiguring in org.apache.tika.detect
Subinterfaces of SelfConfiguring in org.apache.tika.detectModifier and TypeInterfaceDescriptioninterfaceContent type detector.interfaceCharacter encoding detector.interfaceMarker interface for encoding detectors that arbitrate among candidates collected by base detectors rather than detecting encoding directly from the stream.Classes in org.apache.tika.detect that implement SelfConfiguringModifier and TypeClassDescriptionclassEncoding detector that identifies the character set from a byte-order mark (BOM) at the start of the stream.classContent type detector that combines multiple different detection mechanisms.classA composite encoding detector that runs child detectors.classA composite detector that orchestrates the detection pipeline: MimeTypes (magic byte) detection Container and other detectors loaded via SPI TextDetector as fallback for unknown types Returns the most specific type detectedclassA composite encoding detector based on all theEncodingDetectorimplementations available through theservice provider mechanism.classA version ofDefaultDetectorfor probabilistic mime detectors, which use statistical techniques to blend the results of differing underlying detectors when attempting to detect the type of a given file.classDummy detector that returns application/octet-stream for all documents.classThis runs the linux 'file' command against a file.classContent type detection based on magic bytes, i.e. type-specific patterns near the beginning of the document input stream.classDetector for Matroska (MKV and WEBM) files based on the EBML header.classEncoding detector that extracts a declared charset from Tika metadata without reading any bytes from the stream.classContent type detection based on the resource name.classclassDeprecated.after 2.5.0 this functionality was moved to the CompositeDetectorclassAlways returns the charset passed in via the initializerclassContent type detection of plain text documents.classclassContent type detection based on a content type hint.classDetector to identify zero length files as application/x-zerovalue -
Uses of SelfConfiguring in org.apache.tika.detect.apple
Classes in org.apache.tika.detect.apple that implement SelfConfiguringModifier and TypeClassDescriptionclassDetector for BPList with utility functions for PList. -
Uses of SelfConfiguring in org.apache.tika.detect.gzip
Classes in org.apache.tika.detect.gzip that implement SelfConfiguringModifier and TypeClassDescriptionclassThis is designed to detect commonly gzipped file types such as warc.gz. -
Uses of SelfConfiguring in org.apache.tika.detect.magika
Classes in org.apache.tika.detect.magika that implement SelfConfiguringModifier and TypeClassDescriptionclassSimple wrapper around Google's magika: https://github.com/google/magika The tool must be installed on the host where Tika is running. -
Uses of SelfConfiguring in org.apache.tika.detect.microsoft
Classes in org.apache.tika.detect.microsoft that implement SelfConfiguringModifier and TypeClassDescriptionclassA detector that works on a POIFS OLE2 document to figure out exactly what the file is. -
Uses of SelfConfiguring in org.apache.tika.detect.ogg
Classes in org.apache.tika.detect.ogg that implement SelfConfiguringModifier and TypeClassDescriptionclassDetector for identifying specific file types stored within an Ogg container. -
Uses of SelfConfiguring in org.apache.tika.detect.ole
Classes in org.apache.tika.detect.ole that implement SelfConfiguringModifier and TypeClassDescriptionclassA detector that works on a POIFS OLE2 document to figure out exactly what the file is. -
Uses of SelfConfiguring in org.apache.tika.detect.siegfried
Classes in org.apache.tika.detect.siegfried that implement SelfConfiguringModifier and TypeClassDescriptionclassSimple wrapper around Siegfried https://github.com/richardlehane/siegfried The default behavior is to run detection, report the results in the metadata and then return null so that other detectors will be used. -
Uses of SelfConfiguring in org.apache.tika.detect.zip
Classes in org.apache.tika.detect.zip that implement SelfConfiguringModifier and TypeClassDescriptionclassThis class is designed to detect subtypes of zip-based file formats.classclassA zip container detector that uses only streaming detection, never opening the file as a ZipFile. -
Uses of SelfConfiguring in org.apache.tika.example
Classes in org.apache.tika.example that implement SelfConfiguringModifier and TypeClassDescriptionclassParses the output of /bin/ls and counts the number of files and the number of executables using Tika.classclassclassclassDeprecated.Currently not suitable for real use, more a demo / prototype!class -
Uses of SelfConfiguring in org.apache.tika.inference
Classes in org.apache.tika.inference that implement SelfConfiguringModifier and TypeClassDescriptionclassParser that sends images to a CLIP-like embedding endpoint (OpenAI-compatible/v1/embeddingswith image input) and stores the resulting vector in metadata. -
Uses of SelfConfiguring in org.apache.tika.langdetect.charsoup
Classes in org.apache.tika.langdetect.charsoup that implement SelfConfiguringModifier and TypeClassDescriptionclassCharSoup language detector using INT8-quantized multinomial logistic regression trained on Wikipedia (primary corpus) with MADLAD supplements for thin languages. -
Uses of SelfConfiguring in org.apache.tika.language.translate
Subinterfaces of SelfConfiguring in org.apache.tika.language.translateClasses in org.apache.tika.language.translate that implement SelfConfiguringModifier and TypeClassDescriptionclassA translator which picks the first availableTranslatorimplementations available through theservice provider mechanism.classDummy translator that always declines to give any text. -
Uses of SelfConfiguring in org.apache.tika.language.translate.impl
Classes in org.apache.tika.language.translate.impl that implement SelfConfiguringModifier and TypeClassDescriptionclassclassCachedTranslator.classAbstract class used to interact with command line/external Translators.classAn implementation of a REST client to the Google Translate v2 API.classThis translator is designed to work with a TCP-IP available Joshua translation server, specifically the REST-based Joshua server.classAn implementation of a REST client for the Premium MT API v1.classTranslator that uses the Marian NMT decoder for translation.classWrapper class to access the Windows translation service.classTranslator that uses the Moses decoder for translation.classThis translator is designed to work with a TCP-IP available RTG translation server, specifically the REST-based RTG server.classAn implementation of a REST client for the YANDEX Translate API. -
Uses of SelfConfiguring in org.apache.tika.mime
Classes in org.apache.tika.mime that implement SelfConfiguringModifier and TypeClassDescriptionfinal classThis class is a MimeType repository.classSelector for combining different mime detection results based on probability -
Uses of SelfConfiguring in org.apache.tika.ml.chardetect
Classes in org.apache.tika.ml.chardetect that implement SelfConfiguringModifier and TypeClassDescriptionclassNaive-Bayes pipeline detector: structural checks for wide Unicode + BOMs before falling through to the bigram NB classifier for everything else.classNaive-Bayes byte-bigram charset classifier.classUTF-16 specialist detector of the mixture-of-experts charset detection architecture. -
Uses of SelfConfiguring in org.apache.tika.ml.junkdetect
Classes in org.apache.tika.ml.junkdetect that implement SelfConfiguringModifier and TypeClassDescriptionclassAMetaEncodingDetectorthat arbitrates charset candidates by asking aTextQualityDetectorwhich decoded candidate looks most like natural text. -
Uses of SelfConfiguring in org.apache.tika.parser
Subinterfaces of SelfConfiguring in org.apache.tika.parserClasses in org.apache.tika.parser that implement SelfConfiguringModifier and TypeClassDescriptionclassAbstract base class for parsers that use the AutoDetectReader and need to use anEncodingDetector.classAbstract base class for parsers that call external processes.classDeprecated.for removal in 4.xclassclassComposite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document.classDecrypts the incoming document stream and delegates further parsing to another parser instance.classA composite parser based on all theParserimplementations available through theservice provider mechanism.classBase class for parser implementations that want to delegate parts of the task of parsing an input document to another parser.classDummy parser that always produces an empty XHTML document without even attempting to parse the given document stream.classDummy parser that always throws aTikaExceptionwithout even attempting to parse the given document stream.classclassDecorator base class for theParserinterface.static classA ParserDecorator that filters supported mime types.classParser decorator that post-processes the results from a decorated parser.classThis is a helper class that wraps a parser in a recursive handler.classclassThe RecursiveParserWrapper wraps the parser sent into the parsecontext and then uses that parser to store state (among many other things). -
Uses of SelfConfiguring in org.apache.tika.parser.apple
Classes in org.apache.tika.parser.apple that implement SelfConfiguringModifier and TypeClassDescriptionclassParser that strips the header off of AppleSingle and AppleDouble files.classParser for Apple's plist and bplist. -
Uses of SelfConfiguring in org.apache.tika.parser.asm
Classes in org.apache.tika.parser.asm that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.audio
Classes in org.apache.tika.parser.audio that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.code
Classes in org.apache.tika.parser.code that implement SelfConfiguringModifier and TypeClassDescriptionclassGeneric Source code parser for Java, Groovy, C++. -
Uses of SelfConfiguring in org.apache.tika.parser.crypto
Classes in org.apache.tika.parser.crypto that implement SelfConfiguringModifier and TypeClassDescriptionclassBasic parser for PKCS7 data.classTika parser for Time Stamped Data Envelope (application/timestamped-data) -
Uses of SelfConfiguring in org.apache.tika.parser.csv
Classes in org.apache.tika.parser.csv that implement SelfConfiguringModifier and TypeClassDescriptionclassUnless theTikaCoreProperties.CONTENT_TYPE_USER_OVERRIDEis set, this parser tries to assess whether the file is a text file, csv or tsv. -
Uses of SelfConfiguring in org.apache.tika.parser.ctakes
Classes in org.apache.tika.parser.ctakes that implement SelfConfiguringModifier and TypeClassDescriptionclassCTAKESParser decorates aParserand leverages onCTAKESContentHandlerto extract biomedical information from clinical text using Apache cTAKES. -
Uses of SelfConfiguring in org.apache.tika.parser.dbf
Classes in org.apache.tika.parser.dbf that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.dgn
Classes in org.apache.tika.parser.dgn that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.dif
Classes in org.apache.tika.parser.dif that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.dwg
Classes in org.apache.tika.parser.dwg that implement SelfConfiguringModifier and TypeClassDescriptionclassclassDWG (CAD Drawing) parser.classDWGReadParser (CAD Drawing) parser. -
Uses of SelfConfiguring in org.apache.tika.parser.envi
Classes in org.apache.tika.parser.envi that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.epub
Classes in org.apache.tika.parser.epub that implement SelfConfiguringModifier and TypeClassDescriptionclassParser for EPUB OPS*.htmlfiles.classEpub parserclassUse this to parse the .opf files -
Uses of SelfConfiguring in org.apache.tika.parser.executable
Classes in org.apache.tika.parser.executable that implement SelfConfiguringModifier and TypeClassDescriptionclassParser for executable files.classParser for universal executable files. -
Uses of SelfConfiguring in org.apache.tika.parser.external
Classes in org.apache.tika.parser.external that implement SelfConfiguringModifier and TypeClassDescriptionclassParser that uses an external program (like ffmpeg, exiftool or sox) to extract text content and metadata from a given document. -
Uses of SelfConfiguring in org.apache.tika.parser.feed
Classes in org.apache.tika.parser.feed that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.font
Classes in org.apache.tika.parser.font that implement SelfConfiguringModifier and TypeClassDescriptionclassParser for AFM Font FilesclassParser for TrueType font files (TTF). -
Uses of SelfConfiguring in org.apache.tika.parser.gdal
Classes in org.apache.tika.parser.gdal that implement SelfConfiguringModifier and TypeClassDescriptionclassWraps execution of the Geospatial Data Abstraction Library (GDAL)gdalinfotool used to extract geospatial information out of hundreds of geo file formats. -
Uses of SelfConfiguring in org.apache.tika.parser.geo.topic
Classes in org.apache.tika.parser.geo.topic that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.geoinfo
Classes in org.apache.tika.parser.geoinfo that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.geopkg
Classes in org.apache.tika.parser.geopkg that implement SelfConfiguringModifier and TypeClassDescriptionclassCustomization of sqlite parser to skip certain common blob columns. -
Uses of SelfConfiguring in org.apache.tika.parser.grib
Classes in org.apache.tika.parser.grib that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.hdf
Classes in org.apache.tika.parser.hdf that implement SelfConfiguringModifier and TypeClassDescriptionclassSince theNetCDFParserdepends on the NetCDF-Java API, we are able to use it to parse HDF files as well. -
Uses of SelfConfiguring in org.apache.tika.parser.html
Classes in org.apache.tika.parser.html that implement SelfConfiguringModifier and TypeClassDescriptionclassCharacter encoding detector for determining the character encoding of a HTML document based on the potential charset parameter found in a Content-Type http-equiv meta tag somewhere near the beginning.classHTML parser. -
Uses of SelfConfiguring in org.apache.tika.parser.html.charsetdetector
Classes in org.apache.tika.parser.html.charsetdetector that implement SelfConfiguringModifier and TypeClassDescriptionfinal classFull WHATWG prescan charset detector for HTML: HTTP Content-Type header →<meta charset>/<meta http-equiv>tag, per https://html.spec.whatwg.org/multipage/parsing.html#the-input-byte-stream. -
Uses of SelfConfiguring in org.apache.tika.parser.http
Classes in org.apache.tika.parser.http that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.hwp
Classes in org.apache.tika.parser.hwp that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.image
Classes in org.apache.tika.parser.image that implement SelfConfiguringModifier and TypeClassDescriptionclassclassParser for the Better Portable Graphics (BPG) File Format.classclassA basic parser class for Apple ICNS icon filesclassclassclassTries to scrape XMP out of JXLclassParser for the Adobe Photoshop PSD File Format.classclass -
Uses of SelfConfiguring in org.apache.tika.parser.indesign
Classes in org.apache.tika.parser.indesign that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.iptc
Classes in org.apache.tika.parser.iptc that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.isatab
Classes in org.apache.tika.parser.isatab that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.iwork
Classes in org.apache.tika.parser.iwork that implement SelfConfiguringModifier and TypeClassDescriptionclassA parser for the IWork container files. -
Uses of SelfConfiguring in org.apache.tika.parser.iwork.iwana
Classes in org.apache.tika.parser.iwork.iwana that implement SelfConfiguringModifier and TypeClassDescriptionclassclassFor now, this parser isn't even registered. -
Uses of SelfConfiguring in org.apache.tika.parser.jdbc
Classes in org.apache.tika.parser.jdbc that implement SelfConfiguringModifier and TypeClassDescriptionclassAbstract class that handles iterating through tables within a database. -
Uses of SelfConfiguring in org.apache.tika.parser.journal
Classes in org.apache.tika.parser.journal that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.mail
Classes in org.apache.tika.parser.mail that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.mat
Classes in org.apache.tika.parser.mat that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.mbox
Classes in org.apache.tika.parser.mbox that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.microsoft
Classes in org.apache.tika.parser.microsoft that implement SelfConfiguringModifier and TypeClassDescriptionclassIntermediate layer to setOfficeParserConfiguniformly.classExtracts files embedded in EMF and offers a very rough capability to extract text if there is text stored in the EMF.classParser that handles Microsoft Access files via JackcessclassParser for temporary MSOFfice files.classDefines a Microsoft document content extractor.classA POI-powered Tika Parser for very old versions of Excel, from pre-OLE2 days, such as Excel 4.classA POI-powered Tika Parser for TNEF (Transport Neutral Encoding Format) messages, aka winmail.datclassThis parser offers a very rough capability to extract text if there is text stored in the WMF files. -
Uses of SelfConfiguring in org.apache.tika.parser.microsoft.activemime
Classes in org.apache.tika.parser.microsoft.activemime that implement SelfConfiguringModifier and TypeClassDescriptionclassActiveMime is a macro container format used in some mso files. -
Uses of SelfConfiguring in org.apache.tika.parser.microsoft.chm
Classes in org.apache.tika.parser.microsoft.chm that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.microsoft.libpst
Classes in org.apache.tika.parser.microsoft.libpst that implement SelfConfiguringModifier and TypeClassDescriptionclassThis is an optional PST parser that relies on the user installing the GPL-3 libpst/readpst commandline tool and configuring Tika to call this library via tika-config.xml -
Uses of SelfConfiguring in org.apache.tika.parser.microsoft.onenote
Classes in org.apache.tika.parser.microsoft.onenote that implement SelfConfiguringModifier and TypeClassDescriptionclassOneNote tika parser capable of parsing Microsoft OneNote files. -
Uses of SelfConfiguring in org.apache.tika.parser.microsoft.ooxml
Classes in org.apache.tika.parser.microsoft.ooxml that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.microsoft.ooxml.xwpf.ml2006
Classes in org.apache.tika.parser.microsoft.ooxml.xwpf.ml2006 that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.microsoft.pst
Classes in org.apache.tika.parser.microsoft.pst that implement SelfConfiguringModifier and TypeClassDescriptionclassParser for MS Outlook PST email storage filesclass -
Uses of SelfConfiguring in org.apache.tika.parser.microsoft.rtf
Classes in org.apache.tika.parser.microsoft.rtf that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.microsoft.xml
Classes in org.apache.tika.parser.microsoft.xml that implement SelfConfiguringModifier and TypeClassDescriptionclassclassParses wordml 2003 format Excel files.classParses wordml 2003 format word files. -
Uses of SelfConfiguring in org.apache.tika.parser.mif
Classes in org.apache.tika.parser.mif that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.mp3
Classes in org.apache.tika.parser.mp3 that implement SelfConfiguringModifier and TypeClassDescriptionclassTheMp3Parseris used to parse ID3 Version 1 Tag information from an MP3 file, if available. -
Uses of SelfConfiguring in org.apache.tika.parser.mp4
Classes in org.apache.tika.parser.mp4 that implement SelfConfiguringModifier and TypeClassDescriptionclassParser for the MP4 media container format, as well as the older QuickTime format that MP4 is based on. -
Uses of SelfConfiguring in org.apache.tika.parser.multiple
Classes in org.apache.tika.parser.multiple that implement SelfConfiguringModifier and TypeClassDescriptionclassAbstract base class for parser wrappers which may / will process a given stream multiple times, merging the results of the various parsers used.classTries multiple parsers in turn, until one succeeds.classRuns the input stream through all available parsers, merging the metadata from them based on theAbstractMultipleParser.MetadataPolicychosen. -
Uses of SelfConfiguring in org.apache.tika.parser.ner
Classes in org.apache.tika.parser.ner that implement SelfConfiguringModifier and TypeClassDescriptionclassThis implementation ofParserextracts entity names from text content and adds it to the metadata. -
Uses of SelfConfiguring in org.apache.tika.parser.netcdf
Classes in org.apache.tika.parser.netcdf that implement SelfConfiguringModifier and TypeClassDescriptionclass -
Uses of SelfConfiguring in org.apache.tika.parser.ocr
Classes in org.apache.tika.parser.ocr that implement SelfConfiguringModifier and TypeClassDescriptionclassTesseractOCRParser powered by tesseract-ocr engine. -
Uses of SelfConfiguring in org.apache.tika.parser.ocr.tess4j
Classes in org.apache.tika.parser.ocr.tess4j that implement SelfConfiguringModifier and TypeClassDescriptionclassOCR parser using Tess4J, which provides a Java JNA wrapper around the native Tesseract library. -
Uses of SelfConfiguring in org.apache.tika.parser.ocrencode
Classes in org.apache.tika.parser.ocrencode that implement SelfConfiguringModifier and TypeClassDescriptionclassParser that base64-encodes image content instead of performing OCR text extraction. -
Uses of SelfConfiguring in org.apache.tika.parser.odf
Classes in org.apache.tika.parser.odf that implement SelfConfiguringModifier and TypeClassDescriptionclassclassParser for ODFcontent.xmlfiles.classParser for OpenDocumentmeta.xmlfiles.classOpenOffice parser -
Uses of SelfConfiguring in org.apache.tika.parser.ogg
Classes in org.apache.tika.parser.ogg that implement SelfConfiguringModifier and TypeClassDescriptionclassParser for FLAC audio files (both native FLAC and OGG-FLAC).classParent parser for the various Ogg Audio formats, such as Vorbis and Opus.classGeneral parser for Ogg files where we don't know what the specific kind is.classParser for OGG Opus audio files.classParser for OGG Speex audio files.classParser for OGG Theora video files, which may also contain one or more soundtrack streams.classParser for OGG Vorbis audio files. -
Uses of SelfConfiguring in org.apache.tika.parser.pdf
Classes in org.apache.tika.parser.pdf that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.pkg
Classes in org.apache.tika.parser.pkg that implement SelfConfiguringModifier and TypeClassDescriptionclassAbstract base class for archive parsers that provides common functionality for handling embedded documents within archives.classParser for various compression formats.classParser for streaming archive formats: AR, ARJ, CPIO, DUMP, TAR.classParser for Rar files.classParser for 7z (Seven Zip) archives.classParser for Rar files.classParser for ZIP and JAR archives using file-based access for complete metadata extraction. -
Uses of SelfConfiguring in org.apache.tika.parser.prt
Classes in org.apache.tika.parser.prt that implement SelfConfiguringModifier and TypeClassDescriptionclassA basic text extracting parser for the CADKey PRT (CAD Drawing) format. -
Uses of SelfConfiguring in org.apache.tika.parser.sas
Classes in org.apache.tika.parser.sas that implement SelfConfiguringModifier and TypeClassDescriptionclassProcesses the SAS7BDAT data columnar database file used by SAS and other similar languages. -
Uses of SelfConfiguring in org.apache.tika.parser.sqlite3
Classes in org.apache.tika.parser.sqlite3 that implement SelfConfiguringModifier and TypeClassDescriptionclassThis is the implementation of the db parser for SQLite.classThis is the main class for parsing SQLite3 files. -
Uses of SelfConfiguring in org.apache.tika.parser.strings
Classes in org.apache.tika.parser.strings that implement SelfConfiguringModifier and TypeClassDescriptionclassParser to extract printable Latin1 strings from arbitrary files with pure java without running any external process.classParser that uses the "strings" (or strings-alternative) command to find the printable strings in a object, or other binary, file (application/octet-tis). -
Uses of SelfConfiguring in org.apache.tika.parser.tmx
Classes in org.apache.tika.parser.tmx that implement SelfConfiguringModifier and TypeClassDescriptionclassParser for Translation Memory eXchange (TMX) files. -
Uses of SelfConfiguring in org.apache.tika.parser.transcribe.aws
Classes in org.apache.tika.parser.transcribe.aws that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.txt
Classes in org.apache.tika.parser.txt that implement SelfConfiguringModifier and TypeClassDescriptionclassclassPlain text parser.class -
Uses of SelfConfiguring in org.apache.tika.parser.video
Classes in org.apache.tika.parser.video that implement SelfConfiguringModifier and TypeClassDescriptionclassParser for metadata contained in Flash Videos (.flv). -
Uses of SelfConfiguring in org.apache.tika.parser.vlm
Classes in org.apache.tika.parser.vlm that implement SelfConfiguringModifier and TypeClassDescriptionclassAbstract base class for parsers that delegate to a remote Vision-Language Model (VLM) endpoint for OCR and document understanding.classVLM parser for the Anthropic Claude Messages API.classVLM parser for the Google GeminigenerateContentAPI.classVLM parser for OpenAI-compatible chat completions endpoints (OpenAI, Azure OpenAI, OpenRouter, vLLM, Ollama, LiteLLM, Together AI, Groq, Fireworks, Mistral, NVIDIA NIM, Jina, local FastAPI wrappers, etc.). -
Uses of SelfConfiguring in org.apache.tika.parser.wacz
Classes in org.apache.tika.parser.wacz that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.parser.warc
Classes in org.apache.tika.parser.warc that implement SelfConfiguringModifier and TypeClassDescriptionclassThis uses jwarc to parse warc files and arc files -
Uses of SelfConfiguring in org.apache.tika.parser.wordperfect
Classes in org.apache.tika.parser.wordperfect that implement SelfConfiguringModifier and TypeClassDescriptionclassParser for Corel QuattroPro documents (part of Corel WordPerfect Office Suite).classParser for Corel WordPerfect documents. -
Uses of SelfConfiguring in org.apache.tika.parser.xliff
Classes in org.apache.tika.parser.xliff that implement SelfConfiguringModifier and TypeClassDescriptionclassParser for XLIFF 1.2 files.classParser for XLZ Archives. -
Uses of SelfConfiguring in org.apache.tika.parser.xml
Classes in org.apache.tika.parser.xml that implement SelfConfiguringModifier and TypeClassDescriptionclassDublin Core metadata parserclassclassclassXML parser.class -
Uses of SelfConfiguring in org.apache.tika.renderer
Subinterfaces of SelfConfiguring in org.apache.tika.rendererClasses in org.apache.tika.renderer that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.renderer.pdf.pdfbox
Subinterfaces of SelfConfiguring in org.apache.tika.renderer.pdf.pdfboxModifier and TypeInterfaceDescriptioninterfacestub interface for the PDFParser to use to figure out if it needs to pass on the PDDocument or create a temp file to be used by a file-based renderer down the road.Classes in org.apache.tika.renderer.pdf.pdfbox that implement SelfConfiguring -
Uses of SelfConfiguring in org.apache.tika.renderer.pdf.poppler
Classes in org.apache.tika.renderer.pdf.poppler that implement SelfConfiguringModifier and TypeClassDescriptionclassRenderer that uses Poppler'spdftoppmcommand to convert PDF pages to PNG images.