Uses of Class
org.apache.tika.metadata.Metadata
Package
Description
Apache Tika.
Media type detection.
Extraction of component documents.
Forked parser.
IO utilities.
Multi-valued metadata container, and set of constant metadata fields.
Media type information.
Tika parsers.
External parser process.
SAX utilities.
Utilities.
-
Uses of Metadata in org.apache.tika
Modifier and TypeMethodDescriptionTika.detect
(InputStream stream, Metadata metadata) Detects the media type of the given document.Parses the given file and returns the extracted text content.Tika.parse
(InputStream stream, Metadata metadata) Parses the given document and returns the extracted text content.Parses the file at the given path and returns the extracted text content.Tika.parseToString
(InputStream stream, Metadata metadata) Parses the given document and returns the extracted text content.Tika.parseToString
(InputStream stream, Metadata metadata, int maxLength) Parses the given document and returns the extracted text content. -
Uses of Metadata in org.apache.tika.batch
Modifier and TypeMethodDescriptionFileResource.getMetadata()
This gets the metadata available before the parsing of the file.Modifier and TypeMethodDescriptionOutputStreamFactory.getOutputStream
(Metadata metadata) protected void
FileResourceConsumer.parse
(String resourceId, Parser parser, InputStream is, ContentHandler handler, Metadata m, ParseContext parseContext) Utility method to handle logging equivalently among all implementing classes.protected boolean
-
Uses of Metadata in org.apache.tika.batch.fs
Modifier and TypeMethodDescriptionFSOutputStreamFactory.getOutputStream
(Metadata metadata) This tries to create a file based on theFSUtil.HANDLE_EXISTING
value that was passed in during initialization.boolean
-
Uses of Metadata in org.apache.tika.detect
Modifier and TypeMethodDescriptionCompositeDetector.detect
(InputStream input, Metadata metadata) CompositeEncodingDetector.detect
(InputStream input, Metadata metadata) Detector.detect
(InputStream input, Metadata metadata) Detects the content type of the given input document.EmptyDetector.detect
(InputStream input, Metadata metadata) EncodingDetector.detect
(InputStream input, Metadata metadata) Detects the character encoding of the given text document, ornull
if the encoding of the document can not be detected.FileCommandDetector.detect
(InputStream input, Metadata metadata) MagicDetector.detect
(InputStream input, Metadata metadata) NameDetector.detect
(InputStream input, Metadata metadata) Detects the content type of an input document based on the document name given in the input metadata.NonDetectingEncodingDetector.detect
(InputStream input, Metadata metadata) OverrideDetector.detect
(InputStream input, Metadata metadata) Deprecated.TextDetector.detect
(InputStream input, Metadata metadata) Looks at the beginning of the document input stream to determine whether the document is text or not.TrainedModelDetector.detect
(InputStream input, Metadata metadata) TypeDetector.detect
(InputStream input, Metadata metadata) Detects the content type of an input document based on a type hint given in the input metadata.ZeroSizeFileDetector.detect
(InputStream stream, Metadata metadata) ModifierConstructorDescriptionAutoDetectReader
(InputStream stream, Metadata metadata) AutoDetectReader
(InputStream stream, Metadata metadata, ServiceLoader loader) AutoDetectReader
(InputStream stream, Metadata metadata, EncodingDetector encodingDetector) -
Uses of Metadata in org.apache.tika.detect.apple
-
Uses of Metadata in org.apache.tika.detect.gzip
Modifier and TypeMethodDescriptionGZipSpecializationDetector.detect
(InputStream input, Metadata metadata) -
Uses of Metadata in org.apache.tika.detect.microsoft
Modifier and TypeMethodDescriptionPOIFSContainerDetector.detect
(InputStream input, Metadata metadata) -
Uses of Metadata in org.apache.tika.detect.ole
-
Uses of Metadata in org.apache.tika.detect.siegfried
Modifier and TypeMethodDescriptionSiegfriedDetector.detect
(InputStream input, Metadata metadata) protected static MediaType
SiegfriedDetector.processResult
(FileProcessResult result, Metadata metadata, boolean returnMime) -
Uses of Metadata in org.apache.tika.detect.zip
Modifier and TypeMethodDescriptionDefaultZipContainerDetector.detect
(InputStream input, Metadata metadata) DeprecatedStreamingZipContainerDetector.detect
(InputStream is, Metadata metadata) StreamingZipContainerDetector.detect
(InputStream input, Metadata metadata) -
Uses of Metadata in org.apache.tika.dl.imagerec
Modifier and TypeMethodDescriptionDL4JInceptionV3Net.recognise
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) DL4JVGG16Net.recognise
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.embedder
Modifier and TypeMethodDescriptionvoid
Embedder.embed
(Metadata metadata, InputStream originalStream, OutputStream outputStream, ParseContext context) Embeds related document metadata from the given metadata object into the given output stream.void
ExternalEmbedder.embed
(Metadata metadata, InputStream inputStream, OutputStream outputStream, ParseContext context) Executes the configured external command and passes the given document stream as a simple XHTML document to the given SAX content handler.ExternalEmbedder.getCommandMetadataSegments
(Metadata metadata) Constructs a collection of command line arguments responsible for setting individual metadata fields based on the givenmetadata
. -
Uses of Metadata in org.apache.tika.eval.app
Modifier and TypeMethodDescriptionprotected static ContentTags
AbstractProfiler.getContent
(org.apache.tika.eval.app.EvalFilePaths evalFilePaths, Metadata metadata) protected org.apache.tika.eval.app.EvalFilePaths
AbstractProfiler.getPathsFromExtractCrawl
(Metadata metadata, Path extracts) protected org.apache.tika.eval.app.EvalFilePaths
AbstractProfiler.getPathsFromSrcCrawl
(Metadata metadata, Path srcDir, Path extracts) protected void
AbstractProfiler.writeExceptionData
(String fileId, Metadata m, TableInfo exceptionTable) protected void
AbstractProfiler.writeProfileData
(org.apache.tika.eval.app.EvalFilePaths fps, int i, ContentTags contentTags, Metadata m, String fileId, String containerId, List<Integer> numAttachments, TableInfo profileTable) Modifier and TypeMethodDescriptionprotected long
AbstractProfiler.getSourceFileLength
(org.apache.tika.eval.app.EvalFilePaths fps, List<Metadata> metadataList) -
Uses of Metadata in org.apache.tika.eval.app.io
-
Uses of Metadata in org.apache.tika.eval.core.metadata
-
Uses of Metadata in org.apache.tika.example
Modifier and TypeMethodDescriptionParsingExample.recursiveParserWrapperExample()
For documents that may contain embedded documents, it might be helpful to create list of metadata objects, one for the container document and one for each embedded document.Modifier and TypeMethodDescriptionEncryptedPrescriptionDetector.detect
(InputStream stream, Metadata metadata) protected ContentHandler
PrescriptionParser.getContentHandler
(ContentHandler handler, Metadata metadata, ParseContext context) void
DirListParser.parse
(InputStream is, ContentHandler handler, Metadata metadata) void
DirListParser.parse
(InputStream is, ContentHandler handler, Metadata metadata, ParseContext context) void
EncryptedPrescriptionParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
LanguageDetectingParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
PickBestTextEncodingParser.parse
(InputStream stream, ContentHandlerFactory handlers, Metadata metadata, ParseContext context) Deprecated.void
PickBestTextEncodingParser.parse
(InputStream stream, ContentHandler handler, Metadata originalMetadata, ParseContext context) Deprecated.protected boolean
PickBestTextEncodingParser.parserCompleted
(Parser parser, Metadata metadata, ContentHandler handler, ParseContext context, Exception exception) Deprecated.protected void
PickBestTextEncodingParser.parserPrepare
(Parser parser, Metadata metadata, ParseContext context) Deprecated.static String
MyFirstTika.parseUsingAutoDetect
(String filename, TikaConfig tikaConfig, Metadata metadata) static String
MyFirstTika.parseUsingComponents
(String filename, TikaConfig tikaConfig, Metadata metadata) -
Uses of Metadata in org.apache.tika.extractor
Modifier and TypeMethodDescriptionvoid
AbstractEmbeddedDocumentBytesHandler.add
(int id, Metadata metadata, InputStream bytes) void
BasicEmbeddedDocumentBytesHandler.add
(int id, Metadata metadata, InputStream is) void
EmbeddedDocumentBytesHandler.add
(int id, Metadata metadata, InputStream inputStream) AbstractEmbeddedDocumentBytesHandler.getEmitKey
(String containerEmitKey, int embeddedId, EmbeddedDocumentBytesConfig embeddedDocumentBytesConfig, Metadata metadata) EmbeddedDocumentUtil.getExtension
(TikaInputStream is, Metadata metadata) EmbeddedDocumentExtractorFactory.newInstance
(Metadata metadata, ParseContext parseContext) ParsingEmbeddedDocumentExtractorFactory.newInstance
(Metadata metadata, ParseContext parseContext) RUnpackExtractorFactory.newInstance
(Metadata metadata, ParseContext parseContext) void
EmbeddedDocumentExtractor.parseEmbedded
(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) Processes the supplied embedded resource, calling the delegating parser with the appropriate details.void
EmbeddedDocumentUtil.parseEmbedded
(InputStream inputStream, ContentHandler handler, Metadata metadata, boolean outputHtml) void
ParsingEmbeddedDocumentExtractor.parseEmbedded
(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) void
RUnpackExtractor.parseEmbedded
(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) static void
EmbeddedDocumentUtil.recordEmbeddedStreamException
(Throwable t, Metadata m) static void
EmbeddedDocumentUtil.recordException
(Throwable t, Metadata m) boolean
boolean
Checks if a document with the given metadata matches the specified selection criteria.boolean
boolean
boolean
EmbeddedDocumentExtractor.shouldParseEmbedded
(Metadata metadata) boolean
EmbeddedDocumentUtil.shouldParseEmbedded
(Metadata m) boolean
ParsingEmbeddedDocumentExtractor.shouldParseEmbedded
(Metadata metadata) boolean
DefaultEmbeddedStreamTranslator.shouldTranslate
(InputStream inputStream, Metadata metadata) This should sniff the stream to determine if it needs to be translated.boolean
EmbeddedStreamTranslator.shouldTranslate
(InputStream inputStream, Metadata metadata) DefaultEmbeddedStreamTranslator.translate
(InputStream inputStream, Metadata metadata) This will consume the InputStream and return a new stream of translated bytes.EmbeddedStreamTranslator.translate
(InputStream inputStream, Metadata metadata) -
Uses of Metadata in org.apache.tika.extractor.microsoft
Modifier and TypeMethodDescriptionboolean
MSEmbeddedStreamTranslator.shouldTranslate
(InputStream inputStream, Metadata metadata) MSEmbeddedStreamTranslator.translate
(InputStream inputStream, Metadata metadata) -
Uses of Metadata in org.apache.tika.fork
Modifier and TypeMethodDescriptionvoid
ForkParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) This sends the objects to the server for parsing, and the server via the proxies acts on the handler as if it were updating it directly. -
Uses of Metadata in org.apache.tika.io
Modifier and TypeMethodDescriptionTemporaryResources.createTempFile
(Metadata metadata) Creates a temporary file that will automatically be deleted when theTemporaryResources.close()
method is called, returning its path.static TikaInputStream
Creates a TikaInputStream from the given array of bytes.static TikaInputStream
Deprecated.static TikaInputStream
TikaInputStream.get
(InputStream stream, TemporaryResources tmp, Metadata metadata) Casts or wraps the given stream to a TikaInputStream instance.static TikaInputStream
Creates a TikaInputStream from the resource at the given URI.static TikaInputStream
Creates a TikaInputStream from the resource at the given URL.static TikaInputStream
Creates a TikaInputStream from the file at the given path.static TikaInputStream
TikaInputStream.get
(Path path, Metadata metadata, TemporaryResources tmp) static TikaInputStream
Creates a TikaInputStream from the given database BLOB. -
Uses of Metadata in org.apache.tika.langdetect.opennlp.metadatafilter
-
Uses of Metadata in org.apache.tika.langdetect.optimaize.metadatafilter
-
Uses of Metadata in org.apache.tika.metadata
Modifier and TypeMethodDescriptionstatic void
XMPDM.ChannelTypePropertyConverter.convertAndSet
(Metadata metadata, Object value) Deprecated.How convert+set might work -
Uses of Metadata in org.apache.tika.metadata.filter
Modifier and TypeMethodDescriptionvoid
void
void
void
void
void
void
void
void
abstract void
void
-
Uses of Metadata in org.apache.tika.mime
Modifier and TypeMethodDescriptionMimeTypes.detect
(InputStream input, Metadata metadata) Automatically detects the MIME type of a document based on magic markers in the stream prefix and any given metadata hints.ProbabilisticMimeDetectionSelector.detect
(InputStream input, Metadata metadata) -
Uses of Metadata in org.apache.tika.parser
Modifier and TypeMethodDescriptionvoid
ParseRecord.addMetadata
(Metadata metadata) void
DigestingParser.Digester.digest
(InputStream is, Metadata m, ParseContext parseContext) Digests an InputStream and sets the appropriate value(s) in the metadata.protected Parser
Returns the parser that best matches the given metadata.protected Parser
CompositeParser.getParser
(Metadata metadata, ParseContext context) PasswordProvider.getPassword
(Metadata metadata) Looks up the password for a document with the given metadata, and returns it for the Parser.static String
RecursiveParserWrapper.getResourceName
(Metadata metadata, AtomicInteger counter) void
AbstractParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata) Deprecated.use theParser.parse(InputStream, ContentHandler, Metadata, ParseContext)
method insteadvoid
AutoDetectParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata) void
AutoDetectParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
CompositeParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Delegates the call to the matching component parser.void
CryptoParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
DelegatingParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Looks up the delegate parser from the parsing context and delegates the parse operation to it.void
DigestingParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
EmptyParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
ErrorParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
NetworkParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
Parser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Parses a document stream into a sequence of XHTML SAX events.void
ParserDecorator.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Delegates the method call to the decorated parser.void
ParserPostProcessor.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Forwards the call to the delegated parser and post-processes the results as described above.void
RecursiveParserWrapper.parse
(InputStream stream, ContentHandler recursiveParserWrapperHandler, Metadata metadata, ParseContext context) void
RegexCaptureParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) ModifierConstructorDescriptionParsingReader
(Parser parser, InputStream stream, Metadata metadata, ParseContext context) Creates a reader for the text content of the given binary stream with the given document metadata.ParsingReader
(Parser parser, InputStream stream, Metadata metadata, ParseContext context, Executor executor) Creates a reader for the text content of the given binary stream with the given document metadata. -
Uses of Metadata in org.apache.tika.parser.apple
Modifier and TypeMethodDescriptionvoid
AppleSingleFileParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
PListParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.asm
Modifier and TypeMethodDescriptionvoid
ClassParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.audio
Modifier and TypeMethodDescriptionvoid
AudioParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
MidiParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.code
Modifier and TypeMethodDescriptionvoid
SourceCodeParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.crypto
Modifier and TypeMethodDescriptionvoid
Pkcs7Parser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
TSDParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.csv
Modifier and TypeMethodDescriptionvoid
TextAndCSVParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.ctakes
Modifier and TypeMethodDescriptionCTAKESContentHandler.getMetadata()
Returns metadata that includes cTAKES annotations.Modifier and TypeMethodDescriptionvoid
CTAKESParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) ModifierConstructorDescriptionCTAKESContentHandler
(ContentHandler handler, Metadata metadata) Creates a newCTAKESContentHandler
for the givenContentHandler
and Metadata objects.CTAKESContentHandler
(ContentHandler handler, Metadata metadata, CTAKESConfig config) Creates a newCTAKESContentHandler
for the givenContentHandler
and Metadata objects. -
Uses of Metadata in org.apache.tika.parser.dbf
Modifier and TypeMethodDescriptionvoid
DBFParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.dgn
Modifier and TypeMethodDescriptionvoid
DGN8Parser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.dif
Modifier and TypeMethodDescriptionprotected ContentHandler
DIFParser.getContentHandler
(ContentHandler handler, Metadata metadata, ParseContext context) void
DIFParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.digest
Modifier and TypeMethodDescriptionvoid
CompositeDigester.digest
(InputStream is, Metadata m, ParseContext parseContext) void
InputStreamDigester.digest
(InputStream is, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.parser.dwg
Modifier and TypeMethodDescriptionvoid
DWGParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
DWGReadParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.envi
Modifier and TypeMethodDescriptionvoid
EnviHeaderParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.epub
Modifier and TypeMethodDescriptionprotected ContentHandler
OPFParser.getContentHandler
(ContentHandler handler, Metadata metadata, ParseContext context) void
EpubContentParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
EpubParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.executable
Modifier and TypeMethodDescriptionvoid
ExecutableParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
ExecutableParser.parseELF
(XHTMLContentHandler xhtml, Metadata metadata, InputStream stream, byte[] first4) Parses a Unix ELF filevoid
ExecutableParser.parseMachO
(XHTMLContentHandler xhtml, Metadata metadata, InputStream stream, byte[] first4) Parses a Mach-O filevoid
ExecutableParser.parsePE
(XHTMLContentHandler xhtml, Metadata metadata, InputStream stream, byte[] first4) Parses a DOS or Windows PE file -
Uses of Metadata in org.apache.tika.parser.external
Modifier and TypeMethodDescriptionvoid
ExternalParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Executes the configured external command and passes the given document stream as a simple XHTML document to the given SAX content handler. -
Uses of Metadata in org.apache.tika.parser.external2
Modifier and TypeMethodDescriptionvoid
ExternalParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.feed
Modifier and TypeMethodDescriptionvoid
FeedParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.font
Modifier and TypeMethodDescriptionvoid
AdobeFontMetricParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
TrueTypeParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.gdal
Modifier and TypeMethodDescriptionvoid
GDALParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.geo.topic
Modifier and TypeMethodDescriptionvoid
GeoParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.geoinfo
Modifier and TypeMethodDescriptionvoid
GeographicInformationParser.parse
(InputStream inputStream, ContentHandler contentHandler, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.parser.geopkg
Modifier and TypeMethodDescriptionvoid
GeoPkgParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.grib
Modifier and TypeMethodDescriptionvoid
GribParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.hdf
Modifier and TypeMethodDescriptionvoid
HDFParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) protected void
HDFParser.unravelStringMet
(ucar.nc2.NetcdfFile ncFile, ucar.nc2.Group group, Metadata met) -
Uses of Metadata in org.apache.tika.parser.html
Modifier and TypeMethodDescriptionHtmlEncodingDetector.detect
(InputStream input, Metadata metadata) void
JSoupParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
JSoupParser.parseString
(String html, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.html.charsetdetector
Modifier and TypeMethodDescriptionStandardHtmlEncodingDetector.detect
(InputStream input, Metadata metadata) -
Uses of Metadata in org.apache.tika.parser.http
Modifier and TypeMethodDescriptionvoid
HttpParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.hwp
Modifier and TypeMethodDescriptionvoid
HwpTextExtractorV5.extract
(InputStream source, Metadata metadata, XHTMLContentHandler xhtml) extract Text from HWP Stream.void
HwpV5Parser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.image
Modifier and TypeMethodDescriptionvoid
AbstractImageParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
ICNSParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
JXLParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
PSDParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
WebPParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) ModifierConstructorDescriptionImageMetadataExtractor
(Metadata metadata) ImageMetadataExtractor
(Metadata metadata, org.apache.tika.parser.image.ImageMetadataExtractor.DirectoryHandler... handlers) -
Uses of Metadata in org.apache.tika.parser.indesign
Modifier and TypeMethodDescriptionvoid
IDMLParser.parse
(InputStream stream, ContentHandler baseHandler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.iptc
Modifier and TypeMethodDescriptionvoid
IptcAnpaParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata) Deprecated.This method will be removed in Apache Tika 1.0.void
IptcAnpaParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.isatab
Modifier and TypeMethodDescriptionvoid
ISArchiveParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) static void
ISATabUtils.parseAssay
(InputStream stream, XHTMLContentHandler xhtml, Metadata metadata, ParseContext context) static void
ISATabUtils.parseInvestigation
(InputStream stream, XHTMLContentHandler handler, Metadata metadata, ParseContext context) static void
ISATabUtils.parseInvestigation
(InputStream stream, XHTMLContentHandler handler, Metadata metadata, ParseContext context, String studyFileName) static void
ISATabUtils.parseStudy
(InputStream stream, XHTMLContentHandler xhtml, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.iwork
Modifier and TypeMethodDescriptionvoid
IWorkPackageParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.iwork.iwana
Modifier and TypeMethodDescriptionvoid
IWork13PackageParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
IWork18PackageParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.jdbc
Modifier and TypeMethodDescriptionprotected void
AbstractDBParser.extractMetadata
(Connection connection, Metadata metadata) This is called before parsing the tables to extract metadata from the db, if any.protected Blob
protected Connection
AbstractDBParser.getConnection
(InputStream stream, Metadata metadata, ParseContext context) Override this for special configuration of the connection, such as limiting the number of rows to be held in memory.protected abstract String
AbstractDBParser.getConnectionString
(InputStream stream, Metadata metadata, ParseContext parseContext) Implement for db specific connection information, e.g.AbstractDBParser.getTableNames
(Connection connection, Metadata metadata, ParseContext context) Returns the names of the tables to processvoid
AbstractDBParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.journal
Modifier and TypeMethodDescriptionvoid
GrobidRESTParser.parse
(String filePath, ContentHandler handler, Metadata metadata, ParseContext context) void
JournalParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.mail
Modifier and TypeMethodDescriptionvoid
RFC822Parser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.mailcommons
Modifier and TypeMethodDescriptionstatic void
MailUtil.addPersonAndEmail
(String string, Property personProperty, Property emailProperty, Metadata metadata) This tries to split a "from" or "to" value into a person field and an email field.static void
MailUtil.setPersonAndEmail
(String string, Property personProperty, Property emailProperty, Metadata metadata) This tries to split a "from" or "to" value into a person field and an email field. -
Uses of Metadata in org.apache.tika.parser.mat
Modifier and TypeMethodDescriptionvoid
MatParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.mbox
Modifier and TypeMethodDescriptionvoid
MboxParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.microsoft
Modifier and TypeFieldDescriptionprotected final Metadata
AbstractPOIFSExtractor.parentMetadata
protected final Metadata
AbstractPOIFSExtractor.parentMetadata
protected final Metadata
AbstractPOIFSExtractor.parentMetadata
protected final Metadata
AbstractPOIFSExtractor.parentMetadata
Modifier and TypeMethodDescriptionstatic void
OutlookExtractor.addEvenIfNull
(Property property, String value, Metadata metadata) static void
void
EMFParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
JackcessParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
MSOwnerFileParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Extracts owner from MS temp filevoid
OfficeParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Extracts properties and text from an MS Document input streamprotected void
OfficeParser.parse
(org.apache.poi.poifs.filesystem.DirectoryNode root, ParseContext context, Metadata metadata, XHTMLContentHandler xhtml) void
OldExcelParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Extracts properties and text from an MS Document input streamvoid
TNEFParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Extracts properties and text from an MS Document input streamvoid
WMFParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) ModifierConstructorDescriptionExcelExtractor
(ParseContext context, Metadata metadata) HSLFExtractor
(ParseContext context, Metadata metadata) OutlookExtractor
(org.apache.poi.poifs.filesystem.DirectoryNode root, Metadata metadata, ParseContext context) SummaryExtractor
(Metadata metadata) WordExtractor
(ParseContext context, Metadata metadata) -
Uses of Metadata in org.apache.tika.parser.microsoft.activemime
Modifier and TypeMethodDescriptionvoid
ActiveMimeParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.microsoft.chm
Modifier and TypeMethodDescriptionvoid
ChmParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.microsoft.libpst
Modifier and TypeMethodDescriptionvoid
LibPstParser.parse
(InputStream inputStream, ContentHandler contentHandler, Metadata metadata, ParseContext parseContext) ModifierConstructorDescriptionEmailVisitor
(Path root, boolean processEmailAsMsg, XHTMLContentHandler xhtml, Metadata parentMetadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.parser.microsoft.onenote
Modifier and TypeMethodDescriptionvoid
OneNoteParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.microsoft.onenote.fsshttpb
Modifier and TypeMethodDescriptionvoid
MSOneStorePackage.walkTree
(OneNoteTreeWalkerOptions options, Metadata metadata, XHTMLContentHandler xhtml) -
Uses of Metadata in org.apache.tika.parser.microsoft.ooxml
Modifier and TypeMethodDescriptionvoid
void
AbstractOOXMLExtractor.getXHTML
(ContentHandler handler, Metadata metadata, ParseContext context) void
OOXMLExtractor.getXHTML
(ContentHandler handler, Metadata metadata, ParseContext context) Parses the document into a sequence of XHTML SAX events sent to the given content handler.void
XSSFBExcelExtractorDecorator.getXHTML
(ContentHandler handler, Metadata metadata, ParseContext context) void
XSSFExcelExtractorDecorator.getXHTML
(ContentHandler handler, Metadata metadata, ParseContext context) AbstractOOXMLExtractor.loadLinkedRelationships
(org.apache.poi.openxml4j.opc.PackagePart bodyPart, boolean includeInternal, Metadata metadata) This is used by the SAX docx and pptx decorators to load hyperlinks and other linked objectsstatic void
OOXMLExtractorFactory.parse
(InputStream stream, ContentHandler baseHandler, Metadata metadata, ParseContext context) void
OOXMLParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) ModifierConstructorDescriptionSXSLFPowerPointExtractorDecorator
(Metadata metadata, ParseContext context, XSLFEventBasedPowerPointExtractor extractor) SXWPFWordExtractorDecorator
(Metadata metadata, ParseContext context, XWPFEventBasedWordExtractor extractor) XSLFPowerPointExtractorDecorator
(Metadata metadata, ParseContext context, org.apache.poi.xslf.extractor.XSLFExtractor extractor) XWPFWordExtractorDecorator
(Metadata metadata, ParseContext context, org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor) -
Uses of Metadata in org.apache.tika.parser.microsoft.ooxml.xwpf.ml2006
Modifier and TypeMethodDescriptionvoid
Word2006MLParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.microsoft.pst
Modifier and TypeMethodDescriptionvoid
OutlookPSTParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
PSTMailItemParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.microsoft.rtf
Modifier and TypeMethodDescriptionvoid
RTFParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.microsoft.xml
Modifier and TypeMethodDescriptionprotected ContentHandler
AbstractXML2003Parser.getContentHandler
(ContentHandler ch, Metadata md, ParseContext context) protected ContentHandler
SpreadsheetMLParser.getContentHandler
(ContentHandler ch, Metadata metadata, ParseContext context) protected ContentHandler
WordMLParser.getContentHandler
(ContentHandler ch, Metadata metadata, ParseContext context) void
AbstractXML2003Parser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) protected abstract void
AbstractXML2003Parser.setContentType
(Metadata contentType) void
SpreadsheetMLParser.setContentType
(Metadata metadata) void
WordMLParser.setContentType
(Metadata metadata) -
Uses of Metadata in org.apache.tika.parser.mif
Modifier and TypeMethodDescriptionMIFParser.getContentHandler
(ContentHandler handler, Metadata metadata) Get the content handler to use.void
MIFParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.mp3
Modifier and TypeMethodDescriptionvoid
Mp3Parser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.mp4
Modifier and TypeMethodDescriptionvoid
MP4Parser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) ModifierConstructorDescriptionTikaMp4BoxHandler
(com.drew.metadata.Metadata metadata, Metadata tikaMetadata, XHTMLContentHandler xhtml) -
Uses of Metadata in org.apache.tika.parser.mp4.boxes
ModifierConstructorDescriptionTikaUserDataBox
(String box, byte[] payload, Metadata metadata, XHTMLContentHandler xhtml) -
Uses of Metadata in org.apache.tika.parser.multiple
Modifier and TypeMethodDescriptionprotected static Metadata
AbstractMultipleParser.mergeMetadata
(Metadata newMetadata, Metadata lastMetadata, AbstractMultipleParser.MetadataPolicy policy) Modifier and TypeMethodDescriptionprotected static Metadata
AbstractMultipleParser.mergeMetadata
(Metadata newMetadata, Metadata lastMetadata, AbstractMultipleParser.MetadataPolicy policy) void
AbstractMultipleParser.parse
(InputStream stream, ContentHandlerFactory handlers, Metadata metadata, ParseContext context) Deprecated.TheContentHandlerFactory
override is still experimental and the method signature is subject to change before Tika 2.0void
AbstractMultipleParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Processes the given Stream through one or more parsers, resetting things between parsers as requested by policy.protected abstract boolean
AbstractMultipleParser.parserCompleted
(Parser parser, Metadata metadata, ContentHandler handler, ParseContext context, Exception exception) Used to notify implementations that a Parser has Finished or Failed, and to allow them to decide to continue or abort further parsingprotected boolean
FallbackParser.parserCompleted
(Parser parser, Metadata metadata, ContentHandler handler, ParseContext context, Exception exception) protected boolean
SupplementingParser.parserCompleted
(Parser parser, Metadata metadata, ContentHandler handler, ParseContext context, Exception exception) protected void
AbstractMultipleParser.parserPrepare
(Parser parser, Metadata metadata, ParseContext context) Used to allow implementations to prepare or change things before parsing occurs -
Uses of Metadata in org.apache.tika.parser.ner
Modifier and TypeMethodDescriptionvoid
NamedEntityParser.parse
(InputStream inputStream, ContentHandler contentHandler, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.parser.netcdf
Modifier and TypeMethodDescriptionvoid
NetCDFParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.ocr
Modifier and TypeMethodDescriptionvoid
TesseractOCRParser.parse
(Image image, ContentHandler handler, Metadata metadata, ParseContext context) void
TesseractOCRParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.parser.odf
Modifier and TypeMethodDescriptionprotected ContentHandler
OpenDocumentMetaParser.getContentHandler
(ContentHandler ch, Metadata md, ParseContext context) void
FlatOpenDocumentParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
OpenDocumentContentParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
OpenDocumentMetaParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
OpenDocumentParser.parse
(InputStream stream, ContentHandler baseHandler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.pdf
Modifier and TypeMethodDescriptionvoid
Checks to see if a document's content should be extracted based on metadata values and the value ofAccessChecker.allowExtractionForAccessibility
in the constructor.static void
PDMetadataExtractor.extract
(org.apache.jempbox.xmp.XMPMetadata xmp, Metadata metadata, ParseContext context) static void
PDMetadataExtractor.extract
(org.apache.pdfbox.pdmodel.common.PDMetadata pdMetadata, Metadata metadata, ParseContext context) protected org.apache.pdfbox.pdmodel.PDDocument
PDFParser.getPDDocument
(InputStream inputStream, String password, org.apache.pdfbox.io.RandomAccessStreamCache.StreamCacheCreateFunction streamCacheCreateFunction, Metadata metadata, ParseContext parseContext) protected org.apache.pdfbox.pdmodel.PDDocument
PDFParser.getPDDocument
(InputStream stream, TikaInputStream tstream, String password, org.apache.pdfbox.io.RandomAccessStreamCache.StreamCacheCreateFunction streamCacheCreateFunction, Metadata metadata, ParseContext context) protected org.apache.pdfbox.pdmodel.PDDocument
PDFParser.getPDDocument
(Path path, String password, org.apache.pdfbox.io.RandomAccessStreamCache.StreamCacheCreateFunction streamCacheCreateFunction, Metadata metadata, ParseContext parseContext) void
PDFParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) static void
PDFMarkedContent2XHTML.process
(org.apache.pdfbox.pdmodel.PDDocument pdDocument, ContentHandler handler, ParseContext context, Metadata metadata, PDFParserConfig config) Converts the given PDF document (and related metadata) to a stream of XHTML SAX events sent to the given content handler. -
Uses of Metadata in org.apache.tika.parser.pdf.image
Modifier and TypeMethodDescriptionprotected void
ImageGraphicsEngine.extractInlineImageMetadataOnly
(org.apache.pdfbox.pdmodel.graphics.image.PDImage pdImage, Metadata metadata) protected String
ImageGraphicsEngine.getSuffix
(org.apache.pdfbox.pdmodel.graphics.image.PDImage pdImage, Metadata metadata) ImageGraphicsEngineFactory.newEngine
(org.apache.pdfbox.pdmodel.PDPage page, int pageNumber, EmbeddedDocumentExtractor embeddedDocumentExtractor, PDFParserConfig pdfParserConfig, Map<org.apache.pdfbox.cos.COSStream, Integer> processedInlineImages, AtomicInteger imageCounter, XHTMLContentHandler xhtml, Metadata parentMetadata, ParseContext parseContext) ModifierConstructorDescriptionprotected
ImageGraphicsEngine
(org.apache.pdfbox.pdmodel.PDPage page, int pageNumber, EmbeddedDocumentExtractor embeddedDocumentExtractor, PDFParserConfig pdfParserConfig, Map<org.apache.pdfbox.cos.COSStream, Integer> processedInlineImages, AtomicInteger imageCounter, XHTMLContentHandler xhtml, Metadata parentMetadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.parser.pkg
Modifier and TypeMethodDescriptionprotected static Metadata
PackageParser.handleEntryMetadata
(String name, Date createAt, Date modifiedAt, Long size, XHTMLContentHandler xhtml) Modifier and TypeMethodDescriptionboolean
CompressorParserOptions.decompressConcatenated
(Metadata metadata) void
CompressorParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
PackageParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
RarParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
UnrarParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.pot
Modifier and TypeMethodDescriptionvoid
PooledTimeSeriesParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Parses a document stream into a sequence of XHTML SAX events. -
Uses of Metadata in org.apache.tika.parser.prt
Modifier and TypeMethodDescriptionvoid
PRTParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.recognition
Modifier and TypeMethodDescriptionvoid
AgeRecogniser.parse
(InputStream inputStream, ContentHandler handler, Metadata metadata, ParseContext context) void
ObjectRecognitionParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) List<? extends RecognisedObject>
ObjectRecogniser.recognise
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Recognise the objects in the stream -
Uses of Metadata in org.apache.tika.parser.recognition.tf
Modifier and TypeMethodDescriptionprotected URI
protected URI
TensorflowImageRecParser.recognise
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) TensorflowRESTRecogniser.recognise
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.sas
Modifier and TypeMethodDescriptionvoid
SAS7BDATParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.sentiment
Modifier and TypeMethodDescriptionvoid
SentimentAnalysisParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Performs the parse -
Uses of Metadata in org.apache.tika.parser.sqlite3
Modifier and TypeMethodDescriptionprotected void
SQLite3DBParser.extractMetadata
(Connection connection, Metadata metadata) protected Blob
protected Connection
SQLite3DBParser.getConnection
(InputStream stream, Metadata metadata, ParseContext context) protected String
SQLite3DBParser.getConnectionString
(InputStream is, Metadata metadata, ParseContext context) SQLite3DBParser.getTableNames
(Connection connection, Metadata metadata, ParseContext context) void
SQLite3Parser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.strings
Modifier and TypeMethodDescriptionvoid
Latin1StringsParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
StringsParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.tmx
Modifier and TypeMethodDescriptionvoid
TMXParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.transcribe.aws
Modifier and TypeMethodDescriptionvoid
AmazonTranscribe.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Starts AWS Transcribe Job with language specification. -
Uses of Metadata in org.apache.tika.parser.txt
Modifier and TypeMethodDescriptionBOMDetector.detect
(InputStream input, Metadata metadata) Icu4jEncodingDetector.detect
(InputStream input, Metadata metadata) UniversalEncodingDetector.detect
(InputStream input, Metadata metadata) void
TXTParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.video
Modifier and TypeMethodDescriptionvoid
FLVParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.wacz
Modifier and TypeMethodDescriptionvoid
WACZParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.warc
Modifier and TypeMethodDescriptionvoid
WARCParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.wordperfect
Modifier and TypeMethodDescriptionvoid
QuattroProParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
WordPerfectParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.xliff
Modifier and TypeMethodDescriptionvoid
XLIFF12Parser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
XLZParser.parse
(InputStream stream, ContentHandler baseHandler, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.parser.xml
Modifier and TypeMethodDescriptionprotected ContentHandler
DcXMLParser.getContentHandler
(ContentHandler handler, Metadata metadata, ParseContext context) protected ContentHandler
FictionBookParser.getContentHandler
(ContentHandler handler, Metadata metadata, ParseContext context) protected ContentHandler
TextAndAttributeXMLParser.getContentHandler
(ContentHandler handler, Metadata metadata, ParseContext context) protected ContentHandler
XMLParser.getContentHandler
(ContentHandler handler, Metadata metadata, ParseContext context) void
XMLParser.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) void
XMLProfiler.parse
(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) ModifierConstructorDescriptionAttributeDependantMetadataHandler
(Metadata metadata, String nameHoldingAttribute, String namePrefix) AttributeMetadataHandler
(String uri, String localName, Metadata metadata, String name) AttributeMetadataHandler
(String uri, String localName, Metadata metadata, Property property) ElementMetadataHandler
(String uri, String localName, Metadata metadata, String name) Constructor for string metadata keys.ElementMetadataHandler
(String uri, String localName, Metadata metadata, String name, boolean allowDuplicateValues, boolean allowEmptyValues) Constructor for string metadata keys which allows change of behavior for duplicate and empty entry values.ElementMetadataHandler
(String uri, String localName, Metadata metadata, Property targetProperty) Constructor for Property metadata keys.ElementMetadataHandler
(String uri, String localName, Metadata metadata, Property targetProperty, boolean allowDuplicateValues, boolean allowEmptyValues) Constructor for Property metadata keys which allows change of behavior for duplicate and empty entry values.MetadataHandler
(Metadata metadata, String name) Deprecated.MetadataHandler
(Metadata metadata, Property property) Deprecated. -
Uses of Metadata in org.apache.tika.parser.xmp
Modifier and TypeMethodDescriptionstatic void
JempboxExtractor.extractDublinCore
(org.apache.jempbox.xmp.XMPMetadata xmpMetadata, Metadata metadata) Tries to extract Dublin Core schema from XMP.static void
XMPMetadataExtractor.extractDublinCoreSchema
(org.apache.xmpbox.XMPMetadata xmp, Metadata metadata) Extracts Dublin Core.static void
XMPMetadataExtractor.extractXMPBasicSchema
(org.apache.xmpbox.XMPMetadata xmp, Metadata metadata) Extracts basic schema metadata from XMP.static void
JempboxExtractor.extractXMPMM
(org.apache.jempbox.xmp.XMPMetadata xmp, Metadata metadata) Extracts Media Management metadata from XMP.static void
XMPMetadataExtractor.parse
(InputStream stream, Metadata metadata) Parse the XMP Packets. -
Uses of Metadata in org.apache.tika.pipes
ModifierConstructorDescriptionFetchEmitTuple
(String id, FetchKey fetchKey, EmitKey emitKey, Metadata metadata) FetchEmitTuple
(String id, FetchKey fetchKey, EmitKey emitKey, Metadata metadata, ParseContext parseContext) FetchEmitTuple
(String id, FetchKey fetchKey, EmitKey emitKey, Metadata metadata, ParseContext parseContext, FetchEmitTuple.ON_PARSE_EXCEPTION onParseException) -
Uses of Metadata in org.apache.tika.pipes.emitter
Modifier and TypeMethodDescriptionvoid
StreamEmitter.emit
(String emitKey, InputStream inputStream, Metadata userMetadata, ParseContext parseContext) Modifier and TypeMethodDescriptionvoid
Emitter.emit
(String emitKey, List<Metadata> metadataList, ParseContext parseContext) void
EmptyEmitter.emit
(String emitKey, List<Metadata> metadataList, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.emitter.azblob
Modifier and TypeMethodDescriptionvoid
AZBlobEmitter.emit
(String path, InputStream is, Metadata userMetadata, ParseContext parseContext) Modifier and TypeMethodDescriptionvoid
AZBlobEmitter.emit
(String emitKey, List<Metadata> metadataList, ParseContext parseContext) Requires the src-bucket/path/to/my/file.txt in theTikaCoreProperties.SOURCE_PATH
. -
Uses of Metadata in org.apache.tika.pipes.emitter.fs
Modifier and TypeMethodDescriptionvoid
FileSystemEmitter.emit
(String path, InputStream inputStream, Metadata userMetadata, ParseContext parseContext) Modifier and TypeMethodDescriptionvoid
FileSystemEmitter.emit
(String emitKey, List<Metadata> metadataList, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.emitter.gcs
Modifier and TypeMethodDescriptionvoid
GCSEmitter.emit
(String path, InputStream is, Metadata userMetadata, ParseContext parseContext) Modifier and TypeMethodDescriptionvoid
GCSEmitter.emit
(String emitKey, List<Metadata> metadataList, ParseContext parseContext) Requires the src-bucket/path/to/my/file.txt in theTikaCoreProperties.SOURCE_PATH
. -
Uses of Metadata in org.apache.tika.pipes.emitter.jdbc
Modifier and TypeMethodDescriptionvoid
JDBCEmitter.emit
(String emitKey, List<Metadata> metadataList, ParseContext parseContext) This executes the emit with each call. -
Uses of Metadata in org.apache.tika.pipes.emitter.kafka
Modifier and TypeMethodDescriptionvoid
KafkaEmitter.emit
(String emitKey, List<Metadata> metadataList, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.emitter.opensearch
Modifier and TypeMethodDescriptionprotected static String
OpenSearchClient.metadataToJsonContainerInsert
(Metadata metadata, OpenSearchEmitter.AttachmentStrategy attachmentStrategy) protected static String
OpenSearchClient.metadataToJsonEmbeddedInsert
(Metadata metadata, OpenSearchEmitter.AttachmentStrategy attachmentStrategy, String emitKey, String embeddedFileFieldName) Modifier and TypeMethodDescriptionvoid
OpenSearchEmitter.emit
(String emitKey, List<Metadata> metadataList, ParseContext parseContext) void
OpenSearchClient.emitDocument
(String emitKey, List<Metadata> metadataList) -
Uses of Metadata in org.apache.tika.pipes.emitter.s3
Modifier and TypeMethodDescriptionvoid
S3Emitter.emit
(String path, InputStream is, Metadata userMetadata, ParseContext parseContext) Modifier and TypeMethodDescriptionvoid
S3Emitter.emit
(String emitKey, List<Metadata> metadataList, ParseContext parseContext) Requires the src-bucket/path/to/my/file.txt in theTikaCoreProperties.SOURCE_PATH
. -
Uses of Metadata in org.apache.tika.pipes.emitter.solr
Modifier and TypeMethodDescriptionvoid
SolrEmitter.emit
(String emitKey, List<Metadata> metadataList, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.extractor
Modifier and TypeMethodDescriptionvoid
EmittingEmbeddedDocumentBytesHandler.add
(int id, Metadata metadata, InputStream inputStream) -
Uses of Metadata in org.apache.tika.pipes.fetcher
Modifier and TypeMethodDescriptionEmptyFetcher.fetch
(String fetchKey, Metadata metadata, ParseContext parseContext) Fetcher.fetch
(String fetchKey, Metadata metadata, ParseContext parseContext) default InputStream
RangeFetcher.fetch
(String fetchKey, long startOffset, long endOffset, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.fetcher.azblob
Modifier and TypeMethodDescriptionAZBlobFetcher.fetch
(String fetchKey, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.fetcher.fs
Modifier and TypeMethodDescriptionFileSystemFetcher.fetch
(String fetchKey, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.fetcher.gcs
Modifier and TypeMethodDescriptionGCSFetcher.fetch
(String fetchKey, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.fetcher.http
Modifier and TypeMethodDescriptionHttpFetcher.fetch
(String fetchKey, long startRange, long endRange, Metadata metadata, ParseContext parseContext) HttpFetcher.fetch
(String fetchKey, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.fetcher.s3
Modifier and TypeMethodDescriptionS3Fetcher.fetch
(String fetchKey, long startRange, long endRange, Metadata metadata, ParseContext parseContext) S3Fetcher.fetch
(String fetchKey, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.fetcher.url
Modifier and TypeMethodDescriptionUrlFetcher.fetch
(String fetchKey, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.fetchers.microsoftgraph
Modifier and TypeMethodDescriptionMicrosoftGraphFetcher.fetch
(String fetchKey, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.pipes.reporters.opensearch
Modifier and TypeMethodDescriptionvoid
OpenSearchClient.emitDocument
(String emitKey, String routing, Metadata metadata) void
OpenSearchClient.writeDoc
(Metadata metadata, StringWriter writer) -
Uses of Metadata in org.apache.tika.renderer
Modifier and TypeMethodDescriptionCompositeRenderer.render
(InputStream is, Metadata metadata, ParseContext parseContext, RenderRequest... requests) Renderer.render
(InputStream is, Metadata metadata, ParseContext parseContext, RenderRequest... requests) ModifierConstructorDescriptionRenderResult
(RenderResult.STATUS status, int id, Object result, Metadata metadata) -
Uses of Metadata in org.apache.tika.renderer.pdf.mutool
Modifier and TypeMethodDescriptionMuPDFRenderer.render
(InputStream is, Metadata metadata, ParseContext parseContext, RenderRequest... requests) -
Uses of Metadata in org.apache.tika.renderer.pdf.pdfbox
Modifier and TypeMethodDescriptionPDFBoxRenderer.render
(InputStream is, Metadata metadata, ParseContext parseContext, RenderRequest... requests) protected RenderResult
PDFBoxRenderer.renderPage
(org.apache.pdfbox.rendering.PDFRenderer renderer, int id, int pageNumber, Metadata metadata, ParseContext parseContext) -
Uses of Metadata in org.apache.tika.sax
Modifier and TypeMethodDescriptionContentHandlerDecoratorFactory.decorate
(ContentHandler contentHandler, Metadata metadata, ParseContext parseContext) void
AbstractRecursiveParserWrapperHandler.endDocument
(ContentHandler contentHandler, Metadata metadata) This is called after the full parse has completed.void
RecursiveParserWrapperHandler.endDocument
(ContentHandler contentHandler, Metadata metadata) void
AbstractRecursiveParserWrapperHandler.endEmbeddedDocument
(ContentHandler contentHandler, Metadata metadata) This is called after parsing each embedded document.void
RecursiveParserWrapperHandler.endEmbeddedDocument
(ContentHandler contentHandler, Metadata metadata) This is called after parsing an embedded document.void
void
AbstractRecursiveParserWrapperHandler.startEmbeddedDocument
(ContentHandler contentHandler, Metadata metadata) This is called before parsing each embedded document.void
RecursiveParserWrapperHandler.startEmbeddedDocument
(ContentHandler contentHandler, Metadata metadata) This is called before parsing an embedded documentModifierConstructorDescriptionDIFContentHandler
(ContentHandler delegate, Metadata metadata) PhoneExtractingContentHandler
(ContentHandler handler, Metadata metadata) Creates a decorator for the given SAX event handler and Metadata object.StandardsExtractingContentHandler
(ContentHandler handler, Metadata metadata) Creates a decorator for the given SAX event handler and Metadata object.XHTMLContentHandler
(ContentHandler handler, Metadata metadata) -
Uses of Metadata in org.apache.tika.serialization
Modifier and TypeMethodDescriptionstatic Metadata
Read metadata from reader.static Metadata
JsonMetadata.readMetadataObject
(com.fasterxml.jackson.core.JsonParser jParser) expects that jParser has not yet started on object or for jParser to be pointing to the start object.Modifier and TypeMethodDescriptionRead metadata from reader.Modifier and TypeMethodDescriptionvoid
static void
Serializes a Metadata object to Json.static void
JsonMetadata.writeMetadataObject
(Metadata metadata, com.fasterxml.jackson.core.JsonGenerator jsonGenerator, boolean prettyPrint) -
Uses of Metadata in org.apache.tika.server.core
Modifier and TypeMethodDescriptionvoid
CompositeParseContextConfig.configure
(jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, Metadata metadata, ParseContext context) void
ParseContextConfig.configure
(jakarta.ws.rs.core.MultivaluedMap<String, String> headers, Metadata metadata, ParseContext context) Configures the parseContext with present headers.DefaultInputStreamFactory.getInputStream
(InputStream is, Metadata metadata, jakarta.ws.rs.core.HttpHeaders httpHeaders, jakarta.ws.rs.core.UriInfo uriInfo) FetcherStreamFactory.getInputStream
(InputStream is, Metadata metadata, jakarta.ws.rs.core.HttpHeaders httpHeaders, jakarta.ws.rs.core.UriInfo uriInfo) InputStreamFactory.getInputStream
(InputStream is, Metadata metadata, jakarta.ws.rs.core.HttpHeaders httpHeaders, jakarta.ws.rs.core.UriInfo uriInfo) -
Uses of Metadata in org.apache.tika.server.core.config
Modifier and TypeMethodDescriptionvoid
DocumentSelectorConfig.configure
(jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, Metadata mtadata, ParseContext context) void
PasswordProviderConfig.configure
(jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, Metadata metadata, ParseContext context) void
TimeoutConfig.configure
(jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, Metadata metadata, ParseContext context) -
Uses of Metadata in org.apache.tika.server.core.resource
Modifier and TypeMethodDescriptionTikaResource.getJson
(InputStream is, jakarta.ws.rs.core.HttpHeaders httpHeaders, jakarta.ws.rs.core.UriInfo info, String handlerTypeName) TikaResource.getJsonFromMultipart
(org.apache.cxf.jaxrs.ext.multipart.Attachment att, jakarta.ws.rs.core.HttpHeaders httpHeaders, jakarta.ws.rs.core.UriInfo info, String handlerTypeName) protected Metadata
MetadataResource.parseMetadata
(InputStream is, Metadata metadata, jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, jakarta.ws.rs.core.UriInfo info) Modifier and TypeMethodDescriptionRecursiveMetadataResource.parseMetadata
(InputStream is, Metadata metadata, jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, jakarta.ws.rs.core.UriInfo info, HandlerConfig handlerConfig) Modifier and TypeMethodDescriptionstatic void
TikaResource.fillMetadata
(Parser parser, Metadata metadata, jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders) static void
TikaResource.fillParseContext
(jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, Metadata metadata, ParseContext parseContext) static InputStream
TikaResource.getInputStream
(InputStream is, Metadata metadata, jakarta.ws.rs.core.HttpHeaders headers, jakarta.ws.rs.core.UriInfo uriInfo) static void
TikaResource.logRequest
(org.slf4j.Logger logger, String endpoint, Metadata metadata) static void
UnpackerResource.metadataToCsv
(Metadata metadata, OutputStream outputStream) static void
TikaResource.parse
(Parser parser, org.slf4j.Logger logger, String path, InputStream inputStream, ContentHandler handler, Metadata metadata, ParseContext parseContext) Use this to call a parser and unify exception handling.protected Metadata
MetadataResource.parseMetadata
(InputStream is, Metadata metadata, jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, jakarta.ws.rs.core.UriInfo info) RecursiveMetadataResource.parseMetadata
(InputStream is, Metadata metadata, jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, jakarta.ws.rs.core.UriInfo info, HandlerConfig handlerConfig) jakarta.ws.rs.core.StreamingOutput
TikaResource.produceText
(InputStream is, Metadata metadata, jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, jakarta.ws.rs.core.UriInfo info) -
Uses of Metadata in org.apache.tika.server.core.writer
Modifier and TypeMethodDescriptionlong
CSVMessageBodyWriter.getSize
(Metadata data, Class<?> type, Type genericType, Annotation[] annotations, jakarta.ws.rs.core.MediaType mediaType) long
JSONMessageBodyWriter.getSize
(Metadata data, Class<?> type, Type genericType, Annotation[] annotations, jakarta.ws.rs.core.MediaType mediaType) long
JSONObjWriter.getSize
(Metadata data, Class<?> type, Type genericType, Annotation[] annotations, jakarta.ws.rs.core.MediaType mediaType) long
TextMessageBodyWriter.getSize
(Metadata data, Class<?> type, Type genericType, Annotation[] annotations, jakarta.ws.rs.core.MediaType mediaType) void
CSVMessageBodyWriter.writeTo
(Metadata metadata, Class<?> type, Type genericType, Annotation[] annotations, jakarta.ws.rs.core.MediaType mediaType, jakarta.ws.rs.core.MultivaluedMap<String, Object> httpHeaders, OutputStream entityStream) void
JSONMessageBodyWriter.writeTo
(Metadata metadata, Class<?> type, Type genericType, Annotation[] annotations, jakarta.ws.rs.core.MediaType mediaType, jakarta.ws.rs.core.MultivaluedMap<String, Object> httpHeaders, OutputStream entityStream) void
TextMessageBodyWriter.writeTo
(Metadata metadata, Class<?> type, Type genericType, Annotation[] annotations, jakarta.ws.rs.core.MediaType mediaType, jakarta.ws.rs.core.MultivaluedMap<String, Object> httpHeaders, OutputStream entityStream) -
Uses of Metadata in org.apache.tika.server.standard.config
Modifier and TypeMethodDescriptionvoid
PDFServerConfig.configure
(jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, Metadata metadata, ParseContext parseContext) Configures the parseContext with present headers.void
TesseractServerConfig.configure
(jakarta.ws.rs.core.MultivaluedMap<String, String> httpHeaders, Metadata metadata, ParseContext parseContext) Configures the parseContext with present headers. -
Uses of Metadata in org.apache.tika.server.standard.writer
Modifier and TypeMethodDescriptionlong
XMPMessageBodyWriter.getSize
(Metadata data, Class<?> type, Type genericType, Annotation[] annotations, jakarta.ws.rs.core.MediaType mediaType) void
XMPMessageBodyWriter.writeTo
(Metadata metadata, Class<?> type, Type genericType, Annotation[] annotations, jakarta.ws.rs.core.MediaType mediaType, jakarta.ws.rs.core.MultivaluedMap<String, Object> httpHeaders, OutputStream entityStream) -
Uses of Metadata in org.apache.tika.utils
Modifier and TypeMethodDescriptionstatic Metadata
ParserUtils.cloneMetadata
(Metadata m) Does a deep clone of a Metadata object.Modifier and TypeMethodDescriptionstatic Metadata
ParserUtils.cloneMetadata
(Metadata m) Does a deep clone of a Metadata object.static InputStream
ParserUtils.ensureStreamReReadable
(InputStream stream, TemporaryResources tmp, Metadata metadata) Ensures that the Stream will be able to be re-read, by buffering to a temporary file if required.static void
ParserUtils.recordParserDetails
(String parserClassName, Metadata metadata) static void
ParserUtils.recordParserDetails
(Parser parser, Metadata metadata) static void
ParserUtils.recordParserFailure
(Parser parser, Throwable failure, Metadata metadata) -
Uses of Metadata in org.apache.tika.xmp
Modifier and TypeClassDescriptionclass
Provides a conversion of the Metadata map from Tika to the XMP data model by also providing the Metadata API for clients to ease transition.Modifier and TypeMethodDescriptionvoid
void
Converts the Metadata information to XMP.ModifierConstructorDescriptionXMPMetadata
(Metadata meta) XMPMetadata
(Metadata meta, String mimetype) Initializes the data by converting the Metadata information to XMP. -
Uses of Metadata in org.apache.tika.xmp.convert
Modifier and TypeMethodDescriptionstatic com.adobe.internal.xmp.XMPMeta
static com.adobe.internal.xmp.XMPMeta
Convert the given Tika metadata map to XMP object.abstract com.adobe.internal.xmp.XMPMeta
com.adobe.internal.xmp.XMPMeta
com.adobe.internal.xmp.XMPMeta
Converts a TikaMetadata
-object into anXMPMeta
containing the useful properties.com.adobe.internal.xmp.XMPMeta
com.adobe.internal.xmp.XMPMeta
com.adobe.internal.xmp.XMPMeta
com.adobe.internal.xmp.XMPMeta
void
AbstractConverter.setMetadata
(Metadata metadata)
TikaInputStream.get(Path, Metadata)
.