|
||||||||||
PREV NEXT | FRAMES NO FRAMES |
Uses of Metadata in org.apache.tika |
---|
Methods in org.apache.tika with parameters of type Metadata | |
---|---|
java.lang.String |
Tika.detect(java.io.InputStream stream,
Metadata metadata)
Detects the media type of the given document. |
java.io.Reader |
Tika.parse(java.io.InputStream stream,
Metadata metadata)
Parses the given document and returns the extracted text content. |
java.lang.String |
Tika.parseToString(java.io.InputStream stream,
Metadata metadata)
Parses the given document and returns the extracted text content. |
Uses of Metadata in org.apache.tika.detect |
---|
Methods in org.apache.tika.detect with parameters of type Metadata | |
---|---|
MediaType |
TypeDetector.detect(java.io.InputStream input,
Metadata metadata)
Detects the content type of an input document based on a type hint given in the input metadata. |
MediaType |
TextDetector.detect(java.io.InputStream input,
Metadata metadata)
Looks at the beginning of the document input stream to determine whether the document is text or not. |
MediaType |
NameDetector.detect(java.io.InputStream input,
Metadata metadata)
Detects the content type of an input document based on the document name given in the input metadata. |
MediaType |
MagicDetector.detect(java.io.InputStream input,
Metadata metadata)
|
MediaType |
Detector.detect(java.io.InputStream input,
Metadata metadata)
Detects the content type of the given input document. |
MediaType |
CompositeDetector.detect(java.io.InputStream input,
Metadata metadata)
|
Uses of Metadata in org.apache.tika.mime |
---|
Methods in org.apache.tika.mime with parameters of type Metadata | |
---|---|
MediaType |
MimeTypes.detect(java.io.InputStream input,
Metadata metadata)
Automatically detects the MIME type of a document based on magic markers in the stream prefix and any given metadata hints. |
Uses of Metadata in org.apache.tika.parser |
---|
Methods in org.apache.tika.parser with parameters of type Metadata | |
---|---|
protected Parser |
CompositeParser.getParser(Metadata metadata)
Returns the parser that best matches the given metadata. |
void |
ParserDecorator.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
Parser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
ExternalParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
ErrorParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
EmptyParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
DelegatingParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
CompositeParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
AutoDetectParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
|
void |
ParserPostProcessor.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Forwards the call to the delegated parser and post-processes the results as described above. |
void |
ParserDecorator.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Delegates the method call to the decorated parser. |
void |
Parser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses a document stream into a sequence of XHTML SAX events. |
void |
ExternalParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Executes the configured external command and passes the given document stream as a simple XHTML document to the given SAX content handler. |
void |
ErrorParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
void |
EmptyParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
void |
DelegatingParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Looks up the delegate parser from the parsing context and delegates the parse operation to it. |
void |
CompositeParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Delegates the call to the matching component parser. |
void |
AutoDetectParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Constructors in org.apache.tika.parser with parameters of type Metadata | |
---|---|
ParsingReader(Parser parser,
java.io.InputStream stream,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0 |
|
ParsingReader(Parser parser,
java.io.InputStream stream,
Metadata metadata,
java.util.concurrent.Executor executor)
Deprecated. This method will be removed in Apache Tika 1.0 |
|
ParsingReader(Parser parser,
java.io.InputStream stream,
Metadata metadata,
ParseContext context)
Creates a reader for the text content of the given binary stream with the given document metadata. |
|
ParsingReader(Parser parser,
java.io.InputStream stream,
Metadata metadata,
ParseContext context,
java.util.concurrent.Executor executor)
Creates a reader for the text content of the given binary stream with the given document metadata. |
Uses of Metadata in org.apache.tika.parser.asm |
---|
Methods in org.apache.tika.parser.asm with parameters of type Metadata | |
---|---|
void |
ClassParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
ClassParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.audio |
---|
Methods in org.apache.tika.parser.audio with parameters of type Metadata | |
---|---|
void |
MidiParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
AudioParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
MidiParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
void |
AudioParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.epub |
---|
Methods in org.apache.tika.parser.epub with parameters of type Metadata | |
---|---|
void |
EpubParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
EpubContentParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
EpubParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
void |
EpubContentParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.html |
---|
Methods in org.apache.tika.parser.html with parameters of type Metadata | |
---|---|
void |
HtmlParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
HtmlParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.image |
---|
Methods in org.apache.tika.parser.image with parameters of type Metadata | |
---|---|
void |
ImageParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
ImageParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.jpeg |
---|
Methods in org.apache.tika.parser.jpeg with parameters of type Metadata | |
---|---|
void |
JpegParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
JpegParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.mbox |
---|
Methods in org.apache.tika.parser.mbox with parameters of type Metadata | |
---|---|
void |
MboxParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
|
void |
MboxParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.microsoft |
---|
Methods in org.apache.tika.parser.microsoft with parameters of type Metadata | |
---|---|
void |
OfficeParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
OfficeParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Extracts properties and text from an MS Document input stream |
Uses of Metadata in org.apache.tika.parser.microsoft.ooxml |
---|
Methods in org.apache.tika.parser.microsoft.ooxml with parameters of type Metadata | |
---|---|
void |
MetadataExtractor.extract(Metadata metadata)
|
void |
OOXMLExtractor.getXHTML(org.xml.sax.ContentHandler handler,
Metadata metadata)
Parses the document into a sequence of XHTML SAX events sent to the given content handler. |
void |
AbstractOOXMLExtractor.getXHTML(org.xml.sax.ContentHandler handler,
Metadata metadata)
|
void |
OOXMLParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
OOXMLParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.mp3 |
---|
Methods in org.apache.tika.parser.mp3 with parameters of type Metadata | |
---|---|
void |
Mp3Parser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
Mp3Parser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.odf |
---|
Methods in org.apache.tika.parser.odf with parameters of type Metadata | |
---|---|
protected org.xml.sax.ContentHandler |
OpenDocumentMetaParser.getContentHandler(org.xml.sax.ContentHandler ch,
Metadata md)
|
void |
OpenDocumentParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
OpenDocumentContentParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
OpenDocumentParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
void |
OpenDocumentContentParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.pdf |
---|
Methods in org.apache.tika.parser.pdf with parameters of type Metadata | |
---|---|
void |
PDFParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
PDFParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.pkg |
---|
Methods in org.apache.tika.parser.pkg with parameters of type Metadata | |
---|---|
void |
ZipParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses the given stream as a Zip file. |
void |
TarParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses the given stream as a tar file. |
void |
GzipParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses the given stream as a gzip file. |
void |
CpioParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses the given stream as a cpio file. |
void |
Bzip2Parser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses the given stream as a bzip2 file. |
void |
ArParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses the given stream as an ar archive. |
protected void |
PackageParser.parseArchive(org.apache.commons.compress.archivers.ArchiveInputStream archive,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses the given stream as a package of multiple underlying files. |
Uses of Metadata in org.apache.tika.parser.rtf |
---|
Methods in org.apache.tika.parser.rtf with parameters of type Metadata | |
---|---|
void |
RTFParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
RTFParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.txt |
---|
Methods in org.apache.tika.parser.txt with parameters of type Metadata | |
---|---|
void |
TXTParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
TXTParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Uses of Metadata in org.apache.tika.parser.xml |
---|
Methods in org.apache.tika.parser.xml with parameters of type Metadata | |
---|---|
protected org.xml.sax.ContentHandler |
XMLParser.getContentHandler(org.xml.sax.ContentHandler handler,
Metadata metadata)
|
protected org.xml.sax.ContentHandler |
DcXMLParser.getContentHandler(org.xml.sax.ContentHandler ch,
Metadata md)
|
void |
XMLParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata)
Deprecated. This method will be removed in Apache Tika 1.0. |
void |
XMLParser.parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
|
Constructors in org.apache.tika.parser.xml with parameters of type Metadata | |
---|---|
MetadataHandler(Metadata metadata,
java.lang.String name)
|
Uses of Metadata in org.apache.tika.sax |
---|
Constructors in org.apache.tika.sax with parameters of type Metadata | |
---|---|
XHTMLContentHandler(org.xml.sax.ContentHandler handler,
Metadata metadata)
|
|
||||||||||
PREV NEXT | FRAMES NO FRAMES |