Package org.apache.tika.extractor
Class RUnpackExtractor
java.lang.Object
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
org.apache.tika.extractor.RUnpackExtractor
- All Implemented Interfaces:
EmbeddedDocumentExtractor
Recursive unpacker that also extracts text and metadata.
- Since:
- Apache Tika 3.0.0
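To make the idea of recursive unpacking concrete, here is a minimal, self-contained sketch using only `java.util.zip`. It is not Tika's implementation and does not use any Tika class. The class name `RecursiveUnpackSketch`, the `maxDepth` limit, and the rule of detecting containers by a `.zip` suffix are all assumptions made for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical illustration only; not part of Apache Tika.
public class RecursiveUnpackSketch {

    // Walks a ZIP stream and records every leaf entry under its full path.
    // Entries named *.zip are treated as containers and unpacked recursively
    // while depth < maxDepth; past that limit they are stored as opaque bytes.
    public static void unpack(InputStream in, String parentPath, int depth, int maxDepth,
                              Map<String, byte[]> out) throws IOException {
        ZipInputStream zip = new ZipInputStream(in);
        ZipEntry entry;
        while ((entry = zip.getNextEntry()) != null) {
            if (entry.isDirectory()) {
                continue;
            }
            String path = parentPath + "/" + entry.getName();
            byte[] bytes = zip.readAllBytes(); // reads to the end of the current entry only
            if (entry.getName().endsWith(".zip") && depth < maxDepth) {
                unpack(new ByteArrayInputStream(bytes), path, depth + 1, maxDepth, out);
            } else {
                out.put(path, bytes);
            }
        }
    }

    // Helper for the demo: builds an in-memory ZIP from name -> content pairs.
    public static byte[] zipOf(Map<String, byte[]> files) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(buffer)) {
            for (Map.Entry<String, byte[]> file : files.entrySet()) {
                zip.putNextEntry(new ZipEntry(file.getKey()));
                zip.write(file.getValue());
                zip.closeEntry();
            }
        }
        return buffer.toByteArray();
    }

    // Builds an outer ZIP { readme.txt, inner.zip { note.txt } } and unpacks it.
    public static Map<String, byte[]> demo() {
        try {
            Map<String, byte[]> inner = new LinkedHashMap<>();
            inner.put("note.txt", "nested".getBytes(StandardCharsets.UTF_8));
            Map<String, byte[]> outer = new LinkedHashMap<>();
            outer.put("readme.txt", "top level".getBytes(StandardCharsets.UTF_8));
            outer.put("inner.zip", zipOf(inner));

            Map<String, byte[]> found = new LinkedHashMap<>();
            unpack(new ByteArrayInputStream(zipOf(outer)), "", 0, 5, found);
            return found;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        demo().forEach((path, bytes) ->
                System.out.println(path + " -> " + new String(bytes, StandardCharsets.UTF_8)));
    }
}
```

In this sketch the nested file is reported under the path `/inner.zip/note.txt`, so the container hierarchy stays visible in the result. A real extractor would detect container types from content rather than file names and would also bound the total bytes read.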
-
Field Summary
Fields inherited from class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
context
-
Constructor Summary
-
Method Summary
Modifier and Type    Method    Description
void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml)
    Processes the supplied embedded resource, calling the delegating parser with the appropriate details.
void setEmbeddedBytesSelector(EmbeddedBytesSelector embeddedBytesSelector)
Methods inherited from class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
getDelegatingParser, isWriteFileNameToContent, setWriteFileNameToContent, shouldParseEmbedded
-
Constructor Details
-
RUnpackExtractor
-
-
Method Details
-
parseEmbedded
public void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) throws SAXException, IOException
Description copied from interface: EmbeddedDocumentExtractor
Processes the supplied embedded resource, calling the delegating parser with the appropriate details.
- Specified by:
parseEmbedded in interface EmbeddedDocumentExtractor
- Overrides:
parseEmbedded in class ParsingEmbeddedDocumentExtractor
- Parameters:
stream - The embedded resource
handler - The handler to use
metadata - The metadata for the embedded resource
outputHtml - Should we output HTML for this resource, or has the parser already done so?
- Throws:
SAXException
IOException
-
setEmbeddedBytesSelector
-
getEmbeddedBytesSelector
-
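This page lists the selector setter and getter without describing what a selector does. As a rough illustration only, the sketch below assumes a bytes selector is a predicate over an embedded resource's metadata that decides whether its raw bytes should be kept. The `BytesSelector` interface, the `byContentType` factory, and the use of a plain `Map` with a `Content-Type` key are hypothetical stand-ins, not the Tika types or signatures.

```java
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; these are not Apache Tika types.
public class BytesSelectorSketch {

    // Stand-in for a selector: returns true if the embedded
    // resource's raw bytes should be kept.
    @FunctionalInterface
    public interface BytesSelector {
        boolean select(Map<String, String> metadata);
    }

    // Keeps the bytes of every embedded resource.
    public static final BytesSelector ACCEPT_ALL = metadata -> true;

    // Keeps only resources whose "Content-Type" value is in the allowed set;
    // resources with no content type are rejected.
    public static BytesSelector byContentType(Set<String> allowed) {
        return metadata -> {
            String type = metadata.get("Content-Type");
            return type != null && allowed.contains(type);
        };
    }

    public static void main(String[] args) {
        BytesSelector imagesOnly = byContentType(Set.of("image/png", "image/jpeg"));
        System.out.println(imagesOnly.select(Map.of("Content-Type", "image/png")));
        System.out.println(imagesOnly.select(Map.of("Content-Type", "text/plain")));
    }
}
```

Modelling the selector as a single-method interface keeps it usable as a lambda, which is convenient when the filtering rule is a one-line check.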