Package org.apache.tika.extractor
Class RUnpackExtractor
java.lang.Object
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
org.apache.tika.extractor.RUnpackExtractor
- All Implemented Interfaces:
EmbeddedDocumentExtractor
Recursive Unpacker and text and metadata extractor.
- Since:
- Apache Tika 3.0.0
-
Field Summary
Fields inherited from class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
context -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionvoidparseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) Processes the supplied embedded resource, calling the delegating parser with the appropriate details.voidsetEmbeddedBytesSelector(EmbeddedBytesSelector embeddedBytesSelector) Methods inherited from class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
getDelegatingParser, isWriteFileNameToContent, setWriteFileNameToContent, shouldParseEmbedded
-
Constructor Details
-
RUnpackExtractor
-
-
Method Details
-
parseEmbedded
public void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) throws SAXException, IOException Description copied from interface:EmbeddedDocumentExtractorProcesses the supplied embedded resource, calling the delegating parser with the appropriate details.- Specified by:
parseEmbeddedin interfaceEmbeddedDocumentExtractor- Overrides:
parseEmbeddedin classParsingEmbeddedDocumentExtractor- Parameters:
stream- The embedded resourcehandler- The handler to usemetadata- The metadata for the embedded resourceoutputHtml- Should we output HTML for this resource, or has the parser already done so?- Throws:
SAXExceptionIOException
-
setEmbeddedBytesSelector
-
getEmbeddedBytesSelector
-