Package org.apache.tika.extractor
Class RUnpackExtractor
java.lang.Object
  org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
    org.apache.tika.extractor.RUnpackExtractor
All Implemented Interfaces:
EmbeddedDocumentExtractor
public class RUnpackExtractor extends ParsingEmbeddedDocumentExtractor
Recursive unpacker and extractor of text and metadata.
Since:
Apache Tika 3.0.0
Field Summary
Fields inherited from class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
context
Constructor Summary
RUnpackExtractor(ParseContext context, long maxEmbeddedBytesForExtraction)
Method Summary
EmbeddedBytesSelector getEmbeddedBytesSelector()
void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml)
    Processes the supplied embedded resource, calling the delegating parser with the appropriate details.
void setEmbeddedBytesSelector(EmbeddedBytesSelector embeddedBytesSelector)
Methods inherited from class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
getDelegatingParser, isWriteFileNameToContent, setWriteFileNameToContent, shouldParseEmbedded
Constructor Detail
RUnpackExtractor
public RUnpackExtractor(ParseContext context, long maxEmbeddedBytesForExtraction)
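This page does not describe the `maxEmbeddedBytesForExtraction` argument. Its name suggests an upper bound on how many bytes of an embedded resource are extracted, but that reading is an assumption. The sketch below is not Tika code and makes no claim about how RUnpackExtractor applies the limit; it only shows, using the standard library, one way a byte cap on a stream can work. The class `BoundedCopySketch` and the method `copyUpTo` are hypothetical names introduced for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration only; not part of Apache Tika.
final class BoundedCopySketch {

    // Reads at most maxBytes from the stream and returns the bytes read.
    // Anything beyond the cap is left unread in the stream.
    static byte[] copyUpTo(InputStream in, long maxBytes) throws IOException {
        if (maxBytes < 0) {
            throw new IllegalArgumentException("maxBytes must be >= 0");
        }
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buffer = new byte[4096];
        long remaining = maxBytes;
        while (remaining > 0) {
            // Never request more than the remaining budget.
            int toRead = (int) Math.min(buffer.length, remaining);
            int n = in.read(buffer, 0, toRead);
            if (n == -1) {
                break; // end of stream reached before the cap
            }
            out.write(buffer, 0, n);
            remaining -= n;
        }
        return out.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        byte[] data = "embedded resource bytes".getBytes(StandardCharsets.US_ASCII);
        byte[] capped = copyUpTo(new ByteArrayInputStream(data), 8);
        System.out.println(new String(capped, StandardCharsets.US_ASCII));
    }
}
```

Whether a real extractor truncates, skips, or rejects a resource that exceeds its limit is a design choice this sketch does not settle; it simply stops reading at the cap.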
Method Detail
parseEmbedded
public void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) throws SAXException, IOException
Description copied from interface: EmbeddedDocumentExtractor
Processes the supplied embedded resource, calling the delegating parser with the appropriate details.
Specified by:
parseEmbedded in interface EmbeddedDocumentExtractor
Overrides:
parseEmbedded in class ParsingEmbeddedDocumentExtractor
Parameters:
stream - The embedded resource
handler - The handler to use
metadata - The metadata for the embedded resource
outputHtml - Should we output HTML for this resource, or has the parser already done so?
Throws:
SAXException
IOException
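The class is described as a recursive unpacker, and parseEmbedded is the method that processes each embedded resource. As a general illustration of what recursive unpacking of nested containers involves, the following standard-library sketch walks a zip archive and descends into any entry that is itself a zip. It is not Tika code and does not reflect how RUnpackExtractor is implemented: the class `RecursiveUnpackSketch`, its methods, the `.zip` file-name check, and the `maxDepth` guard are all assumptions made for this example.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical illustration only; not part of Apache Tika.
final class RecursiveUnpackSketch {

    // Walks a zip stream and records the path of every non-container entry.
    // Entries whose names end in ".zip" are treated as nested containers
    // and unpacked recursively, up to maxDepth levels of nesting.
    static void unpack(InputStream in, String prefix, int depth, int maxDepth,
                       List<String> leaves) throws IOException {
        ZipInputStream zin = new ZipInputStream(in);
        ZipEntry entry;
        while ((entry = zin.getNextEntry()) != null) {
            if (entry.isDirectory()) {
                continue;
            }
            String path = prefix + entry.getName();
            byte[] bytes = zin.readAllBytes(); // reads the current entry only
            if (entry.getName().endsWith(".zip") && depth < maxDepth) {
                unpack(new ByteArrayInputStream(bytes), path + "!/",
                        depth + 1, maxDepth, leaves);
            } else {
                leaves.add(path);
            }
        }
    }

    // Builds an in-memory zip archive from name -> content pairs.
    static byte[] zip(Map<String, byte[]> entries) throws IOException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (ZipOutputStream zos = new ZipOutputStream(bos)) {
            for (Map.Entry<String, byte[]> e : entries.entrySet()) {
                zos.putNextEntry(new ZipEntry(e.getKey()));
                zos.write(e.getValue());
                zos.closeEntry();
            }
        }
        return bos.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        Map<String, byte[]> inner = new LinkedHashMap<>();
        inner.put("a.txt", "inner text".getBytes(StandardCharsets.UTF_8));

        Map<String, byte[]> outer = new LinkedHashMap<>();
        outer.put("inner.zip", zip(inner));
        outer.put("b.txt", "outer text".getBytes(StandardCharsets.UTF_8));

        List<String> leaves = new ArrayList<>();
        unpack(new ByteArrayInputStream(zip(outer)), "", 0, 10, leaves);
        System.out.println(leaves);
    }
}
```

The depth guard is included because recursive unpacking of untrusted input can otherwise descend without bound; the value 10 in `main` is arbitrary.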
setEmbeddedBytesSelector
public void setEmbeddedBytesSelector(EmbeddedBytesSelector embeddedBytesSelector)
getEmbeddedBytesSelector
public EmbeddedBytesSelector getEmbeddedBytesSelector()