Package org.apache.tika.extractor
Class ParsingEmbeddedDocumentExtractor
java.lang.Object
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
- All Implemented Interfaces:
- EmbeddedDocumentExtractor
- Direct Known Subclasses:
- RUnpackExtractor
Helper class for parsers of package archives or other compound document
 formats that support embedded or attached component documents.
- Since:
- Apache Tika 0.8
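
The hand-off between a container parser and an embedded document extractor can be sketched as follows. This is a minimal, self-contained model and not Tika's own code: `Metadata` here is a hypothetical map-backed stand-in for Tika's class, the nested `EmbeddedDocumentExtractor` interface only mirrors the two method signatures listed on this page, and the `skip` and `name` keys are invented for illustration. It assumes the usual calling pattern, in which the container parser asks `shouldParseEmbedded` first and calls `parseEmbedded` only when the answer is true.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

class EmbeddedExtractorSketch {

    /** Hypothetical stand-in for Tika's Metadata: a plain name-to-value map. */
    static class Metadata {
        private final Map<String, String> values = new LinkedHashMap<>();
        void set(String name, String value) { values.put(name, value); }
        String get(String name) { return values.get(name); }
    }

    /** Mirrors the two interface methods documented on this page. */
    interface EmbeddedDocumentExtractor {
        boolean shouldParseEmbedded(Metadata metadata);

        void parseEmbedded(InputStream stream, ContentHandler handler,
                           Metadata metadata, boolean outputHtml)
                throws SAXException, IOException;
    }

    /** Toy extractor: skips entries flagged "skip" and records what it parsed. */
    static class RecordingExtractor implements EmbeddedDocumentExtractor {
        final List<String> parsed = new ArrayList<>();

        @Override
        public boolean shouldParseEmbedded(Metadata metadata) {
            return !"true".equals(metadata.get("skip"));
        }

        @Override
        public void parseEmbedded(InputStream stream, ContentHandler handler,
                                  Metadata metadata, boolean outputHtml)
                throws SAXException, IOException {
            String text = new String(stream.readAllBytes(), StandardCharsets.UTF_8);
            parsed.add(metadata.get("name") + "=" + text);
            // Forward the embedded text to the caller's SAX handler.
            handler.characters(text.toCharArray(), 0, text.length());
        }
    }

    /** Plays the role of a container parser walking its embedded entries. */
    static List<String> run() throws SAXException, IOException {
        RecordingExtractor extractor = new RecordingExtractor();
        ContentHandler handler = new DefaultHandler();
        // Each row: entry name, entry body, skip flag (all invented sample data).
        String[][] entries = { {"a.txt", "alpha", "false"}, {"b.bin", "beta", "true"} };
        for (String[] entry : entries) {
            Metadata metadata = new Metadata();
            metadata.set("name", entry[0]);
            metadata.set("skip", entry[2]);
            if (extractor.shouldParseEmbedded(metadata)) {
                byte[] body = entry[1].getBytes(StandardCharsets.UTF_8);
                try (InputStream stream = new ByteArrayInputStream(body)) {
                    extractor.parseEmbedded(stream, handler, metadata, true);
                }
            }
        }
        return extractor.parsed;
    }

    public static void main(String[] args) throws Exception {
        System.out.println(run());
    }
}
```

In this sketch only the first entry reaches `parseEmbedded`, because the second is rejected by `shouldParseEmbedded`. Keeping the check separate from the parse call lets a caller avoid opening a stream for entries that will not be processed.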

Field Summary
- context

Constructor Summary
- ParsingEmbeddedDocumentExtractor

Method Summary
Modifier and Type, Method, Description
- getDelegatingParser
- boolean isWriteFileNameToContent()
- void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) - Processes the supplied embedded resource, calling the delegating parser with the appropriate details.
- void setWriteFileNameToContent(boolean writeFileNameToContent)
- boolean shouldParseEmbedded(Metadata metadata)

Field Details
context

Constructor Details
ParsingEmbeddedDocumentExtractor

Method Details
shouldParseEmbedded
- Specified by:
- shouldParseEmbedded in interface EmbeddedDocumentExtractor

parseEmbedded
public void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) throws SAXException, IOException
Description copied from interface: EmbeddedDocumentExtractor
Processes the supplied embedded resource, calling the delegating parser with the appropriate details.
- Specified by:
- parseEmbedded in interface EmbeddedDocumentExtractor
- Parameters:
- stream - The embedded resource
- handler - The handler to use
- metadata - The metadata for the embedded resource
- outputHtml - Should we output HTML for this resource, or has the parser already done so?
- Throws:
- SAXException
- IOException
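
To make the `outputHtml` parameter concrete, here is a small sketch of one plausible reading of it: when the flag is true the method wraps the embedded content in an element of its own, and when it is false it assumes the caller has already emitted that markup. The `div` wrapper, the `TraceHandler` class, and the use of a `Map` in place of Tika's `Metadata` are all assumptions made for illustration. The real class may emit different markup, and it hands the stream to its delegating parser rather than reading it as text.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Map;

import org.xml.sax.Attributes;
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.AttributesImpl;
import org.xml.sax.helpers.DefaultHandler;

class ParseEmbeddedSketch {

    /** Records SAX events as a flat string so the effect of outputHtml is visible. */
    static class TraceHandler extends DefaultHandler {
        final StringBuilder trace = new StringBuilder();

        @Override
        public void startElement(String uri, String localName, String qName, Attributes atts) {
            trace.append('<').append(qName).append('>');
        }

        @Override
        public void endElement(String uri, String localName, String qName) {
            trace.append("</").append(qName).append('>');
        }

        @Override
        public void characters(char[] ch, int start, int length) {
            trace.append(ch, start, length);
        }
    }

    /**
     * Toy version of parseEmbedded. The metadata map is a hypothetical stand-in
     * for Tika's Metadata and is not consulted in this sketch.
     */
    static void parseEmbedded(InputStream stream, ContentHandler handler,
                              Map<String, String> metadata, boolean outputHtml)
            throws SAXException, IOException {
        if (outputHtml) {
            // Assumed behavior: open a wrapper element for this resource.
            handler.startElement("", "div", "div", new AttributesImpl());
        }
        String text = new String(stream.readAllBytes(), StandardCharsets.UTF_8);
        handler.characters(text.toCharArray(), 0, text.length());
        if (outputHtml) {
            handler.endElement("", "div", "div");
        }
    }

    /** Runs the toy method once and returns the recorded SAX events. */
    static String trace(boolean outputHtml) throws SAXException, IOException {
        TraceHandler handler = new TraceHandler();
        byte[] body = "hello".getBytes(StandardCharsets.UTF_8);
        try (InputStream stream = new ByteArrayInputStream(body)) {
            parseEmbedded(stream, handler, Map.of("name", "note.txt"), outputHtml);
        }
        return handler.trace.toString();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(trace(true));
        System.out.println(trace(false));
    }
}
```

Both checked exceptions in the signature surface naturally here: reading the stream can throw `IOException`, and any `ContentHandler` call can throw `SAXException`, so callers need to handle or propagate both.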
 
getDelegatingParser

setWriteFileNameToContent
public void setWriteFileNameToContent(boolean writeFileNameToContent)

isWriteFileNameToContent
public boolean isWriteFileNameToContent()