Package org.apache.tika.extractor
Interface EmbeddedDocumentExtractor
- All Known Implementing Classes:
ParsingEmbeddedDocumentExtractor,RUnpackExtractor
public interface EmbeddedDocumentExtractor
-
Method Summary
Modifier and TypeMethodDescriptionvoidparseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) Processes the supplied embedded resource, calling the delegating parser with the appropriate details.booleanshouldParseEmbedded(Metadata metadata)
-
Method Details
-
shouldParseEmbedded
-
parseEmbedded
void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) throws SAXException, IOException Processes the supplied embedded resource, calling the delegating parser with the appropriate details.- Parameters:
stream- The embedded resourcehandler- The handler to usemetadata- The metadata for the embedded resourceoutputHtml- Should we output HTML for this resource, or has the parser already done so?- Throws:
SAXExceptionIOException
-