Package org.apache.tika.extractor
Interface EmbeddedDocumentExtractor
- All Known Implementing Classes:
ParsingEmbeddedDocumentExtractor
,RUnpackExtractor
public interface EmbeddedDocumentExtractor
-
Method Summary
Modifier and TypeMethodDescriptionvoid
parseEmbedded
(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) Processes the supplied embedded resource, calling the delegating parser with the appropriate details.boolean
shouldParseEmbedded
(Metadata metadata)
-
Method Details
-
shouldParseEmbedded
-
parseEmbedded
void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) throws SAXException, IOException Processes the supplied embedded resource, calling the delegating parser with the appropriate details.- Parameters:
stream
- The embedded resourcehandler
- The handler to usemetadata
- The metadata for the embedded resourceoutputHtml
- Should we output HTML for this resource, or has the parser already done so?- Throws:
SAXException
IOException
-