Package org.apache.tika.extractor
Interface EmbeddedDocumentExtractor
All Known Implementing Classes:
ParsingEmbeddedDocumentExtractor, RUnpackExtractor
public interface EmbeddedDocumentExtractor
Method Summary
void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml)
Processes the supplied embedded resource, calling the delegating parser with the appropriate details.
boolean shouldParseEmbedded(Metadata metadata)
Method Detail
shouldParseEmbedded
boolean shouldParseEmbedded(Metadata metadata)
parseEmbedded
void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml) throws SAXException, IOException
Processes the supplied embedded resource, calling the delegating parser with the appropriate details.
Parameters:
stream - The embedded resource
handler - The handler to use
metadata - The metadata for the embedded resource
outputHtml - Should we output HTML for this resource, or has the parser already done so?
Throws:
SAXException
IOException
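The two methods suggest a check-then-parse call pattern: a caller asks shouldParseEmbedded whether a resource is wanted, and only then hands the stream to parseEmbedded. The following is a minimal sketch of that pattern, not Tika's own implementation. To compile without Tika on the classpath, it declares a local stand-in for Metadata and a local copy of the interface; in real code both would come from org.apache.tika. TextOnlyExtractor, EmbeddedExtractorDemo, and the "skip images" rule are hypothetical and exist only for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Stand-in for org.apache.tika.metadata.Metadata (assumption: only get/set are needed here).
class Metadata {
    private final Map<String, String> values = new HashMap<>();

    void set(String name, String value) {
        values.put(name, value);
    }

    String get(String name) {
        return values.get(name);
    }
}

// Local copy of the interface documented above, so the sketch compiles on its own.
interface EmbeddedDocumentExtractor {
    boolean shouldParseEmbedded(Metadata metadata);

    void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml)
            throws SAXException, IOException;
}

// Hypothetical implementation: declines resources whose Content-Type starts with "image/"
// and forwards everything else to the handler as character data.
class TextOnlyExtractor implements EmbeddedDocumentExtractor {
    @Override
    public boolean shouldParseEmbedded(Metadata metadata) {
        String type = metadata.get("Content-Type");
        return type == null || !type.startsWith("image/");
    }

    @Override
    public void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml)
            throws SAXException, IOException {
        // outputHtml is ignored in this sketch; a real extractor would use it to decide
        // whether to wrap the embedded content in markup.
        char[] text = new String(stream.readAllBytes(), StandardCharsets.UTF_8).toCharArray();
        handler.characters(text, 0, text.length);
    }
}

class EmbeddedExtractorDemo {
    // Calls the extractor the way a container parser might: check first, then parse.
    static String extract(EmbeddedDocumentExtractor extractor, byte[] data, String contentType)
            throws SAXException, IOException {
        StringBuilder out = new StringBuilder();
        ContentHandler handler = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) {
                out.append(ch, start, length);
            }
        };
        Metadata metadata = new Metadata();
        metadata.set("Content-Type", contentType);
        if (extractor.shouldParseEmbedded(metadata)) {
            try (InputStream stream = new ByteArrayInputStream(data)) {
                extractor.parseEmbedded(stream, handler, metadata, true);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        byte[] data = "hello".getBytes(StandardCharsets.UTF_8);
        System.out.println(extract(new TextOnlyExtractor(), data, "text/plain"));
    }
}
```

In this sketch the caller owns the stream and closes it after parseEmbedded returns; whether a given Tika implementation expects that is an assumption to verify against the implementing class.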