org.apache.tika.extractor
Interface EmbeddedDocumentExtractor
- All Known Implementing Classes:
- ParsingEmbeddedDocumentExtractor
public interface EmbeddedDocumentExtractor
shouldParseEmbedded
boolean shouldParseEmbedded(Metadata metadata)
parseEmbedded
void parseEmbedded(InputStream stream,
                   ContentHandler handler,
                   Metadata metadata,
                   boolean outputHtml)
                   throws SAXException,
                          IOException
- Processes the supplied embedded resource, calling the delegating
parser with the appropriate details.
- Parameters:
stream - The embedded resource
handler - The handler to use
metadata - The metadata for the embedded resource
outputHtml - Should we output HTML for this resource, or has the parser already done so?
- Throws:
SAXException
IOException
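The following is a minimal sketch of how the two methods above might be implemented together, not the behaviour of ParsingEmbeddedDocumentExtractor. To keep it compilable without Tika on the classpath, it declares a simplified stand-in for Metadata and a local copy of the interface; with Tika available, those two declarations would be dropped in favour of the real types. The class names PlainTextEmbeddedExtractor and ExtractorDemo, the "resourceName" key, the ".txt" filter, and the reading of outputHtml as "wrap the content in a div" are all assumptions made for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

import org.xml.sax.Attributes;
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.AttributesImpl;
import org.xml.sax.helpers.DefaultHandler;

// Simplified stand-in for org.apache.tika.metadata.Metadata (assumption:
// only string get/set is needed for this sketch).
class Metadata {
    private final Map<String, String> values = new HashMap<>();
    void set(String name, String value) { values.put(name, value); }
    String get(String name) { return values.get(name); }
}

// Local copy of the two method signatures documented above.
interface EmbeddedDocumentExtractor {
    boolean shouldParseEmbedded(Metadata metadata);
    void parseEmbedded(InputStream stream, ContentHandler handler,
                       Metadata metadata, boolean outputHtml)
            throws SAXException, IOException;
}

// Hypothetical extractor: accepts only ".txt" resources and forwards
// their bytes to the handler as character events.
class PlainTextEmbeddedExtractor implements EmbeddedDocumentExtractor {
    // Assumed metadata key holding the embedded resource's file name.
    static final String NAME_KEY = "resourceName";

    @Override
    public boolean shouldParseEmbedded(Metadata metadata) {
        String name = metadata.get(NAME_KEY);
        return name != null && name.endsWith(".txt");
    }

    @Override
    public void parseEmbedded(InputStream stream, ContentHandler handler,
                              Metadata metadata, boolean outputHtml)
            throws SAXException, IOException {
        // Read the whole embedded stream into memory (fine for a sketch).
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        byte[] chunk = new byte[4096];
        int read;
        while ((read = stream.read(chunk)) != -1) {
            buffer.write(chunk, 0, read);
        }
        char[] text = new String(buffer.toByteArray(), StandardCharsets.UTF_8).toCharArray();

        // Assumed reading of outputHtml: emit wrapping markup only when
        // the caller has not already done so.
        if (outputHtml) {
            handler.startElement("", "div", "div", new AttributesImpl());
        }
        handler.characters(text, 0, text.length);
        if (outputHtml) {
            handler.endElement("", "div", "div");
        }
    }
}

class ExtractorDemo {
    // Runs the extractor over an in-memory resource and returns the events
    // it produced as a string, or null if the resource is declined.
    static String extract(String name, String body, boolean outputHtml) {
        Metadata metadata = new Metadata();
        metadata.set(PlainTextEmbeddedExtractor.NAME_KEY, name);
        EmbeddedDocumentExtractor extractor = new PlainTextEmbeddedExtractor();
        if (!extractor.shouldParseEmbedded(metadata)) {
            return null;
        }
        final StringBuilder out = new StringBuilder();
        ContentHandler handler = new DefaultHandler() {
            @Override
            public void startElement(String uri, String local, String qName, Attributes atts) {
                out.append('<').append(qName).append('>');
            }
            @Override
            public void endElement(String uri, String local, String qName) {
                out.append("</").append(qName).append('>');
            }
            @Override
            public void characters(char[] ch, int start, int length) {
                out.append(ch, start, length);
            }
        };
        try (InputStream in = new ByteArrayInputStream(body.getBytes(StandardCharsets.UTF_8))) {
            extractor.parseEmbedded(in, handler, metadata, outputHtml);
        } catch (SAXException | IOException e) {
            throw new IllegalStateException(e);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(extract("note.txt", "hello", true));
    }
}
```

In this sketch the caller checks shouldParseEmbedded first and only then calls parseEmbedded, so a declined resource is never read. Whether a real caller follows the same order is not stated on this page and should be confirmed against the parser that invokes the extractor.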
Copyright © 2007-2011 The Apache Software Foundation. All Rights Reserved.