org.apache.tika.extractor
Interface EmbeddedDocumentExtractor

All Known Implementing Classes:
ParsingEmbeddedDocumentExtractor

public interface EmbeddedDocumentExtractor


Method Summary
 void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml)
          Processes the supplied embedded resource, calling the delegating parser with the appropriate details.
 boolean shouldParseEmbedded(Metadata metadata)
           
 

Method Detail

shouldParseEmbedded

boolean shouldParseEmbedded(Metadata metadata)

parseEmbedded

void parseEmbedded(InputStream stream,
                   ContentHandler handler,
                   Metadata metadata,
                   boolean outputHtml)
                   throws SAXException,
                          IOException
Processes the supplied embedded resource, calling the delegating parser with the appropriate details.

Parameters:
stream - The embedded resource
handler - The handler to use
metadata - The metadata for the embedded resource
outputHtml - Should we output HTML for this resource, or has the parser already done so?
Throws:
SAXException
IOException


Copyright © 2007-2011 The Apache Software Foundation. All Rights Reserved.