org.apache.tika.extractor
Class ParsingEmbeddedDocumentExtractor

java.lang.Object
  extended by org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
All Implemented Interfaces:
EmbeddedDocumentExtractor

public class ParsingEmbeddedDocumentExtractor
extends Object
implements EmbeddedDocumentExtractor

Helper class for parsers of package archives or other compound document formats that support embedded or attached component documents.

Since:
Apache Tika 0.8

Constructor Summary
ParsingEmbeddedDocumentExtractor(ParseContext context)
           
 
Method Summary
 void parseEmbedded(InputStream stream, ContentHandler handler, Metadata metadata, boolean outputHtml)
          Processes the supplied embedded resource, calling the delegating parser with the appropriate details.
 boolean shouldParseEmbedded(Metadata metadata)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ParsingEmbeddedDocumentExtractor

public ParsingEmbeddedDocumentExtractor(ParseContext context)
Method Detail

shouldParseEmbedded

public boolean shouldParseEmbedded(Metadata metadata)
Specified by:
shouldParseEmbedded in interface EmbeddedDocumentExtractor

parseEmbedded

public void parseEmbedded(InputStream stream,
                          ContentHandler handler,
                          Metadata metadata,
                          boolean outputHtml)
                   throws SAXException,
                          IOException
Description copied from interface: EmbeddedDocumentExtractor
Processes the supplied embedded resource, calling the delegating parser with the appropriate details.

Specified by:
parseEmbedded in interface EmbeddedDocumentExtractor
Parameters:
stream - The embedded resource
handler - The handler to use
metadata - The metadata for the embedded resource
outputHtml - Should we output HTML for this resource, or has the parser already done so?
Throws:
SAXException
IOException


Copyright © 2007-2011 The Apache Software Foundation. All Rights Reserved.