Package org.apache.tika.extractor

Extraction of component documents.


Interface Summary
ContainerExtractor         Tika container extractor interface.
DocumentSelector           Interface for document selection strategies, used for purposes such as embedded document extraction by a ContainerExtractor instance.
EmbeddedDocumentExtractor
EmbeddedResourceHandler    Tika container extractor callback interface.
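
The interface names above suggest a callback-style design: an extractor walks a container, a selector decides which embedded documents to process, and a handler receives each selected one. The following is a minimal, self-contained Java sketch of that pattern only. It deliberately does not use Tika's classes: ResourceHandler, Selector, Entry, and extract are hypothetical stand-ins invented for illustration, and their signatures are assumptions, not the actual org.apache.tika.extractor API.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class ContainerExtractionSketch {

    /** Hypothetical callback that receives each selected embedded resource. */
    interface ResourceHandler {
        void handle(String name, String mediaType, byte[] content);
    }

    /** Hypothetical selection strategy deciding which resources to extract. */
    interface Selector {
        boolean select(String name, String mediaType);
    }

    /** Toy embedded resource: a media type plus raw bytes. */
    static final class Entry {
        final String mediaType;
        final byte[] content;

        Entry(String mediaType, byte[] content) {
            this.mediaType = mediaType;
            this.content = content;
        }
    }

    /** Walks the toy container and passes every selected entry to the handler. */
    static void extract(Map<String, Entry> container, Selector selector, ResourceHandler handler) {
        for (Map.Entry<String, Entry> e : container.entrySet()) {
            Entry entry = e.getValue();
            if (selector.select(e.getKey(), entry.mediaType)) {
                handler.handle(e.getKey(), entry.mediaType, entry.content);
            }
        }
    }

    /** Builds a two-entry container and extracts only the text entries. */
    static List<String> demo() {
        Map<String, Entry> container = new LinkedHashMap<>();
        container.put("notes.txt", new Entry("text/plain", "hello".getBytes(StandardCharsets.UTF_8)));
        container.put("logo.png", new Entry("image/png", new byte[] {1, 2, 3}));

        List<String> seen = new ArrayList<>();
        extract(container,
                (name, type) -> type.startsWith("text/"),                   // selector: text only
                (name, type, content) -> seen.add(name + ":" + content.length)); // handler: record name and size
        return seen;
    }

    public static void main(String[] args) {
        System.out.println(demo()); // prints [notes.txt:5]
    }
}
```

Keeping selection and handling as separate interfaces lets the same traversal serve different filters and different consumers. For the real method signatures, consult the Javadoc of each interface listed in this package.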

Class Summary
ParserContainerExtractor            An implementation of ContainerExtractor powered by the regular Parser API.
ParsingEmbeddedDocumentExtractor    Helper class for parsers of package archives or other compound document formats that support embedded or attached component documents.




Copyright © 2007-2012 The Apache Software Foundation. All Rights Reserved.