Package org.apache.tika.extractor

Interface Summary
ContainerExtractor Tika container extractor interface.
DocumentSelector Interface for different document selection strategies for purposes like embedded document extraction by a ContainerExtractor instance.
EmbeddedDocumentExtractor  
EmbeddedResourceHandler Tika container extractor callback interface.
 

Class Summary
ParserContainerExtractor An implementation of ContainerExtractor powered by the regular Parser API.
ParsingEmbeddedDocumentExtractor Helper class for parsers of package archives or other compound document formats that support embedded or attached component documents.
 



Copyright © 2007-2011 The Apache Software Foundation. All Rights Reserved.