Package org.apache.tika.extractor

Interface Summary
ContainerExtractor       Interface for extracting the resources embedded in a container document.
DocumentSelector         Interface for document selection strategies, used for purposes such as embedded document extraction by a ContainerExtractor instance.
EmbeddedResourceHandler  Callback interface that a ContainerExtractor invokes for each embedded resource it extracts.
 

Class Summary
EmbeddedDocumentExtractor  Helper class for parsers of package archives or other compound document formats that support embedded or attached component documents.
ParserContainerExtractor   An implementation of ContainerExtractor powered by the regular Parser classes.
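
The types summarized above divide the work three ways: an extractor walks a container, a selector decides which embedded documents are of interest, and a handler receives each extracted resource through a callback. The following is a minimal sketch of that pattern using only the JDK, with a ZIP archive standing in for the container. The names EntrySelector, EntryHandler, extract, sampleZip and selectedNames are hypothetical stand-ins invented for this illustration; they are not Tika's interfaces, and their signatures are assumptions rather than the package's actual API.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

class ContainerCallbackSketch {

    /** Hypothetical stand-in for a selection strategy (not Tika's DocumentSelector). */
    interface EntrySelector {
        boolean select(String name);
    }

    /** Hypothetical stand-in for an extraction callback (not Tika's EmbeddedResourceHandler). */
    interface EntryHandler {
        void handle(String name, InputStream content) throws IOException;
    }

    /** Walks a ZIP container and passes each selected entry to the handler. */
    static void extract(InputStream container, EntrySelector selector, EntryHandler handler)
            throws IOException {
        try (ZipInputStream zip = new ZipInputStream(container)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (!entry.isDirectory() && selector.select(entry.getName())) {
                    // The stream is positioned at this entry's data and reports
                    // end-of-stream at the end of the entry. The handler must not close it.
                    handler.handle(entry.getName(), zip);
                }
            }
        }
    }

    /** Builds a small in-memory ZIP archive to act as the container document. */
    static byte[] sampleZip() throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            for (String name : new String[] {"notes.txt", "image.png", "readme.txt"}) {
                zip.putNextEntry(new ZipEntry(name));
                zip.write(("content of " + name).getBytes(StandardCharsets.UTF_8));
                zip.closeEntry();
            }
        }
        return bytes.toByteArray();
    }

    /** Runs the extractor with a ".txt only" selector and records the names the handler sees. */
    static List<String> selectedNames() {
        List<String> seen = new ArrayList<>();
        try {
            extract(new ByteArrayInputStream(sampleZip()),
                    name -> name.endsWith(".txt"),
                    (name, content) -> seen.add(name));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return seen;
    }

    public static void main(String[] args) {
        System.out.println(selectedNames()); // prints [notes.txt, readme.txt]
    }
}
```

Keeping the selector and the handler as separate interfaces lets the same traversal code serve different callers: one may filter by name and collect text, another may accept everything and only count entries.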
 



Copyright © 2007-2010 The Apache Software Foundation. All Rights Reserved.