Interface | Description |
---|---|
ContainerExtractor |
Tika container extractor interface.
|
DocumentSelector |
Interface for different document selection strategies for purposes like
embedded document extraction by a
ContainerExtractor instance. |
EmbeddedDocumentExtractor | |
EmbeddedResourceHandler |
Tika container extractor callback interface.
|
Class | Description |
---|---|
ParserContainerExtractor |
An implementation of
ContainerExtractor powered by the regular
Parser API. |
ParsingEmbeddedDocumentExtractor |
Helper class for parsers of package archives or other compound document
formats that support embedded or attached component documents.
|
Copyright © 2007-2013 The Apache Software Foundation. All Rights Reserved.