Interface | Description |
---|---|
ContainerExtractor | Tika container extractor interface. |
DocumentSelector | Interface for different document selection strategies for purposes like embedded document extraction by a ContainerExtractor instance. |
EmbeddedDocumentExtractor | |
EmbeddedResourceHandler | Tika container extractor callback interface. |

Class | Description |
---|---|
EmbeddedDocumentUtil | Utility class to handle common issues with embedded documents. |
ParserContainerExtractor | An implementation of ContainerExtractor powered by the regular Parser API. |
ParsingEmbeddedDocumentExtractor | Helper class for parsers of package archives or other compound document formats that support embedded or attached component documents. |
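
The idea of a document selection strategy, as described for DocumentSelector, can be illustrated with a minimal, self-contained sketch. It deliberately does not use the Tika classes: the `Selector` interface, the `byContentTypePrefix` and `selectedNames` helpers, and the plain `Map` used in place of real metadata are all hypothetical stand-ins chosen for illustration, so the actual Tika signatures should be checked in the Javadoc of each type.

```java
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

public class SelectorSketch {

    /** Hypothetical stand-in for a selection strategy; NOT the Tika interface itself. */
    interface Selector {
        boolean select(Map<String, String> metadata);
    }

    /** Strategy that accepts only documents whose content type starts with the given prefix. */
    static Selector byContentTypePrefix(String prefix) {
        return metadata -> {
            String type = metadata.get("Content-Type");
            // Documents without a declared content type are rejected.
            return type != null && type.startsWith(prefix);
        };
    }

    /** Applies the strategy to each embedded document's metadata and returns the names of those selected. */
    static List<String> selectedNames(List<Map<String, String>> embedded, Selector selector) {
        return embedded.stream()
                .filter(selector::select)
                .map(m -> m.get("name"))
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Assumed sample metadata for three embedded documents (illustrative only).
        List<Map<String, String>> embedded = List.of(
                Map.of("name", "logo.png", "Content-Type", "image/png"),
                Map.of("name", "notes.txt", "Content-Type", "text/plain"),
                Map.of("name", "unknown.bin"));

        System.out.println(selectedNames(embedded, byContentTypePrefix("image/")));
    }
}
```

Keeping the strategy behind a single-method interface means the extraction loop does not need to know why a document was accepted or rejected, so strategies can be swapped without touching the caller.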
Copyright © 2007–2022 The Apache Software Foundation. All rights reserved.