Uses of Interface
org.apache.tika.extractor.EmbeddedDocumentExtractor
Packages that use EmbeddedDocumentExtractor
Package
Description
Extraction of component documents.
-
Uses of EmbeddedDocumentExtractor in org.apache.tika.extractor
Classes in org.apache.tika.extractor that implement EmbeddedDocumentExtractorModifier and TypeClassDescriptionclass
Helper class for parsers of package archives or other compound document formats that support embedded or attached component documents.Methods in org.apache.tika.extractor that return EmbeddedDocumentExtractorModifier and TypeMethodDescriptionstatic EmbeddedDocumentExtractor
EmbeddedDocumentUtil.getEmbeddedDocumentExtractor
(ParseContext context) This offers a uniform way to get an EmbeddedDocumentExtractor from a ParseContext.EmbeddedDocumentExtractorFactory.newInstance
(Metadata metadata, ParseContext parseContext) ParsingEmbeddedDocumentExtractorFactory.newInstance
(Metadata metadata, ParseContext parseContext) -
Uses of EmbeddedDocumentExtractor in org.apache.tika.parser.microsoft
Methods in org.apache.tika.parser.microsoft with parameters of type EmbeddedDocumentExtractorModifier and TypeMethodDescriptionstatic void
OfficeParser.extractMacros
(org.apache.poi.poifs.filesystem.POIFSFileSystem fs, ContentHandler xhtml, EmbeddedDocumentExtractor embeddedDocumentExtractor) Helper to extract macros from an NPOIFS/vbaProject.bin -
Uses of EmbeddedDocumentExtractor in org.apache.tika.parser.pdf.image
Fields in org.apache.tika.parser.pdf.image declared as EmbeddedDocumentExtractorModifier and TypeFieldDescriptionprotected final EmbeddedDocumentExtractor
ImageGraphicsEngine.embeddedDocumentExtractor
Methods in org.apache.tika.parser.pdf.image with parameters of type EmbeddedDocumentExtractorModifier and TypeMethodDescriptionImageGraphicsEngineFactory.newEngine
(org.apache.pdfbox.pdmodel.PDPage page, int pageNumber, EmbeddedDocumentExtractor embeddedDocumentExtractor, PDFParserConfig pdfParserConfig, Map<org.apache.pdfbox.cos.COSStream, Integer> processedInlineImages, AtomicInteger imageCounter, XHTMLContentHandler xhtml, Metadata parentMetadata, ParseContext parseContext) Constructors in org.apache.tika.parser.pdf.image with parameters of type EmbeddedDocumentExtractorModifierConstructorDescriptionprotected
ImageGraphicsEngine
(org.apache.pdfbox.pdmodel.PDPage page, int pageNumber, EmbeddedDocumentExtractor embeddedDocumentExtractor, PDFParserConfig pdfParserConfig, Map<org.apache.pdfbox.cos.COSStream, Integer> processedInlineImages, AtomicInteger imageCounter, XHTMLContentHandler xhtml, Metadata parentMetadata, ParseContext parseContext)