Class POIXMLTextExtractorDecorator
- java.lang.Object
-
- org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
-
- org.apache.tika.parser.microsoft.ooxml.POIXMLTextExtractorDecorator
-
- All Implemented Interfaces:
OOXMLExtractor
public class POIXMLTextExtractorDecorator extends AbstractOOXMLExtractor
-
-
Field Summary
-
Fields inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
config, EMBEDDED_RELATIONSHIPS, extractor
-
-
Constructor Summary
Constructors Constructor Description POIXMLTextExtractorDecorator(ParseContext context, org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description protected voidbuildXHTML(XHTMLContentHandler xhtml)Populates theXHTMLContentHandlerobject received as parameter.protected List<org.apache.poi.openxml4j.opc.PackagePart>getMainDocumentParts()Return a list of the main parts of the document, used when searching for embedded resources.-
Methods inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
getDocument, getEmbeddedPartMetadataMap, getJustFileName, getMetadataExtractor, getXHTML, handleEmbeddedFile, loadLinkedRelationships
-
-
-
-
Constructor Detail
-
POIXMLTextExtractorDecorator
public POIXMLTextExtractorDecorator(ParseContext context, org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor)
-
-
Method Detail
-
buildXHTML
protected void buildXHTML(XHTMLContentHandler xhtml) throws SAXException
Description copied from class:AbstractOOXMLExtractorPopulates theXHTMLContentHandlerobject received as parameter.- Specified by:
buildXHTMLin classAbstractOOXMLExtractor- Throws:
SAXException
-
getMainDocumentParts
protected List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts()
Description copied from class:AbstractOOXMLExtractorReturn a list of the main parts of the document, used when searching for embedded resources. This should be all the parts of the document that end up with things embedded into them.- Specified by:
getMainDocumentPartsin classAbstractOOXMLExtractor
-
-