public class XPSExtractorDecorator extends AbstractOOXMLExtractor
config, EMBEDDED_RELATIONSHIPS, extractor
Constructor and Description |
---|
XPSExtractorDecorator(ParseContext context,
org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor) |
Modifier and Type | Method and Description |
---|---|
protected void |
buildXHTML(XHTMLContentHandler xhtml)
Populates the
XHTMLContentHandler object received as parameter. |
org.apache.poi.ooxml.POIXMLDocument |
getDocument()
Returns the opened document.
|
protected List<org.apache.poi.openxml4j.opc.PackagePart> |
getMainDocumentParts()
Return a list of the main parts of the document, used
when searching for embedded resources.
|
getJustFileName, getMetadataExtractor, getXHTML, handleEmbeddedFile, loadLinkedRelationships
public XPSExtractorDecorator(ParseContext context, org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor) throws TikaException
TikaException
public org.apache.poi.ooxml.POIXMLDocument getDocument()
OOXMLExtractor
getDocument
in interface OOXMLExtractor
getDocument
in class AbstractOOXMLExtractor
OOXMLExtractor.getDocument()
protected void buildXHTML(XHTMLContentHandler xhtml) throws SAXException, IOException
AbstractOOXMLExtractor
XHTMLContentHandler
object received as parameter.buildXHTML
in class AbstractOOXMLExtractor
SAXException
IOException
protected List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts() throws TikaException
AbstractOOXMLExtractor
getMainDocumentParts
in class AbstractOOXMLExtractor
TikaException
Copyright © 2007–2020 The Apache Software Foundation. All rights reserved.