Class XWPFWordExtractorDecorator
java.lang.Object
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator
All Implemented Interfaces:
OOXMLExtractor
Field Summary
Fields inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
config, EMBEDDED_RELATIONSHIPS, extractor
Constructor Summary
Constructors
XWPFWordExtractorDecorator(Metadata metadata, ParseContext context, org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor)
XWPFWordExtractorDecorator(ParseContext context, org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor)
    Deprecated.
Method Summary
protected void buildXHTML(XHTMLContentHandler xhtml)
    Populates the XHTMLContentHandler object received as parameter.
protected Map<String,EmbeddedPartMetadata> getEmbeddedPartMetadataMap()
protected List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts()
    Include main body and anything else that can have an attachment/embedded object
Methods inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
getDocument, getJustFileName, getMetadataExtractor, getXHTML, handleEmbeddedFile, loadLinkedRelationships
-
Constructor Details
-
XWPFWordExtractorDecorator
public XWPFWordExtractorDecorator(Metadata metadata, ParseContext context, org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor)
XWPFWordExtractorDecorator
@Deprecated
public XWPFWordExtractorDecorator(ParseContext context, org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor)
Parameters:
context -
extractor -
Method Details
-
buildXHTML
protected void buildXHTML(XHTMLContentHandler xhtml) throws SAXException, org.apache.xmlbeans.XmlException, IOException
Description copied from class: AbstractOOXMLExtractor
Populates the XHTMLContentHandler object received as parameter.
Specified by:
buildXHTML in class AbstractOOXMLExtractor
Throws:
SAXException
org.apache.xmlbeans.XmlException
IOException
See Also:
XWPFWordExtractor.getText()
getEmbeddedPartMetadataMap
Overrides:
getEmbeddedPartMetadataMap in class AbstractOOXMLExtractor
-
getMainDocumentParts
Includes the main body and any other part that can have an attachment or embedded object.
Specified by:
getMainDocumentParts in class AbstractOOXMLExtractor
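The relationship between the two "Specified by" methods may be easier to see in code. The following is a minimal, self-contained sketch of the template-method arrangement implied above: a base class drives extraction, while the subclass supplies the body content (buildXHTML) and the list of parts to scan for embedded objects (getMainDocumentParts). Every type here (SketchPart, SketchAbstractExtractor, SketchWordExtractor, ExtractorSketchDemo) is a hypothetical stand-in written only with the JDK; none of it is Tika or POI code, and the order in which the real AbstractOOXMLExtractor calls these methods is an assumption.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for a package part: a name plus the embedded targets it references. */
class SketchPart {
    final String name;
    final List<String> embeddedTargets;

    SketchPart(String name, List<String> embeddedTargets) {
        this.name = name;
        this.embeddedTargets = embeddedTargets;
    }
}

/** Hypothetical stand-in for AbstractOOXMLExtractor: it owns the overall flow (assumed order). */
abstract class SketchAbstractExtractor {
    /** Template method: emit the body, then walk the parts that may carry embedded objects. */
    final String getXHTML() {
        StringBuilder xhtml = new StringBuilder("<body>");
        buildXHTML(xhtml);
        for (SketchPart part : getMainDocumentParts()) {
            for (String target : part.embeddedTargets) {
                xhtml.append("<embedded>").append(target).append("</embedded>");
            }
        }
        return xhtml.append("</body>").toString();
    }

    /** Subclasses write the document text into the output. */
    protected abstract void buildXHTML(StringBuilder xhtml);

    /** Subclasses list every part that can reference an attachment or embedded object. */
    protected abstract List<SketchPart> getMainDocumentParts();
}

/** Hypothetical stand-in for XWPFWordExtractorDecorator. */
class SketchWordExtractor extends SketchAbstractExtractor {
    private final List<String> paragraphs;
    private final SketchPart body;
    private final List<SketchPart> otherParts;

    SketchWordExtractor(List<String> paragraphs, SketchPart body, List<SketchPart> otherParts) {
        this.paragraphs = paragraphs;
        this.body = body;
        this.otherParts = otherParts;
    }

    @Override
    protected void buildXHTML(StringBuilder xhtml) {
        // One <p> element per paragraph; no escaping, since this is only a sketch.
        for (String paragraph : paragraphs) {
            xhtml.append("<p>").append(paragraph).append("</p>");
        }
    }

    @Override
    protected List<SketchPart> getMainDocumentParts() {
        // Main body first, then anything else that can have an embedded object.
        List<SketchPart> parts = new ArrayList<>();
        parts.add(body);
        parts.addAll(otherParts);
        return parts;
    }
}

class ExtractorSketchDemo {
    public static void main(String[] args) {
        SketchPart body = new SketchPart("document.xml", List.of("image1.png"));
        SketchPart other = new SketchPart("header1.xml", List.of("logo.emf"));
        SketchWordExtractor extractor =
                new SketchWordExtractor(List.of("Hello"), body, List.of(other));
        System.out.println(extractor.getXHTML());
    }
}
```

In this sketch the subclass never decides when embedded objects are handled; it only reports which parts to inspect, which is the division of labour the method descriptions above suggest.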
-
XWPFWordExtractorDecorator(Metadata, ParseContext, XWPFWordExtractor)