Class XSSFBExcelExtractorDecorator
java.lang.Object
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator
org.apache.tika.parser.microsoft.ooxml.XSSFBExcelExtractorDecorator
- All Implemented Interfaces:
OOXMLExtractor
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator
XSSFExcelExtractorDecorator.HeaderFooterFromString, XSSFExcelExtractorDecorator.SheetTextAsHTML, XSSFExcelExtractorDecorator.XSSFSheetInterestingPartsCapturer -
Field Summary
Fields inherited from class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator
drawingHyperlinks, formatter, hfHelper, metadata, parseContext, sheetPartsFields inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
config, EMBEDDED_RELATIONSHIPS, extractor -
Constructor Summary
ConstructorsConstructorDescriptionXSSFBExcelExtractorDecorator(ParseContext context, org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor, Locale locale) -
Method Summary
Modifier and TypeMethodDescriptionprotected voidbuildXHTML(XHTMLContentHandler xhtml) Populates theXHTMLContentHandlerobject received as parameter.protected voidconfigureExtractor(org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor, Locale locale) protected voidextractHeaderFooter(String hf, XHTMLContentHandler xhtml) voidgetXHTML(ContentHandler handler, Metadata metadata, ParseContext context) Parses the document into a sequence of XHTML SAX events sent to the given content handler.Methods inherited from class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator
addDrawingHyperLinks, extractHyperLinks, getMainDocumentParts, processShapes, processSheetMethods inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
getDocument, getEmbeddedPartMetadataMap, getJustFileName, getMetadataExtractor, handleEmbeddedFile, loadLinkedRelationships
-
Constructor Details
-
XSSFBExcelExtractorDecorator
public XSSFBExcelExtractorDecorator(ParseContext context, org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor, Locale locale)
-
-
Method Details
-
configureExtractor
protected void configureExtractor(org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor, Locale locale) - Overrides:
configureExtractorin classXSSFExcelExtractorDecorator
-
getXHTML
public void getXHTML(ContentHandler handler, Metadata metadata, ParseContext context) throws SAXException, org.apache.xmlbeans.XmlException, IOException, TikaException Description copied from interface:OOXMLExtractorParses the document into a sequence of XHTML SAX events sent to the given content handler.- Specified by:
getXHTMLin interfaceOOXMLExtractor- Overrides:
getXHTMLin classXSSFExcelExtractorDecorator- Throws:
SAXExceptionorg.apache.xmlbeans.XmlExceptionIOExceptionTikaException- See Also:
-
buildXHTML
protected void buildXHTML(XHTMLContentHandler xhtml) throws SAXException, org.apache.xmlbeans.XmlException, IOException Description copied from class:AbstractOOXMLExtractorPopulates theXHTMLContentHandlerobject received as parameter.- Overrides:
buildXHTMLin classXSSFExcelExtractorDecorator- Throws:
SAXExceptionorg.apache.xmlbeans.XmlExceptionIOException- See Also:
-
XSSFBEventBasedExcelExtractor.getText()
-