Class XSSFBExcelExtractorDecorator
java.lang.Object
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator
org.apache.tika.parser.microsoft.ooxml.XSSFBExcelExtractorDecorator
- All Implemented Interfaces:
OOXMLExtractor
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator
XSSFExcelExtractorDecorator.HeaderFooterFromString, XSSFExcelExtractorDecorator.SheetTextAsHTML, XSSFExcelExtractorDecorator.XSSFSheetInterestingPartsCapturer
-
Field Summary
Fields inherited from class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator
drawingHyperlinks, formatter, hfHelper, metadata, parseContext, sheetParts
Fields inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
config, EMBEDDED_RELATIONSHIPS, extractor
-
Constructor Summary
ConstructorDescriptionXSSFBExcelExtractorDecorator
(ParseContext context, org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor, Locale locale) -
Method Summary
Modifier and TypeMethodDescriptionprotected void
buildXHTML
(XHTMLContentHandler xhtml) Populates theXHTMLContentHandler
object received as parameter.protected void
configureExtractor
(org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor, Locale locale) protected void
extractHeaderFooter
(String hf, XHTMLContentHandler xhtml) void
getXHTML
(ContentHandler handler, Metadata metadata, ParseContext context) Parses the document into a sequence of XHTML SAX events sent to the given content handler.Methods inherited from class org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator
addDrawingHyperLinks, extractHyperLinks, getMainDocumentParts, processShapes, processSheet
Methods inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
getDocument, getEmbeddedPartMetadataMap, getJustFileName, getMetadataExtractor, handleEmbeddedFile, loadLinkedRelationships
-
Constructor Details
-
XSSFBExcelExtractorDecorator
public XSSFBExcelExtractorDecorator(ParseContext context, org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor, Locale locale)
-
-
Method Details
-
configureExtractor
protected void configureExtractor(org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor, Locale locale) - Overrides:
configureExtractor
in classXSSFExcelExtractorDecorator
-
getXHTML
public void getXHTML(ContentHandler handler, Metadata metadata, ParseContext context) throws SAXException, org.apache.xmlbeans.XmlException, IOException, TikaException Description copied from interface:OOXMLExtractor
Parses the document into a sequence of XHTML SAX events sent to the given content handler.- Specified by:
getXHTML
in interfaceOOXMLExtractor
- Overrides:
getXHTML
in classXSSFExcelExtractorDecorator
- Throws:
SAXException
org.apache.xmlbeans.XmlException
IOException
TikaException
- See Also:
-
buildXHTML
protected void buildXHTML(XHTMLContentHandler xhtml) throws SAXException, org.apache.xmlbeans.XmlException, IOException Description copied from class:AbstractOOXMLExtractor
Populates theXHTMLContentHandler
object received as parameter.- Overrides:
buildXHTML
in classXSSFExcelExtractorDecorator
- Throws:
SAXException
org.apache.xmlbeans.XmlException
IOException
- See Also:
-
XSSFBEventBasedExcelExtractor.getText()
-