org.apache.tika.parser.microsoft.ooxml
Class XSSFExcelExtractorDecorator

java.lang.Object
  extended by org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
      extended by org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator
All Implemented Interfaces:
OOXMLExtractor

public class XSSFExcelExtractorDecorator
extends AbstractOOXMLExtractor


Constructor Summary
XSSFExcelExtractorDecorator(ParseContext context, org.apache.poi.xssf.extractor.XSSFExcelExtractor extractor, java.util.Locale locale)
           
 
Method Summary
protected  void buildXHTML(XHTMLContentHandler xhtml)
          Populates the XHTMLContentHandler object received as parameter.
protected  java.util.List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts()
          In Excel files, sheets have things embedded in them, and sheet drawings which have the images
 MetadataExtractor getMetadataExtractor()
          POIXMLTextExtractor.getMetadataTextExtractor() not yet supported for OOXML by POI.
 
Methods inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
getDocument, getXHTML, handleEmbedded
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

XSSFExcelExtractorDecorator

public XSSFExcelExtractorDecorator(ParseContext context,
                                   org.apache.poi.xssf.extractor.XSSFExcelExtractor extractor,
                                   java.util.Locale locale)
Method Detail

buildXHTML

protected void buildXHTML(XHTMLContentHandler xhtml)
                   throws org.xml.sax.SAXException,
                          org.apache.xmlbeans.XmlException,
                          java.io.IOException
Description copied from class: AbstractOOXMLExtractor
Populates the XHTMLContentHandler object received as parameter.

Specified by:
buildXHTML in class AbstractOOXMLExtractor
Throws:
org.xml.sax.SAXException
org.apache.xmlbeans.XmlException
java.io.IOException
See Also:
XSSFExcelExtractor.getText()

getMainDocumentParts

protected java.util.List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts()
                                                                                 throws TikaException
In Excel files, sheets have things embedded in them, and sheet drawings which have the images

Specified by:
getMainDocumentParts in class AbstractOOXMLExtractor
Throws:
TikaException

getMetadataExtractor

public MetadataExtractor getMetadataExtractor()
Description copied from interface: OOXMLExtractor
POIXMLTextExtractor.getMetadataTextExtractor() not yet supported for OOXML by POI.

Specified by:
getMetadataExtractor in interface OOXMLExtractor
Overrides:
getMetadataExtractor in class AbstractOOXMLExtractor
See Also:
OOXMLExtractor.getMetadataExtractor()


Copyright © 2007-2011 The Apache Software Foundation. All Rights Reserved.