org.apache.tika.parser.microsoft.ooxml
Class XSLFPowerPointExtractorDecorator

java.lang.Object
  extended by org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
      extended by org.apache.tika.parser.microsoft.ooxml.XSLFPowerPointExtractorDecorator
All Implemented Interfaces:
OOXMLExtractor

public class XSLFPowerPointExtractorDecorator
extends AbstractOOXMLExtractor


Field Summary
 
Fields inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
extractor
 
Constructor Summary
XSLFPowerPointExtractorDecorator(org.apache.poi.xslf.extractor.XSLFPowerPointExtractor extractor)
           
 
Method Summary
protected  void buildXHTML(XHTMLContentHandler xhtml)
          Populates the XHTMLContentHandler object received as parameter.
protected  java.util.List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts()
          In PowerPoint files, slides have things embedded in them, and slide drawings which have the images
 
Methods inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
getDocument, getMetadataExtractor, getXHTML, handleEmbedded
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

XSLFPowerPointExtractorDecorator

public XSLFPowerPointExtractorDecorator(org.apache.poi.xslf.extractor.XSLFPowerPointExtractor extractor)
Method Detail

buildXHTML

protected void buildXHTML(XHTMLContentHandler xhtml)
                   throws org.xml.sax.SAXException,
                          org.apache.xmlbeans.XmlException,
                          java.io.IOException
Description copied from class: AbstractOOXMLExtractor
Populates the XHTMLContentHandler object received as parameter.

Specified by:
buildXHTML in class AbstractOOXMLExtractor
Throws:
org.xml.sax.SAXException
org.apache.xmlbeans.XmlException
java.io.IOException
See Also:
XSLFPowerPointExtractor.getText()

getMainDocumentParts

protected java.util.List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts()
                                                                                 throws TikaException
In PowerPoint files, slides have things embedded in them, and slide drawings which have the images

Specified by:
getMainDocumentParts in class AbstractOOXMLExtractor
Throws:
TikaException


Copyright © 2007-2010 The Apache Software Foundation. All Rights Reserved.