Interface OOXMLExtractor
- All Known Implementing Classes:
AbstractOOXMLExtractor,SXSLFPowerPointExtractorDecorator,SXWPFWordExtractorDecorator,VSDXExtractorDecorator,XPSExtractorDecorator,XSSFBExcelExtractorDecorator,XSSFExcelExtractorDecorator
public interface OOXMLExtractor
Interface implemented by all Tika OOXML extractors.
-
Method Summary
Modifier and TypeMethodDescriptionvoidgetXHTML(ContentHandler handler, Metadata metadata, ParseContext context) Parses the document into a sequence of XHTML SAX events sent to the given content handler.
-
Method Details
-
getMetadataExtractor
MetadataExtractor getMetadataExtractor() -
getXHTML
void getXHTML(ContentHandler handler, Metadata metadata, ParseContext context) throws SAXException, IOException, TikaException Parses the document into a sequence of XHTML SAX events sent to the given content handler.- Throws:
SAXExceptionIOExceptionTikaException
-