Interface | Description |
---|---|
OOXMLExtractor |
Interface implemented by all Tika OOXML extractors.
|
OOXMLWordAndPowerPointTextHandler.XWPFBodyContentsHandler |
Class | Description |
---|---|
AbstractOOXMLExtractor |
Base class for all Tika OOXML extractors.
|
MetadataExtractor |
OOXML metadata extractor.
|
OOXMLExtractorFactory |
Figures out the correct
OOXMLExtractor for the supplied document and
returns it. |
OOXMLParser |
Office Open XML (OOXML) parser.
|
OOXMLTikaBodyPartHandler | |
OOXMLWordAndPowerPointTextHandler |
This class is intended to handle anything that might contain IBodyElements:
main document, headers, footers, notes, slides, etc.
|
ParagraphProperties | |
POIXMLTextExtractorDecorator | |
RunProperties |
WARNING: This class is mutable.
|
SXSLFPowerPointExtractorDecorator |
SAX/Streaming pptx extractior
|
SXWPFWordExtractorDecorator |
This is an experimental, alternative extractor for docx files.
|
XSLFPowerPointExtractorDecorator | |
XSSFBExcelExtractorDecorator | |
XSSFExcelExtractorDecorator | |
XSSFExcelExtractorDecorator.HeaderFooterFromString | |
XSSFExcelExtractorDecorator.SheetTextAsHTML |
Turns formatted sheet events into HTML
|
XSSFExcelExtractorDecorator.XSSFSheetInterestingPartsCapturer |
Captures information on interesting tags, whilst
delegating the main work to the formatting handler
|
XWPFListManager | |
XWPFWordExtractorDecorator |
Enum | Description |
---|---|
OOXMLWordAndPowerPointTextHandler.EditType |
Copyright © 2007–2020 The Apache Software Foundation. All rights reserved.