Package org.apache.tika.parser.microsoft
Class OldExcelParser
java.lang.Object
org.apache.tika.parser.microsoft.OldExcelParser
- All Implemented Interfaces:
Serializable, SelfConfiguring, Parser
A POI-powered Tika Parser for very old versions of Excel, from
pre-OLE2 days, such as Excel 4.
Constructor Summary
Constructors
OldExcelParser()
Method Summary
Modifier and Type | Method | Description
Set<MediaType> | getSupportedTypes(ParseContext context) | Returns the set of media types supported by this parser when used with the given parse context.
protected static void | parse(org.apache.poi.hssf.extractor.OldExcelExtractor extractor, XHTMLContentHandler xhtml) |
void | parse(TikaInputStream tis, ContentHandler handler, Metadata metadata, ParseContext context) | Extracts properties and text from an MS Document input stream.
-
Constructor Details
-
OldExcelParser
public OldExcelParser()
-
-
Method Details
-
parse
protected static void parse(org.apache.poi.hssf.extractor.OldExcelExtractor extractor, XHTMLContentHandler xhtml) throws TikaException, IOException, SAXException
- Throws:
TikaException
IOException
SAXException
-
getSupportedTypes
public Set<MediaType> getSupportedTypes(ParseContext context)
Description copied from interface: Parser
Returns the set of media types supported by this parser when used with the given parse context.
- Specified by:
getSupportedTypes in interface Parser
- Parameters:
context - parse context
- Returns:
immutable set of media types
-
parse
public void parse(TikaInputStream tis, ContentHandler handler, Metadata metadata, ParseContext context) throws IOException, SAXException, TikaException
Extracts properties and text from an MS Document input stream.
- Specified by:
parse in interface Parser
- Parameters:
handler - handler for the XHTML SAX events (output)
metadata - document metadata (input and output)
context - parse context
- Throws:
IOException - if the document stream could not be read
SAXException - if the SAX events could not be processed
TikaException - if the document could not be parsed
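Example (illustrative sketch, not part of the official documentation): the snippet below shows one way the public parse method might be called directly. It assumes that Tika and POI are on the classpath, that TikaInputStream.get(Path) and BodyContentHandler(int) are available in the Tika version in use, and that the input is a pre-OLE2 Excel file. The class name OldExcelTextDemo and the helper extractText are hypothetical names introduced only for this example.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;

import org.apache.tika.exception.TikaException;
import org.apache.tika.io.TikaInputStream;
import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.ParseContext;
import org.apache.tika.parser.microsoft.OldExcelParser;
import org.apache.tika.sax.BodyContentHandler;
import org.xml.sax.SAXException;

public class OldExcelTextDemo {

    // Hypothetical helper: returns the plain text extracted from an old Excel file.
    static String extractText(Path file) throws IOException, SAXException, TikaException {
        OldExcelParser parser = new OldExcelParser();
        // A write limit of -1 disables BodyContentHandler's default character limit.
        BodyContentHandler handler = new BodyContentHandler(-1);
        Metadata metadata = new Metadata();      // filled in by the parser (input and output)
        ParseContext context = new ParseContext();

        // Assumption: TikaInputStream.get(Path) is available in this Tika version.
        try (TikaInputStream tis = TikaInputStream.get(file)) {
            parser.parse(tis, handler, metadata, context);
        }
        return handler.toString();
    }

    public static void main(String[] args) throws Exception {
        // Print the media types this parser declares for a default parse context.
        System.out.println(new OldExcelParser().getSupportedTypes(new ParseContext()));

        // Optionally parse a file whose path is given as the first argument.
        if (args.length > 0) {
            System.out.println(extractText(Paths.get(args[0])));
        }
    }
}
```

Calling the parser directly, as here, skips type detection. In most applications an auto-detecting parser would normally select OldExcelParser based on the media types returned by getSupportedTypes.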
-