Package org.apache.tika.parser.microsoft
package org.apache.tika.parser.microsoft
-
ClassDescriptionIntermediate layer to set
OfficeParserConfig
uniformly.Cell of content.Cell decorator.Extracts files embedded in EMF and offers a very rough capability to extract text if there is text stored in the EMF.Excel parser implementation which uses POI's Event API to handle the contents of a Workbook.Parser that handles Microsoft Access files via JackcessLinked cell.Computes the number text which goes at the beginning of each list paragraphParser for temporary MSOFfice files.Number cell.Defines a Microsoft document content extractor.A POI-powered Tika Parser for very old versions of Excel, from pre-OLE2 days, such as Excel 4.Outlook Message Parser.Extractor for Common OLE2 (HPSF) metadataText cell.Overrides Excel's General format to include more significant digits than the MS Spec allows.A Format that allows up to 15 significant digits for integers.A POI-powered Tika Parser for TNEF (Transport Neutral Encoding Format) messages, aka winmail.datThis parser offers a very rough capability to extract text if there is text stored in the WMF files.