Package org.apache.tika.parser.microsoft
Class OutlookExtractor
java.lang.Object
org.apache.tika.parser.microsoft.OutlookExtractor
Outlook Message Parser.
-
Nested Class Summary
Nested Classes -
Field Summary
Fields
Modifier and Type / Field
protected final ParseContext context
protected final OfficeParserConfig officeParserConfig
protected final Metadata parentMetadata
Constructor Summary
Constructors
OutlookExtractor(org.apache.poi.poifs.filesystem.DirectoryNode root, Metadata metadata, ParseContext context)
OutlookExtractor(org.apache.poi.poifs.filesystem.DirectoryNode root, ParseContext context)
    Deprecated. Use OutlookExtractor(DirectoryNode, Metadata, ParseContext). Will be removed after 2.4.0.
OutlookExtractor(org.apache.poi.poifs.filesystem.POIFSFileSystem filesystem, ParseContext context)
    Deprecated. Use OutlookExtractor(DirectoryNode, Metadata, ParseContext). Will be removed after 2.4.0.
Method Summary
Modifier and Type / Method / Description
static void addEvenIfNull(Property property, String value, Metadata metadata)
protected Detector getDetector()
static String getMessageClass(String messageClass)
protected MimeTypes getMimeTypes()
    Deprecated.
protected String getPassword()
    Returns the password to be used for this file, or null if no password or the default password should be used.
protected TikaConfig getTikaConfig()
protected void handleEmbeddedOfficeDoc(org.apache.poi.poifs.filesystem.DirectoryEntry dir, String resourceName, XHTMLContentHandler xhtml, boolean outputHtml)
    Handle an office document that's embedded at the POIFS level.
protected void handleEmbeddedOfficeDoc(org.apache.poi.poifs.filesystem.DirectoryEntry dir, XHTMLContentHandler xhtml, boolean outputHtml)
    Handle an office document that's embedded at the POIFS level.
protected void handleEmbeddedResource(TikaInputStream resource, String filename, String relationshipID, String mediaType, XHTMLContentHandler xhtml, boolean outputHtml)
protected void handleEmbeddedResource(TikaInputStream resource, String filename, String relationshipID, org.apache.poi.hpsf.ClassID storageClassID, String mediaType, XHTMLContentHandler xhtml, boolean outputHtml)
protected void handleEmbeddedResource(TikaInputStream resource, Metadata embeddedMetadata, String filename, String relationshipID, org.apache.poi.hpsf.ClassID storageClassID, String mediaType, XHTMLContentHandler xhtml, boolean outputHtml)
void parse(XHTMLContentHandler xhtml)
void parse(XHTMLContentHandler xhtml, Metadata metadata)
    Deprecated. Use parse(XHTMLContentHandler). Will be removed after 2.4.0.
-
Field Details
-
parentMetadata
-
officeParserConfig
-
context
-
-
Constructor Details
-
OutlookExtractor
@Deprecated public OutlookExtractor(org.apache.poi.poifs.filesystem.POIFSFileSystem filesystem, ParseContext context) throws TikaException
Deprecated. Use OutlookExtractor(DirectoryNode, Metadata, ParseContext). Will be removed after 2.4.0.
Parameters:
filesystem -
context -
Throws:
TikaException
-
OutlookExtractor
@Deprecated public OutlookExtractor(org.apache.poi.poifs.filesystem.DirectoryNode root, ParseContext context) throws TikaException
Deprecated. Use OutlookExtractor(DirectoryNode, Metadata, ParseContext). Will be removed after 2.4.0.
Parameters:
root -
context -
Throws:
TikaException
-
OutlookExtractor
public OutlookExtractor(org.apache.poi.poifs.filesystem.DirectoryNode root, Metadata metadata, ParseContext context) throws TikaException
Throws:
TikaException
-
-
Method Details
-
addEvenIfNull
-
getMessageClass
-
parse
Throws:
TikaException
SAXException
IOException
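The non-deprecated constructor and parse(XHTMLContentHandler) can be combined roughly as follows. This is a minimal sketch rather than the way Tika itself wires the extractor: the class name OutlookExtractorSketch and the method extract are hypothetical, it assumes the Tika Microsoft parser module and Apache POI are on the classpath, and it assumes the caller is responsible for starting and ending the XHTML document around parse.

```java
import java.io.File;
import java.io.IOException;

import org.apache.poi.poifs.filesystem.POIFSFileSystem;
import org.apache.tika.exception.TikaException;
import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.ParseContext;
import org.apache.tika.parser.microsoft.OutlookExtractor;
import org.apache.tika.sax.BodyContentHandler;
import org.apache.tika.sax.XHTMLContentHandler;
import org.xml.sax.SAXException;

// Hypothetical helper class, not part of Tika.
public class OutlookExtractorSketch {

    // Extracts the body text of an Outlook .msg file; metadata is collected
    // in a local Metadata object and discarded in this sketch.
    public static String extract(File msgFile)
            throws IOException, TikaException, SAXException {
        Metadata metadata = new Metadata();
        BodyContentHandler body = new BodyContentHandler();
        XHTMLContentHandler xhtml = new XHTMLContentHandler(body, metadata);

        // Open the OLE2 container read-only and hand its root directory
        // to the extractor.
        try (POIFSFileSystem fs = new POIFSFileSystem(msgFile, true)) {
            OutlookExtractor extractor =
                    new OutlookExtractor(fs.getRoot(), metadata, new ParseContext());
            // Assumption: the caller opens and closes the XHTML document.
            xhtml.startDocument();
            extractor.parse(xhtml);
            xhtml.endDocument();
        }
        return body.toString();
    }
}
```

In most applications the extractor is reached indirectly through a Tika parser instead of being constructed by hand; the direct form above is mainly useful when a DirectoryNode is already available.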
-
parse
@Deprecated public void parse(XHTMLContentHandler xhtml, Metadata metadata) throws TikaException, SAXException, IOException
Deprecated. Use parse(XHTMLContentHandler). Will be removed after 2.4.0.
Parameters:
xhtml -
metadata -
Throws:
TikaException
SAXException
IOException
-
getTikaConfig
-
getDetector
-
getMimeTypes
Deprecated.
Returns:
mimetypes
-
getPassword
Returns the password to be used for this file, or null if no password or the default password should be used.
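One way a password might reach the extractor is through the ParseContext passed to the constructor. The sketch below registers a Tika PasswordProvider in a context; it is an assumption, not something this page states, that getPassword() consults that provider. The class name PasswordContextSketch and the method contextWithPassword are hypothetical.

```java
import org.apache.tika.parser.ParseContext;
import org.apache.tika.parser.PasswordProvider;

// Hypothetical helper class, not part of Tika.
public class PasswordContextSketch {

    // Builds a ParseContext whose PasswordProvider always answers with the
    // given password. Passing null mirrors the documented "no / default
    // password" case.
    public static ParseContext contextWithPassword(String password) {
        PasswordProvider provider = metadata -> password;
        ParseContext context = new ParseContext();
        context.set(PasswordProvider.class, provider);
        return context;
    }
}
```

The resulting context can then be passed as the ParseContext argument of OutlookExtractor(DirectoryNode, Metadata, ParseContext).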
handleEmbeddedResource
protected void handleEmbeddedResource(TikaInputStream resource, String filename, String relationshipID, String mediaType, XHTMLContentHandler xhtml, boolean outputHtml) throws IOException, SAXException, TikaException
Throws:
IOException
SAXException
TikaException
-
handleEmbeddedResource
protected void handleEmbeddedResource(TikaInputStream resource, String filename, String relationshipID, org.apache.poi.hpsf.ClassID storageClassID, String mediaType, XHTMLContentHandler xhtml, boolean outputHtml) throws IOException, SAXException, TikaException
Throws:
IOException
SAXException
TikaException
-
handleEmbeddedResource
protected void handleEmbeddedResource(TikaInputStream resource, Metadata embeddedMetadata, String filename, String relationshipID, org.apache.poi.hpsf.ClassID storageClassID, String mediaType, XHTMLContentHandler xhtml, boolean outputHtml) throws IOException, SAXException, TikaException
Throws:
IOException
SAXException
TikaException
-
handleEmbeddedOfficeDoc
protected void handleEmbeddedOfficeDoc(org.apache.poi.poifs.filesystem.DirectoryEntry dir, XHTMLContentHandler xhtml, boolean outputHtml) throws IOException, SAXException, TikaException
Handle an office document that's embedded at the POIFS level.
Throws:
IOException
SAXException
TikaException
-
handleEmbeddedOfficeDoc
protected void handleEmbeddedOfficeDoc(org.apache.poi.poifs.filesystem.DirectoryEntry dir, String resourceName, XHTMLContentHandler xhtml, boolean outputHtml) throws IOException, SAXException, TikaException
Handle an office document that's embedded at the POIFS level.
Throws:
IOException
SAXException
TikaException
-