Class OfficeParser

    • Constructor Detail

      • OfficeParser

        public OfficeParser()
    • Method Detail

      • getSupportedTypes

        public Set<MediaType> getSupportedTypes​(ParseContext context)
        Description copied from interface: Parser
        Returns the set of media types supported by this parser when used with the given parse context.
        Parameters:
        context - parse context
        Returns:
        immutable set of media types
      • extractMacros

        public static void extractMacros​(org.apache.poi.poifs.filesystem.POIFSFileSystem fs,
                                         ContentHandler xhtml,
                                         EmbeddedDocumentExtractor embeddedDocumentExtractor)
                                  throws IOException,
                                         SAXException
        Helper to extract macros from an NPOIFS/vbaProject.bin As of POI-3.15-final, there are still some bugs in VBAMacroReader. For now, we are swallowing NPE and other runtime exceptions
        Parameters:
        fs - NPOIFS to extract from
        xhtml - SAX writer
        embeddedDocumentExtractor - extractor for embedded documents
        Throws:
        IOException - on IOException if it occurs during the extraction of the embedded doc
        SAXException - on SAXException for writing to xhtml