Class AbstractOfficeParser

java.lang.Object
org.apache.tika.parser.microsoft.AbstractOfficeParser
All Implemented Interfaces:
Serializable, SelfConfiguring, Parser
Direct Known Subclasses:
OfficeParser, OOXMLParser, Word2006MLParser

public abstract class AbstractOfficeParser extends Object implements Parser
Intermediate layer to set OfficeParserConfig uniformly.
  • Constructor Details

    • AbstractOfficeParser

      public AbstractOfficeParser()
  • Method Details

    • configure

      public void configure(ParseContext parseContext)
      Checks to see if the user has specified an OfficeParserConfig. If so, no changes are made; if not, one is added to the context.
      Parameters:
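      parseContext - the parse context to check for an existing OfficeParserConfig; a default is added if none is present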
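The "add a default only when the user has not supplied one" behavior described for configure can be sketched as follows. This is a minimal model, not Tika's implementation: SketchConfig, SketchContext, and ConfigureSketch are hypothetical stand-ins for OfficeParserConfig, ParseContext, and a parser subclass, written so the example compiles without Tika on the classpath.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for OfficeParserConfig (assumption, not the Tika class).
final class SketchConfig {
    final String label;

    SketchConfig(String label) {
        this.label = label;
    }
}

// Hypothetical stand-in for ParseContext: a map keyed by class.
final class SketchContext {
    private final Map<Class<?>, Object> entries = new HashMap<>();

    <T> void set(Class<T> key, T value) {
        entries.put(key, value);
    }

    <T> T get(Class<T> key) {
        // Class.cast passes null through, so a missing entry yields null.
        return key.cast(entries.get(key));
    }
}

// Hypothetical parser that mirrors the documented configure() contract.
final class ConfigureSketch {
    private final SketchConfig defaultConfig = new SketchConfig("default");

    void configure(SketchContext context) {
        // A user-supplied config is left untouched; otherwise the default is added.
        if (context.get(SketchConfig.class) == null) {
            context.set(SketchConfig.class, defaultConfig);
        }
    }

    public static void main(String[] args) {
        SketchContext empty = new SketchContext();
        new ConfigureSketch().configure(empty);
        System.out.println(empty.get(SketchConfig.class).label);

        SketchContext preset = new SketchContext();
        preset.set(SketchConfig.class, new SketchConfig("user"));
        new ConfigureSketch().configure(preset);
        System.out.println(preset.get(SketchConfig.class).label);
    }
}
```

In this sketch the caller's own configuration always wins, which is the property the description above states for configure.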
    • setDefaultOfficeParserConfig

      protected void setDefaultOfficeParserConfig(OfficeParserConfig config)
      Allows subclasses to set the default configuration during construction.
      Parameters:
      config - the configuration to use as default
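A subclass setting its default configuration during construction might look like the sketch below. All names here (StandInConfig, AbstractSketchParser, TunedSketchParser, PlainSketchParser) are hypothetical stand-ins rather than Tika classes, and the "base-default" value the base class starts with is an assumption of the sketch.

```java
// Hypothetical stand-in for OfficeParserConfig (assumption, not the Tika class).
final class StandInConfig {
    final String name;

    StandInConfig(String name) {
        this.name = name;
    }
}

// Hypothetical base class holding the default configuration.
abstract class AbstractSketchParser {
    // Assumed starting value; the real class may initialize this differently.
    private StandInConfig defaultConfig = new StandInConfig("base-default");

    // Protected so that only subclasses can replace the default.
    protected void setDefaultConfig(StandInConfig config) {
        this.defaultConfig = config;
    }

    public StandInConfig getDefaultConfig() {
        return defaultConfig;
    }
}

// A subclass that installs its own default in the constructor.
final class TunedSketchParser extends AbstractSketchParser {
    TunedSketchParser() {
        setDefaultConfig(new StandInConfig("tuned"));
    }
}

// A subclass that keeps whatever the base class provides.
final class PlainSketchParser extends AbstractSketchParser {
    public static void main(String[] args) {
        System.out.println(new TunedSketchParser().getDefaultConfig().name);
        System.out.println(new PlainSketchParser().getDefaultConfig().name);
    }
}
```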
    • getDefaultConfig

      public OfficeParserConfig getDefaultConfig()
    • setByteArrayMaxOverride

      public void setByteArrayMaxOverride(int maxOverride)
      WARNING: this sets a static variable in POI. It lets users override POI's protection against allocating overly large byte arrays. Use it carefully, and please open issues on POI's Bugzilla to raise the limits for specific records. If the value is <= 0, it is ignored.
      Parameters:
      maxOverride - the maximum byte array length to allow; values <= 0 are ignored
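The two points in the warning, that the limit is static and that non-positive values are ignored, can be illustrated with a small model. ByteArrayLimitSketch is a hypothetical class that keeps the limit in its own static field instead of calling POI, and having the getter read that shared field is an assumption of the sketch, not documented behavior.

```java
// Hypothetical model of a process-wide byte array limit (not Tika or POI code).
final class ByteArrayLimitSketch {
    // Stand-in for the static value held by the underlying library; -1 means "no override".
    private static int sharedMaxOverride = -1;

    void setByteArrayMaxOverride(int maxOverride) {
        // Values <= 0 are ignored, as the documentation above states.
        if (maxOverride > 0) {
            sharedMaxOverride = maxOverride;
        }
    }

    int getByteArrayMaxOverride() {
        return sharedMaxOverride;
    }

    public static void main(String[] args) {
        ByteArrayLimitSketch first = new ByteArrayLimitSketch();
        ByteArrayLimitSketch second = new ByteArrayLimitSketch();

        first.setByteArrayMaxOverride(500_000_000);
        // The second instance sees the change because the field is static.
        System.out.println(second.getByteArrayMaxOverride());

        first.setByteArrayMaxOverride(0);
        // The non-positive value was ignored, so the earlier limit remains.
        System.out.println(second.getByteArrayMaxOverride());
    }
}
```

Because the value is shared, setting it on one parser instance affects every other parser in the same JVM, which is why the warning asks for care.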
    • getByteArrayMaxOverride

      public int getByteArrayMaxOverride()