Package org.apache.tika.config
Class TikaConfig
- java.lang.Object
- 
- org.apache.tika.config.TikaConfig
 
- 
 public class TikaConfig extends Object Parse xml config file.
- 
- 
Field SummaryFields Modifier and Type Field Description static intDEFAULT_MAX_JSON_STRING_FIELD_LENGTHstatic StringMAX_JSON_STRING_FIELD_LENGTH_ELEMENT_NAMEprotected static AtomicIntegerTIMES_INSTANTIATED
 - 
Constructor SummaryConstructors Constructor Description TikaConfig()Creates a default Tika configuration.TikaConfig(File file)TikaConfig(File file, ServiceLoader loader)TikaConfig(InputStream stream)TikaConfig(ClassLoader loader)Creates a Tika configuration from the built-in media type rules and all theParserimplementations available through theservice provider mechanismin the given class loader.TikaConfig(String file)TikaConfig(URL url)TikaConfig(URL url, ClassLoader loader)TikaConfig(URL url, ServiceLoader loader)TikaConfig(Path path)TikaConfig(Path path, ServiceLoader loader)TikaConfig(Document document)TikaConfig(Document document, ServiceLoader loader)TikaConfig(Element element)TikaConfig(Element element, ClassLoader loader)
 - 
Method Summary
 
- 
- 
- 
Field Detail- 
DEFAULT_MAX_JSON_STRING_FIELD_LENGTHpublic static int DEFAULT_MAX_JSON_STRING_FIELD_LENGTH 
 - 
MAX_JSON_STRING_FIELD_LENGTH_ELEMENT_NAMEpublic static String MAX_JSON_STRING_FIELD_LENGTH_ELEMENT_NAME 
 - 
TIMES_INSTANTIATEDprotected static final AtomicInteger TIMES_INSTANTIATED 
 
- 
 - 
Constructor Detail- 
TikaConfigpublic TikaConfig(String file) throws TikaException, IOException, SAXException - Throws:
- TikaException
- IOException
- SAXException
 
 - 
TikaConfigpublic TikaConfig(Path path) throws TikaException, IOException, SAXException - Throws:
- TikaException
- IOException
- SAXException
 
 - 
TikaConfigpublic TikaConfig(Path path, ServiceLoader loader) throws TikaException, IOException, SAXException - Throws:
- TikaException
- IOException
- SAXException
 
 - 
TikaConfigpublic TikaConfig(File file) throws TikaException, IOException, SAXException - Throws:
- TikaException
- IOException
- SAXException
 
 - 
TikaConfigpublic TikaConfig(File file, ServiceLoader loader) throws TikaException, IOException, SAXException - Throws:
- TikaException
- IOException
- SAXException
 
 - 
TikaConfigpublic TikaConfig(URL url) throws TikaException, IOException, SAXException - Throws:
- TikaException
- IOException
- SAXException
 
 - 
TikaConfigpublic TikaConfig(URL url, ClassLoader loader) throws TikaException, IOException, SAXException - Throws:
- TikaException
- IOException
- SAXException
 
 - 
TikaConfigpublic TikaConfig(URL url, ServiceLoader loader) throws TikaException, IOException, SAXException - Throws:
- TikaException
- IOException
- SAXException
 
 - 
TikaConfigpublic TikaConfig(InputStream stream) throws TikaException, IOException, SAXException - Throws:
- TikaException
- IOException
- SAXException
 
 - 
TikaConfigpublic TikaConfig(Document document) throws TikaException, IOException - Throws:
- TikaException
- IOException
 
 - 
TikaConfigpublic TikaConfig(Document document, ServiceLoader loader) throws TikaException, IOException - Throws:
- TikaException
- IOException
 
 - 
TikaConfigpublic TikaConfig(Element element) throws TikaException, IOException - Throws:
- TikaException
- IOException
 
 - 
TikaConfigpublic TikaConfig(Element element, ClassLoader loader) throws TikaException, IOException - Throws:
- TikaException
- IOException
 
 - 
TikaConfigpublic TikaConfig(ClassLoader loader) throws MimeTypeException, IOException Creates a Tika configuration from the built-in media type rules and all theParserimplementations available through theservice provider mechanismin the given class loader.- Parameters:
- loader- the class loader through which parser implementations are loaded, or- nullfor no parsers
- Throws:
- MimeTypeException- if the built-in media type rules are broken
- IOException- if the built-in media type rules can not be read
- Since:
- Apache Tika 0.8
 
 - 
TikaConfigpublic TikaConfig() throws TikaException, IOExceptionCreates a default Tika configuration. First checks whether an XML config file is specified, either in- System property "tika.config", or
- Environment variable TIKA_CONFIG
 If one of these have a value, try to resolve it relative to file system or classpath. If XML config is not specified, initialize from the built-in media type rules and all the Parserimplementations available through theservice provider mechanismin the context class loader of the current thread.- Throws:
- IOException- if the configuration can not be read
- TikaException- if problem with MimeTypes or parsing XML config
 
 
- 
 - 
Method Detail- 
getMaxJsonStringFieldLengthpublic static int getMaxJsonStringFieldLength() - Returns:
- maximum field length when serializing String fields in Tika's metadata or metadata list into JSON
 
 - 
getDefaultDetectorprotected static CompositeDetector getDefaultDetector(MimeTypes types, ServiceLoader loader) 
 - 
getDefaultEncodingDetectorprotected static CompositeEncodingDetector getDefaultEncodingDetector(ServiceLoader loader) 
 - 
getDefaultRendererprotected static CompositeRenderer getDefaultRenderer(ServiceLoader loader) 
 - 
getDefaultConfigpublic static TikaConfig getDefaultConfig() Provides a default configuration (TikaConfig). Currently creates a new instance each time it's called; we may be able to have it return a shared instance once it is completely immutable.- Returns:
- default configuration
 
 - 
mustNotBeEmptypublic static void mustNotBeEmpty(String paramName, String paramValue) throws TikaConfigException - Throws:
- TikaConfigException
 
 - 
mustNotBeEmptypublic static void mustNotBeEmpty(String paramName, Path paramValue) throws TikaConfigException - Throws:
- TikaConfigException
 
 - 
getParserpublic Parser getParser() Returns the configured parser instance.- Returns:
- configured parser
 
 - 
getDetectorpublic Detector getDetector() Returns the configured detector instance.- Returns:
- configured detector
 
 - 
getEncodingDetectorpublic EncodingDetector getEncodingDetector() Returns the configured encoding detector instance- Returns:
- configured encoding detector
 
 - 
getTranslatorpublic Translator getTranslator() Returns the configured translator instance.- Returns:
- configured translator
 
 - 
getExecutorServicepublic ExecutorService getExecutorService() 
 - 
getMimeRepositorypublic MimeTypes getMimeRepository() 
 - 
getMediaTypeRegistrypublic MediaTypeRegistry getMediaTypeRegistry() 
 - 
getServiceLoaderpublic ServiceLoader getServiceLoader() 
 - 
getMetadataFilterpublic MetadataFilter getMetadataFilter() 
 - 
getAutoDetectParserConfigpublic AutoDetectParserConfig getAutoDetectParserConfig() 
 
- 
 
-