Package | Description |
---|---|
org.apache.tika |
Apache Tika.
|
org.apache.tika.config |
Tika configuration tools.
|
org.apache.tika.detect |
Media type detection.
|
org.apache.tika.example | |
org.apache.tika.extractor |
Extraction of component documents.
|
org.apache.tika.mime |
Media type information.
|
org.apache.tika.parser |
Tika parsers.
|
org.apache.tika.parser.apple | |
org.apache.tika.parser.microsoft | |
org.apache.tika.parser.pkg | |
org.apache.tika.server.resource |
Modifier and Type | Method and Description |
---|---|
Detector |
Tika.getDetector()
Returns the detector instance used by this facade.
|
Constructor and Description |
---|
Tika(Detector detector)
Creates a Tika facade using the given detector instance, the
default parser configuration, and the default Translator.
|
Tika(Detector detector,
Parser parser)
Creates a Tika facade using the given detector and parser instances, but the default Translator.
|
Tika(Detector detector,
Parser parser,
Translator translator)
Creates a Tika facade using the given detector, parser, and translator instances.
|
Modifier and Type | Method and Description |
---|---|
Detector |
TikaConfig.getDetector()
Returns the configured detector instance.
|
Modifier and Type | Class and Description |
---|---|
class |
CompositeDetector
Content type detector that combines multiple different detection mechanisms.
|
class |
DefaultDetector
A composite detector based on all the
Detector implementations
available through the service provider mechanism . |
class |
DefaultProbDetector
A version of
DefaultDetector for probabilistic mime
detectors, which use statistical techniques to blend the
results of differing underlying detectors when attempting
to detect the type of a given file. |
class |
EmptyDetector
Dummy detector that returns application/octet-stream for all documents.
|
class |
FileCommandDetector
This runs the linux 'file' command against a file.
|
class |
MagicDetector
Content type detection based on magic bytes, i.e.
|
class |
NameDetector
Content type detection based on the resource name.
|
class |
NNExampleModelDetector |
class |
OverrideDetector
Use this to force a content type detection via the
TikaCoreProperties.CONTENT_TYPE_OVERRIDE key in the metadata object. |
class |
TextDetector
Content type detection of plain text documents.
|
class |
TrainedModelDetector |
class |
TypeDetector
Content type detection based on a content type hint.
|
class |
ZeroSizeFileDetector
Detector to identify zero length files as application/x-zerovalue
|
Modifier and Type | Method and Description |
---|---|
List<Detector> |
DefaultProbDetector.getDetectors() |
List<Detector> |
DefaultDetector.getDetectors() |
List<Detector> |
CompositeDetector.getDetectors()
Returns the component detectors.
|
Constructor and Description |
---|
CompositeDetector(Detector... detectors) |
Constructor and Description |
---|
CompositeDetector(List<Detector> detectors) |
CompositeDetector(MediaTypeRegistry registry,
List<Detector> detectors) |
CompositeDetector(MediaTypeRegistry registry,
List<Detector> detectors,
Collection<Class<? extends Detector>> excludeDetectors) |
CompositeDetector(MediaTypeRegistry registry,
List<Detector> detectors,
Collection<Class<? extends Detector>> excludeDetectors) |
DefaultDetector(MimeTypes types,
ServiceLoader loader,
Collection<Class<? extends Detector>> excludeDetectors) |
Modifier and Type | Class and Description |
---|---|
class |
EncryptedPrescriptionDetector |
Constructor and Description |
---|
ImportContextImpl(javax.jcr.Item importRoot,
String systemId,
org.apache.jackrabbit.webdav.io.InputContext ctx,
InputStream stream,
org.apache.jackrabbit.server.io.IOListener ioListener,
Detector detector)
Creates a new item import context.
|
Modifier and Type | Method and Description |
---|---|
Detector |
EmbeddedDocumentUtil.getDetector() |
Constructor and Description |
---|
ParserContainerExtractor(Parser parser,
Detector detector) |
Modifier and Type | Class and Description |
---|---|
class |
MimeTypes
This class is a MimeType repository.
|
class |
ProbabilisticMimeDetectionSelector
Selector for combining different mime detection results
based on probability
|
Modifier and Type | Method and Description |
---|---|
Detector |
AutoDetectParser.getDetector()
Returns the type detector used by this parser to auto-detect the type
of a document.
|
Modifier and Type | Method and Description |
---|---|
void |
AutoDetectParser.setDetector(Detector detector)
Sets the type detector used by this parser to auto-detect the type
of a document.
|
Constructor and Description |
---|
AutoDetectParser(Detector detector) |
AutoDetectParser(Detector detector,
Parser... parsers) |
Modifier and Type | Class and Description |
---|---|
class |
BPListDetector
Detector for BPList with utility functions for PList.
|
Modifier and Type | Class and Description |
---|---|
class |
POIFSContainerDetector
A detector that works on a POIFS OLE2 document
to figure out exactly what the file is.
|
Modifier and Type | Class and Description |
---|---|
class |
StreamingZipContainerDetector |
class |
ZipContainerDetector
A detector that works on Zip documents and other archive and compression
formats to figure out exactly what the file is.
|
Modifier and Type | Method and Description |
---|---|
static Detector |
TikaResource.getDetector(Parser p) |
Modifier and Type | Method and Description |
---|---|
static void |
TikaResource.setDetector(Parser p,
Detector detector) |
Copyright © 2007–1969 The Apache Software Foundation. All rights reserved.