Package org.apache.tika.detect
@Version("1.0.0")
package org.apache.tika.detect
Media type detection.
Class Description
An input stream reader that automatically detects the character encoding to be used for converting bytes to characters.
Content type detector that combines multiple different detection mechanisms.
A composite detector based on all the Detector implementations available through the service provider mechanism.
A composite encoding detector based on all the EncodingDetector implementations available through the service provider mechanism.
A version of DefaultDetector for probabilistic mime detectors, which use statistical techniques to blend the results of differing underlying detectors when attempting to detect the type of a given file.
Content type detector.
Dummy detector that returns application/octet-stream for all documents.
Character encoding detector.
This runs the Linux 'file' command against a file.
Content type detection based on magic bytes.
Content type detection based on the resource name.
Always returns the charset passed in via the initializer.
Deprecated. After 2.5.0 this functionality was moved to the CompositeDetector.
Content type detection of plain text documents.
Utility class for computing a histogram of the bytes seen in a stream.
Content type detection based on a content type hint.
Utility class that uses a SAXParser to determine the namespace URI and local name of the root element of an XML file.
Detector to identify zero length files as application/x-zerovalue.
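
The descriptions above mention several detection mechanisms (zero-length check, magic bytes, resource name) and a composite that combines them. The following is a minimal stand-alone sketch of that idea using only the JDK. It does not use or reproduce the Tika classes: DetectionSketch, SketchDetector, the two magic prefixes, the two file extensions and the "first non-default result wins" rule are all assumptions made for illustration, and the real composite detector may combine results differently.

```java
import java.nio.charset.StandardCharsets;
import java.util.List;
import java.util.Locale;

/** Hypothetical sketch of combining detection mechanisms; not the Tika API. */
final class DetectionSketch {

    /** Fallback type returned when a mechanism cannot decide. */
    static final String OCTET_STREAM = "application/octet-stream";

    /** Hypothetical detector contract: leading bytes plus an optional resource name. */
    interface SketchDetector {
        String detect(byte[] head, String name);
    }

    /** Returns true if data begins with the given prefix. */
    static boolean startsWith(byte[] data, byte[] prefix) {
        if (data.length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if (data[i] != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    /** Zero-length content is reported as application/x-zerovalue. */
    static final SketchDetector ZERO_SIZE = (head, name) ->
            head.length == 0 ? "application/x-zerovalue" : OCTET_STREAM;

    /** Magic bytes: compare the start of the content against known prefixes (assumed set). */
    static final SketchDetector MAGIC = (head, name) -> {
        if (startsWith(head, "%PDF-".getBytes(StandardCharsets.US_ASCII))) {
            return "application/pdf";
        }
        if (startsWith(head, new byte[] {(byte) 0x89, 'P', 'N', 'G'})) {
            return "image/png";
        }
        return OCTET_STREAM;
    };

    /** Resource name: decide from the file extension only (assumed mapping). */
    static final SketchDetector NAME = (head, name) -> {
        if (name == null) {
            return OCTET_STREAM;
        }
        String lower = name.toLowerCase(Locale.ROOT);
        if (lower.endsWith(".pdf")) {
            return "application/pdf";
        }
        if (lower.endsWith(".txt")) {
            return "text/plain";
        }
        return OCTET_STREAM;
    };

    /** Composite: try each mechanism in order; the first non-default answer wins (assumption). */
    static String detect(byte[] head, String name) {
        for (SketchDetector detector : List.of(ZERO_SIZE, MAGIC, NAME)) {
            String type = detector.detect(head, name);
            if (!OCTET_STREAM.equals(type)) {
                return type;
            }
        }
        return OCTET_STREAM;
    }

    public static void main(String[] args) {
        byte[] pdf = "%PDF-1.7".getBytes(StandardCharsets.US_ASCII);
        byte[] text = "hello".getBytes(StandardCharsets.US_ASCII);
        System.out.println(detect(pdf, null));          // application/pdf
        System.out.println(detect(text, "notes.txt"));  // text/plain
        System.out.println(detect(new byte[0], null));  // application/x-zerovalue
    }
}
```

In this sketch the content-based checks run before the name-based one, so a misleading file extension does not override recognizable magic bytes. That ordering is a design choice of the example, not a statement about how the package orders its detectors.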