Class TypeDetector

  • All Implemented Interfaces:
    Serializable, Detector

    public class TypeDetector
    extends Object
    implements Detector
    Content type detection based on a content type hint. This detector simply trusts any valid content type hint given in the input metadata, and returns that as the likely type of the input document.
    Apache Tika 0.3
    See Also:
    Serialized Form
    • Constructor Detail

      • TypeDetector

        public TypeDetector()
    • Method Detail

      • detect

        public MediaType detect​(InputStream input,
                                Metadata metadata)
        Detects the content type of an input document based on a type hint given in the input metadata. The CONTENT_TYPE attribute of the given input metadata is expected to contain the type of the input document. If that attribute exists and contains a valid type name, then that type is returned.
        Specified by:
        detect in interface Detector
        input - ignored
        metadata - input metadata, possibly with a CONTENT_TYPE value
        detected media type, or application/octet-stream