Uses of Interface
org.apache.tika.detect.EncodingDetector
Packages that use EncodingDetector
Package
Description
Media type detection.
Tika parsers.
-
Uses of EncodingDetector in org.apache.tika.config.loader
Methods in org.apache.tika.config.loader that return EncodingDetectorModifier and TypeMethodDescriptionprotected EncodingDetectorEncodingDetectorLoader.createDefaultComposite(Set<Class<? extends EncodingDetector>> exclusions, LoaderContext context) LoaderContext.getEncodingDetector()Get the EncodingDetector for injection into parsers.protected EncodingDetectorEncodingDetectorLoader.loadComponent(String name, com.fasterxml.jackson.databind.JsonNode configNode, LoaderContext context) TikaLoader.loadEncodingDetectors()Loads and returns all encoding detectors.protected EncodingDetectorEncodingDetectorLoader.wrapInComposite(List<EncodingDetector> detectors, LoaderContext context) Method parameters in org.apache.tika.config.loader with type arguments of type EncodingDetectorModifier and TypeMethodDescriptionprotected EncodingDetectorEncodingDetectorLoader.createDefaultComposite(Set<Class<? extends EncodingDetector>> exclusions, LoaderContext context) protected EncodingDetectorEncodingDetectorLoader.wrapInComposite(List<EncodingDetector> detectors, LoaderContext context) -
Uses of EncodingDetector in org.apache.tika.detect
Subinterfaces of EncodingDetector in org.apache.tika.detectModifier and TypeInterfaceDescriptioninterfaceMarker interface for encoding detectors that arbitrate among candidates collected by base detectors rather than detecting encoding directly from the stream.Classes in org.apache.tika.detect that implement EncodingDetectorModifier and TypeClassDescriptionclassEncoding detector that identifies the character set from a byte-order mark (BOM) at the start of the stream.classA composite encoding detector that runs child detectors.classA composite encoding detector based on all theEncodingDetectorimplementations available through theservice provider mechanism.classEncoding detector that extracts a declared charset from Tika metadata without reading any bytes from the stream.classAlways returns the charset passed in via the initializerMethods in org.apache.tika.detect that return types with arguments of type EncodingDetectorConstructors in org.apache.tika.detect with parameters of type EncodingDetectorModifierConstructorDescriptionAutoDetectReader(InputStream stream, Metadata metadata, EncodingDetector encodingDetector) Constructor parameters in org.apache.tika.detect with type arguments of type EncodingDetectorModifierConstructorDescriptionCompositeEncodingDetector(List<EncodingDetector> detectors) CompositeEncodingDetector(List<EncodingDetector> detectors, Collection<Class<? extends EncodingDetector>> excludeEncodingDetectors) CompositeEncodingDetector(List<EncodingDetector> detectors, Collection<Class<? extends EncodingDetector>> excludeEncodingDetectors) DefaultEncodingDetector(ServiceLoader loader, Collection<Class<? extends EncodingDetector>> excludeEncodingDetectors) -
Uses of EncodingDetector in org.apache.tika.ml.chardetect
Classes in org.apache.tika.ml.chardetect that implement EncodingDetectorModifier and TypeClassDescriptionclassNaive-Bayes pipeline detector: structural checks for wide Unicode + BOMs before falling through to the bigram NB classifier for everything else.classNaive-Bayes byte-bigram charset classifier.classUTF-16 specialist detector of the mixture-of-experts charset detection architecture. -
Uses of EncodingDetector in org.apache.tika.ml.junkdetect
Classes in org.apache.tika.ml.junkdetect that implement EncodingDetectorModifier and TypeClassDescriptionclassAMetaEncodingDetectorthat arbitrates charset candidates by asking aTextQualityDetectorwhich decoded candidate looks most like natural text. -
Uses of EncodingDetector in org.apache.tika.parser
Methods in org.apache.tika.parser that return EncodingDetectorModifier and TypeMethodDescriptionAbstractEncodingDetectorParser.getEncodingDetector()protected EncodingDetectorAbstractEncodingDetectorParser.getEncodingDetector(ParseContext parseContext) Look for an EncodingDetetor in the ParseContext.Methods in org.apache.tika.parser with parameters of type EncodingDetectorModifier and TypeMethodDescriptionvoidAbstractEncodingDetectorParser.setEncodingDetector(EncodingDetector encodingDetector) Constructors in org.apache.tika.parser with parameters of type EncodingDetectorModifierConstructorDescriptionAbstractEncodingDetectorParser(EncodingDetector encodingDetector) DefaultParser(MediaTypeRegistry registry, ServiceLoader loader, Collection<Class<? extends Parser>> excludeParsers, EncodingDetector encodingDetector, Renderer renderer) DefaultParser(MediaTypeRegistry registry, ServiceLoader loader, EncodingDetector encodingDetector, Renderer renderer) -
Uses of EncodingDetector in org.apache.tika.parser.code
Constructors in org.apache.tika.parser.code with parameters of type EncodingDetector -
Uses of EncodingDetector in org.apache.tika.parser.csv
Constructors in org.apache.tika.parser.csv with parameters of type EncodingDetector -
Uses of EncodingDetector in org.apache.tika.parser.envi
Constructors in org.apache.tika.parser.envi with parameters of type EncodingDetector -
Uses of EncodingDetector in org.apache.tika.parser.html
Classes in org.apache.tika.parser.html that implement EncodingDetectorModifier and TypeClassDescriptionclassCharacter encoding detector for determining the character encoding of a HTML document based on the potential charset parameter found in a Content-Type http-equiv meta tag somewhere near the beginning.Methods in org.apache.tika.parser.html that return EncodingDetectorModifier and TypeMethodDescriptionprotected EncodingDetectorJSoupParser.getEncodingDetector(ParseContext parseContext) Look for an EncodingDetetor in the ParseContext.Constructors in org.apache.tika.parser.html with parameters of type EncodingDetector -
Uses of EncodingDetector in org.apache.tika.parser.html.charsetdetector
Classes in org.apache.tika.parser.html.charsetdetector that implement EncodingDetectorModifier and TypeClassDescriptionfinal classFull WHATWG prescan charset detector for HTML: HTTP Content-Type header →<meta charset>/<meta http-equiv>tag, per https://html.spec.whatwg.org/multipage/parsing.html#the-input-byte-stream. -
Uses of EncodingDetector in org.apache.tika.parser.mif
Constructors in org.apache.tika.parser.mif with parameters of type EncodingDetector -
Uses of EncodingDetector in org.apache.tika.parser.pkg
Constructors in org.apache.tika.parser.pkg with parameters of type EncodingDetectorModifierConstructorDescriptionAbstractArchiveParser(EncodingDetector encodingDetector) ZipParser(EncodingDetector encodingDetector) -
Uses of EncodingDetector in org.apache.tika.parser.txt
Classes in org.apache.tika.parser.txt that implement EncodingDetectorConstructors in org.apache.tika.parser.txt with parameters of type EncodingDetector