public interface EncodingDetector extends Serializable
Charset detect(InputStream input, Metadata metadata) throws IOException
Detects the character encoding of the given text document, or returns
null if the encoding of the document cannot be detected.
If the document input stream is not available, then the first argument may be null.
Otherwise the detector may read bytes from the start of the stream to help in encoding detection.
The given stream is guaranteed to support the mark feature, and the detector is expected to
mark the stream before reading any bytes from it and to reset the stream before returning.
The stream must not be closed by the detector.
The given input metadata is only read, not modified, by the detector.
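The mark/reset contract described above can be illustrated with a minimal sketch. It is not Tika's own detector: the class name BomSniffer is hypothetical, the logic only looks for a byte order mark, and to stay free of the Tika dependency it omits the Metadata argument and does not declare that it implements EncodingDetector.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Tika: follows the same stream contract
// as EncodingDetector.detect but takes only the input stream.
final class BomSniffer {

    private BomSniffer() {
    }

    // Returns the charset indicated by a leading byte order mark, or null
    // if the stream is null or no known BOM is found. UTF-32 BOMs are not
    // distinguished from UTF-16 ones in this sketch.
    static Charset detect(InputStream input) throws IOException {
        if (input == null) {
            return null; // stream not available
        }
        byte[] head = new byte[3];
        input.mark(head.length); // mark before reading any bytes
        try {
            // Read up to three bytes; read() may return fewer than requested.
            int n = 0;
            while (n < head.length) {
                int r = input.read(head, n, head.length - n);
                if (r < 0) {
                    break; // end of stream
                }
                n += r;
            }
            int b0 = n > 0 ? head[0] & 0xFF : -1;
            int b1 = n > 1 ? head[1] & 0xFF : -1;
            int b2 = n > 2 ? head[2] & 0xFF : -1;
            if (b0 == 0xEF && b1 == 0xBB && b2 == 0xBF) {
                return StandardCharsets.UTF_8;
            }
            if (b0 == 0xFE && b1 == 0xFF) {
                return StandardCharsets.UTF_16BE;
            }
            if (b0 == 0xFF && b1 == 0xFE) {
                return StandardCharsets.UTF_16LE;
            }
            return null; // encoding could not be detected
        } finally {
            input.reset(); // rewind before returning; the stream is never closed
        }
    }
}
```

Because the stream is reset in the finally block, a caller can pass the same stream on to a parser after detection and still see the document from its first byte.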
input - text document input stream, or null
metadata - input metadata for the document
IOException - if the document input stream could not be read
Copyright © 2007–2019 The Apache Software Foundation. All rights reserved.