Package org.apache.tika.language.detect
Class LanguageHandler
java.lang.Object
org.xml.sax.helpers.DefaultHandler
org.apache.tika.sax.ContentHandlerDecorator
org.apache.tika.sax.WriteOutContentHandler
org.apache.tika.language.detect.LanguageHandler
- All Implemented Interfaces:
ContentHandler
,DTDHandler
,EntityResolver
,ErrorHandler
SAX content handler that updates a language detector based on all the
received character content.
- Since:
- Apache Tika 0.10
-
Constructor Summary
ConstructorDescriptionLanguageHandler
(LanguageDetector detector) LanguageHandler
(LanguageWriter writer) -
Method Summary
Modifier and TypeMethodDescriptionReturns the language detector used by this content handler.Returns the detected language based on text handled thus far.Methods inherited from class org.apache.tika.sax.WriteOutContentHandler
characters, ignorableWhitespace
Methods inherited from class org.apache.tika.sax.ContentHandlerDecorator
endDocument, endElement, endPrefixMapping, error, fatalError, handleException, processingInstruction, setContentHandler, setDocumentLocator, skippedEntity, startDocument, startElement, startPrefixMapping, toString, warning
Methods inherited from class org.xml.sax.helpers.DefaultHandler
notationDecl, resolveEntity, unparsedEntityDecl
-
Constructor Details
-
LanguageHandler
- Throws:
IOException
-
LanguageHandler
-
LanguageHandler
-
-
Method Details
-
getDetector
Returns the language detector used by this content handler. Note that the returned detector gets updated whenever new SAX events are received by this content handler.- Returns:
- language detector
-
getLanguage
Returns the detected language based on text handled thus far.- Returns:
- LanguageResult
-