Package org.apache.tika.eval.core.tokens
Class CommonTokenCountManager
- java.lang.Object
-
- org.apache.tika.eval.core.tokens.CommonTokenCountManager
-
public class CommonTokenCountManager extends Object
-
-
Constructor Summary
Constructors Constructor Description CommonTokenCountManager()CommonTokenCountManager(Path commonTokensDir, String defaultLangCode)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description voidclose()Set<String>getLangs()org.apache.commons.lang3.tuple.Pair<String,LangModel>getLangTokens(String lang)Set<String>getTokens(String lang)
-
-
-
Method Detail
-
getLangTokens
public org.apache.commons.lang3.tuple.Pair<String,LangModel> getLangTokens(String lang)
- Parameters:
lang-- Returns:
- pair of actual language code used and a set of common tokens for that language
-
close
public void close() throws IOException- Throws:
IOException
-
-