Package org.apache.tika.example
Class TextStatsFromTikaEval
- java.lang.Object
  - org.apache.tika.example.TextStatsFromTikaEval

public class TextStatsFromTikaEval extends Object

These examples create a new CompositeTextStatsCalculator for each call. This is extremely inefficient because the language ID model and the common words lists have to be loaded on every call.
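
The cost described above is usually avoided by building the expensive object once and reusing it. The following is a minimal sketch of that pattern using only the JDK; `ReusableCalculator` and `ReuseDemo` are hypothetical names, and the `Supplier` stands in for whatever constructs a CompositeTextStatsCalculator. Whether a shared calculator may be used from several threads at once is not stated here and should be checked before relying on it.

```java
import java.util.function.Supplier;

/** Holds one lazily created instance of an expensive-to-build object and reuses it. */
final class ReusableCalculator<T> {
    private final Supplier<T> factory;
    private T instance; // created on first use, then shared

    ReusableCalculator(Supplier<T> factory) {
        this.factory = factory;
    }

    /** Returns the shared instance, building it only on the first call. */
    synchronized T get() {
        if (instance == null) {
            instance = factory.get();
        }
        return instance;
    }
}

final class ReuseDemo {
    public static void main(String[] args) {
        int[] loads = {0};
        // Stand-in for an expensive constructor that loads models and word lists.
        ReusableCalculator<Object> holder = new ReusableCalculator<>(() -> {
            loads[0]++;
            return new Object();
        });
        for (int i = 0; i < 3; i++) {
            holder.get();
        }
        System.out.println("loads = " + loads[0]); // prints "loads = 1"
    }
}
```

With this shape, the loading cost is paid on the first call only, and later calls share the same instance.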
Constructor Summary
- Constructor: TextStatsFromTikaEval()

Method Summary
- Modifier and Type: double
- Method: getOOV(String txt)
- Description: Use the default language ID models and the default common tokens lists in tika-eval to calculate the out-of-vocabulary percentage for a given string.

Method Detail

getOOV

public double getOOV(String txt)

Use the default language ID models and the default common tokens lists in tika-eval to calculate the out-of-vocabulary percentage for a given string.
- Parameters:
  - txt - the string to evaluate
- Returns:
  - the out-of-vocabulary percentage for txt
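
To make the idea of an out-of-vocabulary measure concrete, here is a simplified, JDK-only sketch. It is not tika-eval's implementation: there is no language detection, the tokenizer is deliberately crude, and `OovSketch`, `oov`, and the sample word set are hypothetical. It also returns a fraction between 0.0 and 1.0, which is an assumption of this sketch and not a statement about the scale getOOV uses.

```java
import java.util.Locale;
import java.util.Set;

/** Simplified illustration of an out-of-vocabulary (OOV) ratio; not tika-eval's implementation. */
final class OovSketch {

    /**
     * Returns the fraction (0.0 to 1.0) of alphabetic tokens in txt that are
     * absent from commonTokens, or -1.0 if txt has no alphabetic tokens.
     */
    static double oov(String txt, Set<String> commonTokens) {
        int total = 0;
        int unknown = 0;
        // Split on anything that is not a letter; a deliberately crude tokenizer.
        for (String token : txt.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (token.isEmpty()) {
                continue;
            }
            total++;
            if (!commonTokens.contains(token)) {
                unknown++;
            }
        }
        return total == 0 ? -1.0 : (double) unknown / total;
    }

    public static void main(String[] args) {
        // Hypothetical common-words list; a real list would be per language and far larger.
        Set<String> common = Set.of("the", "cat", "sat", "on", "mat");
        System.out.println(oov("The cat sat on the qwzx mat", common));
    }
}
```

A high value suggests that much of the text does not match the expected vocabulary, which is the kind of signal an OOV score is typically used for.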