Package org.apache.tika.eval.app.tools
Class BatchTopCommonTokenCounter
java.lang.Object
org.apache.tika.eval.app.tools.BatchTopCommonTokenCounter
Utility class that runs TopCommonTokenCounter against a directory
of table files (named {lang}_table.gz, or Leipzig-like names such as afr_...-sentences.txt)
and writes a common tokens file for each input table file to the output directory.
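The file-naming convention described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the class name TableFileNames, the helper langCode, the two regular expressions, and the sample file names are hypothetical and are not taken from the Tika source, which may recognize table files differently.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: not part of Tika. It only shows how a language code
// could be derived from the two file-name shapes mentioned in the description.
public class TableFileNames {

    // Assumed shape: {lang}_table.gz, e.g. eng_table.gz
    private static final Pattern TABLE =
            Pattern.compile("^([a-z]{2,3}(?:-[a-z0-9]+)?)_table\\.gz$");

    // Assumed Leipzig-like shape: {lang}_<anything>-sentences.txt
    private static final Pattern LEIPZIG =
            Pattern.compile("^([a-z]{2,3})_.*-sentences\\.txt$");

    // Returns the language code if the name matches either shape, else empty.
    static Optional<String> langCode(String fileName) {
        for (Pattern p : new Pattern[] {TABLE, LEIPZIG}) {
            Matcher m = p.matcher(fileName);
            if (m.matches()) {
                return Optional.of(m.group(1));
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        // Sample names are invented for the demonstration.
        String[] names = {
            "eng_table.gz",
            "afr_newscrawl_2013_1M-sentences.txt",
            "README.md"
        };
        for (String name : names) {
            System.out.println(name + " -> " + langCode(name).orElse("(skipped)"));
        }
    }
}
```

A batch driver along these lines would list the input directory, skip names for which langCode returns empty, and hand each remaining file to the single-file counter together with an output path in the target directory.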
Constructor Summary
Method Summary
Constructor Details
BatchTopCommonTokenCounter
public BatchTopCommonTokenCounter()
Method Details
main
Throws:
Exception