Class BatchTopCommonTokenCounter

java.lang.Object
org.apache.tika.eval.app.tools.BatchTopCommonTokenCounter

public class BatchTopCommonTokenCounter extends Object
Utility class that runs TopCommonTokenCounter against a directory of table files (named {lang}_table.gz, or Leipzig-like names such as afr_...-sentences.txt) and writes one common-tokens file per input table file to the output directory.
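The two file-naming conventions mentioned above can be illustrated with a small sketch. This is not the Tika implementation: the class name TableFileLang, the method langOf, and both regular expressions are hypothetical, and the sketch assumes that the language code is the part of the file name before the first relevant underscore.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Tika: guesses the language code
// from a table file name using the two conventions described above.
class TableFileLang {
    // Assumed form "{lang}_table.gz", e.g. "eng_table.gz".
    private static final Pattern TABLE =
            Pattern.compile("^(.+)_table\\.gz$");
    // Assumed Leipzig-like form "{lang}_<anything>-sentences.txt".
    private static final Pattern LEIPZIG =
            Pattern.compile("^([^_]+)_.*-sentences\\.txt$");

    // Returns the language code, or empty if the name fits neither form.
    static Optional<String> langOf(String fileName) {
        for (Pattern p : new Pattern[] {TABLE, LEIPZIG}) {
            Matcher m = p.matcher(fileName);
            if (m.matches()) {
                return Optional.of(m.group(1));
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        // Print the guessed language for each file name given on the command line.
        for (String name : args) {
            System.out.println(name + " -> " + langOf(name).orElse("(skipped)"));
        }
    }
}
```

Names that match neither convention yield an empty Optional, so a caller could skip unrelated files in the input directory instead of failing on them.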
  • Constructor Details

    • BatchTopCommonTokenCounter

      public BatchTopCommonTokenCounter()
  • Method Details