Class AnalyzerManager


  • public class AnalyzerManager
    extends Object
    • Method Detail

      • newInstance

        public static AnalyzerManager newInstance(int maxTokens)
      • getGeneralAnalyzer

        public org.apache.lucene.analysis.Analyzer getGeneralAnalyzer()
        Use this analyzer to extract all tokens.
        Returns:
        the general-purpose org.apache.lucene.analysis.Analyzer
      • getCommonTokensAnalyzer

        public org.apache.lucene.analysis.Analyzer getCommonTokensAnalyzer()
        Use this analyzer to generate common-token lists from large corpora. tika-eval does not use it when profiling or comparing.
        Returns:
        the org.apache.lucene.analysis.Analyzer for building common-token lists
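
To make the idea of a common-token list concrete, here is a minimal plain-Java sketch that counts token frequencies across a small corpus and keeps the most frequent ones. It does not use Lucene or tika-eval, and it is not how tika-eval builds its lists. The class `CommonTokensSketch`, the method `commonTokens`, the regex-based tokenization, and the top-N cutoff are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Illustration only: a hypothetical stand-in for what a "common tokens list" is.
// This is NOT the tika-eval implementation and does not call any Lucene analyzer.
final class CommonTokensSketch {

    // Returns up to topN tokens, most frequent first; ties are broken alphabetically.
    static List<String> commonTokens(List<String> corpus, int topN) {
        Map<String, Integer> counts = new HashMap<>();
        for (String doc : corpus) {
            // Assumed tokenization: lowercase, then split on anything
            // that is not a Unicode letter or digit.
            String[] tokens = doc.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+");
            for (String token : tokens) {
                if (!token.isEmpty()) {
                    counts.merge(token, 1, Integer::sum);
                }
            }
        }

        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((a, b) -> {
            int byCount = Integer.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });

        List<String> result = new ArrayList<>();
        for (int i = 0; i < Math.min(topN, entries.size()); i++) {
            result.add(entries.get(i).getKey());
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> corpus = List.of("the cat and the dog", "the dog barked", "a cat");
        System.out.println(commonTokens(corpus, 3));
    }
}
```

In a real pipeline the tokenization step would presumably be delegated to the analyzer returned by getCommonTokensAnalyzer(), and the corpus would be far larger than this three-document example.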