public class URLEmailNormalizingFilterFactory
extends org.apache.lucene.analysis.util.TokenFilterFactory
UAX29URLEmailTokenizer is used! This must be run _before_ the AlphaIdeographFilterFactory, or else the emails/URLs will already be removed!

Constructor and Description |
---|
URLEmailNormalizingFilterFactory(Map<String,String> args) |
Modifier and Type | Method and Description |
---|---|
org.apache.lucene.analysis.TokenStream | create(org.apache.lucene.analysis.TokenStream tokenStream) |
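
The ordering requirement in the class description can be illustrated with a small, library-free sketch. It does not use the Lucene API. `Token`, `normalizeUrlsAndEmails`, `keepAlphaOnly`, and the two placeholder strings are hypothetical stand-ins invented for illustration; the real values of the `URL` and `EMAIL` constants and the exact behavior of the alpha/ideograph filter are not stated on this page. The only detail taken from Lucene is that `UAX29URLEmailTokenizer` tags tokens with the types `<URL>`, `<EMAIL>`, and `<ALPHANUM>`. The sketch also assumes that normalized tokens are retyped so that a later type-based filter keeps them.

```java
import java.util.ArrayList;
import java.util.List;

public class FilterOrderSketch {
    // Token type strings assigned by UAX29URLEmailTokenizer.
    static final String URL_TYPE = "<URL>";
    static final String EMAIL_TYPE = "<EMAIL>";
    static final String ALPHANUM_TYPE = "<ALPHANUM>";

    // Hypothetical placeholder terms; the real URL/EMAIL constant values may differ.
    static final String URL_PLACEHOLDER = "url_placeholder";
    static final String EMAIL_PLACEHOLDER = "email_placeholder";

    // Minimal stand-in for a token carrying a term and a type attribute.
    static final class Token {
        final String text;
        final String type;

        Token(String text, String type) {
            this.text = text;
            this.type = type;
        }
    }

    // Stand-in for the normalizing filter: URL/email tokens become fixed
    // placeholder terms, retyped as <ALPHANUM> (an assumption of this sketch).
    static List<Token> normalizeUrlsAndEmails(List<Token> in) {
        List<Token> out = new ArrayList<>();
        for (Token t : in) {
            if (URL_TYPE.equals(t.type)) {
                out.add(new Token(URL_PLACEHOLDER, ALPHANUM_TYPE));
            } else if (EMAIL_TYPE.equals(t.type)) {
                out.add(new Token(EMAIL_PLACEHOLDER, ALPHANUM_TYPE));
            } else {
                out.add(t);
            }
        }
        return out;
    }

    // Stand-in for an alpha-only filter: drops every token not typed <ALPHANUM>.
    static List<Token> keepAlphaOnly(List<Token> in) {
        List<Token> out = new ArrayList<>();
        for (Token t : in) {
            if (ALPHANUM_TYPE.equals(t.type)) {
                out.add(t);
            }
        }
        return out;
    }

    // Extracts the term text of each token, in order.
    static List<String> texts(List<Token> tokens) {
        List<String> out = new ArrayList<>();
        for (Token t : tokens) {
            out.add(t.text);
        }
        return out;
    }

    // Sample token stream as a URL/email-aware tokenizer might type it.
    static List<Token> sample() {
        List<Token> tokens = new ArrayList<>();
        tokens.add(new Token("mail", ALPHANUM_TYPE));
        tokens.add(new Token("bob@example.com", EMAIL_TYPE));
        tokens.add(new Token("or", ALPHANUM_TYPE));
        tokens.add(new Token("visit", ALPHANUM_TYPE));
        tokens.add(new Token("https://example.com", URL_TYPE));
        return tokens;
    }

    public static void main(String[] args) {
        // Normalize first, then filter: placeholders survive.
        System.out.println(texts(keepAlphaOnly(normalizeUrlsAndEmails(sample()))));
        // Filter first, then normalize: URL/email tokens are already gone.
        System.out.println(texts(normalizeUrlsAndEmails(keepAlphaOnly(sample()))));
    }
}
```

With normalization first, the placeholder terms remain in the output stream. With the order reversed, the URL and email tokens are dropped before the normalizing step ever sees them, which is the failure the description warns about.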

Methods inherited from class org.apache.lucene.analysis.util.TokenFilterFactory:
availableTokenFilters, findSPIName, forName, lookupClass, normalize, reloadTokenFilters

Methods inherited from class org.apache.lucene.analysis.util.AbstractAnalysisFactory:
get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNames

public static final String URL
public static final String EMAIL
Copyright © 2007–2022 The Apache Software Foundation. All rights reserved.