Package org.apache.tika.eval.core.tokens
Enum Class TikaEvalTokenizer.Mode
- All Implemented Interfaces:
Serializable,Comparable<TikaEvalTokenizer.Mode>,Constable
- Enclosing class:
- TikaEvalTokenizer
Tokenization mode.
-
Nested Class Summary
Nested classes/interfaces inherited from class java.lang.Enum
Enum.EnumDesc<E extends Enum<E>> -
Enum Constant Summary
Enum ConstantsEnum ConstantDescriptionCommon-token analysis — letters and ideographs only.General token counting — letters, ideographs, and numbers. -
Method Summary
Modifier and TypeMethodDescriptionstatic TikaEvalTokenizer.ModeReturns the enum constant of this class with the specified name.static TikaEvalTokenizer.Mode[]values()Returns an array containing the constants of this enum class, in the order they are declared.
-
Enum Constant Details
-
STANDARD
General token counting — letters, ideographs, and numbers. No minimum length, no skip list. -
COMMON_TOKENS
Common-token analysis — letters and ideographs only. Minimum 3 characters for alphabetic tokens, HTML terms excluded.
-
-
Method Details
-
values
Returns an array containing the constants of this enum class, in the order they are declared.- Returns:
- an array containing the constants of this enum class, in the order they are declared
-
valueOf
Returns the enum constant of this class with the specified name. The string must match exactly an identifier used to declare an enum constant in this class. (Extraneous whitespace characters are not permitted.)- Parameters:
name- the name of the enum constant to be returned.- Returns:
- the enum constant with the specified name
- Throws:
IllegalArgumentException- if this enum class has no constant with the specified nameNullPointerException- if the argument is null
-