public class TextProfileSignature extends Object implements TokenCountStatsCalculator<String>
See documentation: https://nutch.apache.org/apidocs/apidocs-2.0/org/apache/nutch/crawl/TextProfileSignature.html
calculate returns the Base32-encoded SHA-256 digest of the text profile built from the token counts.
Constructor and Description |
---|
TextProfileSignature() |
Modifier and Type | Method and Description |
---|---|
String | calculate(TokenCounts tokenCounts) |
void | setMinTokenLength(int minTokenLength) Be careful: for CJK languages, the default analyzer uses character bigrams. |
void | setQuantRate(float quantRate) |
public String calculate(TokenCounts tokenCounts)
Specified by: calculate in interface TokenCountStatsCalculator<String>

public void setMinTokenLength(int minTokenLength)
Parameters: minTokenLength - include tokens of this length or greater.

public void setQuantRate(float quantRate)
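
The interplay of minTokenLength and quantRate may be easier to see in code. The following is a minimal, self-contained sketch of a text-profile signature, modelled on the approach described in the Nutch documentation linked above; it is not this class's actual implementation. The class name ProfileSignatureSketch, its helper methods, the plain Map standing in for TokenCounts, the alphabetical tie-break, and the unpadded Base32 output are all assumptions made for illustration.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not part of the documented API.
final class ProfileSignatureSketch {

    private static final String BASE32_ALPHABET = "ABCDEFGHIJKLMNOPQRSTUVWXYZ234567";

    /** Builds a quantized profile: one "token count" line per surviving token. */
    static String profile(Map<String, Integer> counts, int minTokenLength, float quantRate) {
        // Find the highest count among tokens of the minimum length or greater.
        int maxFreq = 0;
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            if (e.getKey().length() >= minTokenLength) {
                maxFreq = Math.max(maxFreq, e.getValue());
            }
        }
        // Assumed rule: the quantization step is a fraction of the top count,
        // with a floor of 2 (or 1 when no token repeats).
        int quant = Math.round(maxFreq * quantRate);
        if (quant < 2) {
            quant = maxFreq > 1 ? 2 : 1;
        }
        // Round each count down to a multiple of quant; drop tokens that fall below it.
        List<Map.Entry<String, Integer>> kept = new ArrayList<>();
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            if (e.getKey().length() < minTokenLength) {
                continue;
            }
            int quantized = (e.getValue() / quant) * quant;
            if (quantized >= quant) {
                kept.add(Map.entry(e.getKey(), quantized));
            }
        }
        // Sort by count descending; the alphabetical tie-break is this sketch's choice.
        kept.sort((a, b) -> {
            int byCount = Integer.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, Integer> e : kept) {
            sb.append(e.getKey()).append(' ').append(e.getValue()).append('\n');
        }
        return sb.toString();
    }

    /** Encodes bytes with the RFC 4648 Base32 alphabet, without '=' padding. */
    static String base32(byte[] data) {
        StringBuilder sb = new StringBuilder();
        int buffer = 0;
        int bits = 0;
        for (byte b : data) {
            buffer = (buffer << 8) | (b & 0xFF);
            bits += 8;
            while (bits >= 5) {
                sb.append(BASE32_ALPHABET.charAt((buffer >> (bits - 5)) & 0x1F));
                bits -= 5;
                buffer &= (1 << bits) - 1; // keep only the bits not yet emitted
            }
        }
        if (bits > 0) {
            sb.append(BASE32_ALPHABET.charAt((buffer << (5 - bits)) & 0x1F));
        }
        return sb.toString();
    }

    /** Returns the Base32-encoded SHA-256 digest of the quantized profile. */
    static String signature(Map<String, Integer> counts, int minTokenLength, float quantRate) {
        try {
            MessageDigest sha256 = MessageDigest.getInstance("SHA-256");
            byte[] digest = sha256.digest(
                    profile(counts, minTokenLength, quantRate).getBytes(StandardCharsets.UTF_8));
            return base32(digest);
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("SHA-256 is required on every Java platform", e);
        }
    }

    public static void main(String[] args) {
        Map<String, Integer> doc = Map.of("the", 10, "cat", 4, "a", 7, "sat", 1, "mat", 3);
        System.out.print(profile(doc, 2, 0.2f));
        System.out.println(signature(doc, 2, 0.2f));
    }
}
```

Under these assumptions, two documents whose token counts differ only slightly can quantize to the same profile and therefore share a signature, which is the point of a near-duplicate fingerprint. A larger quantRate coarsens the buckets and tolerates more variation; a larger minTokenLength ignores more short tokens.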
Copyright © 2007–2023 The Apache Software Foundation. All rights reserved.