public class TextProfileSignature extends Object implements TokenCountStatsCalculator<String>
See documentation: https://nutch.apache.org/apidocs/apidocs-2.0/org/apache/nutch/crawl/TextProfileSignature.html
This returns the base32 encoded sha256
| Constructor and Description | 
|---|
TextProfileSignature()  | 
| Modifier and Type | Method and Description | 
|---|---|
String | 
calculate(TokenCounts tokenCounts)  | 
void | 
setMinTokenLength(int minTokenLength)
Be careful -- for CJK languages, the default analyzer uses character
 bigrams. 
 | 
void | 
setQuantRate(float quantRate)  | 
public String calculate(TokenCounts tokenCounts)
calculate in interface TokenCountStatsCalculator<String>public void setMinTokenLength(int minTokenLength)
minTokenLength - -- include tokens of this length or greater.public void setQuantRate(float quantRate)
Copyright © 2007–2021 The Apache Software Foundation. All rights reserved.