Package | Description |
---|---|
org.apache.tika.eval.textstats |
Modifier and Type | Class and Description |
---|---|
class |
BasicTokenCountStatsCalculator |
class |
TextProfileSignature
Copied nearly directly from Apache Nutch:
https://github.com/apache/nutch/blob/master/src/java/org/apache/nutch/crawl/TextProfileSignature.java
See documentation: https://nutch.apache.org/apidocs/apidocs-2.0/org/apache/nutch/crawl/TextProfileSignature.html
This returns the base32 encoded sha256
|
class |
TokenEntropy |
class |
TokenLengths |
class |
TopNTokens |
Copyright © 2007–2021 The Apache Software Foundation. All rights reserved.