Package org.apache.tika.eval.app
Class ExtractComparer
java.lang.Object
org.apache.tika.eval.app.ProfilerBase
org.apache.tika.eval.app.ExtractComparer
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.tika.eval.app.ProfilerBase
ProfilerBase.EXCEPTION_TYPE, ProfilerBase.PARSE_ERROR_TYPE -
Field Summary
FieldsModifier and TypeFieldDescriptionstatic TableInfostatic TableInfostatic TableInfostatic TableInfostatic TableInfostatic TableInfostatic TableInfostatic TableInfostatic TableInfostatic TableInfostatic TableInfostatic TableInfostatic TableInfostatic TableInfostatic TableInfoFields inherited from class org.apache.tika.eval.app.ProfilerBase
FALSE, ID, MIME_TABLE, REF_EXTRACT_EXCEPTION_TYPES, REF_PARSE_ERROR_TYPES, REF_PARSE_EXCEPTION_TYPES, TRUE, writer -
Constructor Summary
ConstructorsConstructorDescriptionExtractComparer(Path inputDir, Path extractsA, Path extractsB, ExtractReader extractReader, IDBWriter writer) -
Method Summary
Modifier and TypeMethodDescriptionprotected voidcompareFiles(org.apache.tika.eval.app.EvalFilePaths fpsA, org.apache.tika.eval.app.EvalFilePaths fpsB) booleanprocessFileResource(FetchKey fetchKey) static voidUSAGE()Methods inherited from class org.apache.tika.eval.app.ProfilerBase
calcTextStats, closeWriter, getContent, getFileLength, getPathsFromExtractCrawl, getPathsFromSrcCrawl, getSourceFileLength, loadCommonTokens, setMaxContentLength, setMaxContentLengthForLangId, setMaxTokens, truncateContent, writeContentData, writeExceptionData, writeExtractException, writeProfileData
-
Field Details
-
REF_PAIR_NAMES
-
COMPARISON_CONTAINERS
-
CONTENT_COMPARISONS
-
PROFILES_A
-
PROFILES_B
-
EMBEDDED_FILE_PATH_TABLE_A
-
EMBEDDED_FILE_PATH_TABLE_B
-
CONTENTS_TABLE_A
-
CONTENTS_TABLE_B
-
TAGS_TABLE_A
-
TAGS_TABLE_B
-
EXCEPTION_TABLE_A
-
EXCEPTION_TABLE_B
-
EXTRACT_EXCEPTION_TABLE_A
-
EXTRACT_EXCEPTION_TABLE_B
-
-
Constructor Details
-
ExtractComparer
public ExtractComparer(Path inputDir, Path extractsA, Path extractsB, ExtractReader extractReader, IDBWriter writer)
-
-
Method Details
-
USAGE
- Throws:
IOException
-
processFileResource
- Specified by:
processFileResourcein classProfilerBase
-
compareFiles
protected void compareFiles(org.apache.tika.eval.app.EvalFilePaths fpsA, org.apache.tika.eval.app.EvalFilePaths fpsB) throws IOException - Throws:
IOException
-