public class ExtractComparer extends AbstractProfiler
AbstractProfiler.EXCEPTION_TYPE, AbstractProfiler.PARSE_ERROR_TYPE| Modifier and Type | Field and Description |
|---|---|
static TableInfo |
COMPARISON_CONTAINERS |
static TableInfo |
CONTENT_COMPARISONS |
static TableInfo |
CONTENTS_TABLE_A |
static TableInfo |
CONTENTS_TABLE_B |
static TableInfo |
EMBEDDED_FILE_PATH_TABLE_A |
static TableInfo |
EMBEDDED_FILE_PATH_TABLE_B |
static TableInfo |
EXCEPTION_TABLE_A |
static TableInfo |
EXCEPTION_TABLE_B |
static TableInfo |
EXTRACT_EXCEPTION_TABLE_A |
static TableInfo |
EXTRACT_EXCEPTION_TABLE_B |
static TableInfo |
PROFILES_A |
static TableInfo |
PROFILES_B |
static TableInfo |
REF_PAIR_NAMES |
static TableInfo |
TAGS_TABLE_A |
static TableInfo |
TAGS_TABLE_B |
FALSE, ID, MIME_TABLE, REF_EXTRACT_EXCEPTION_TYPES, REF_PARSE_ERROR_TYPES, REF_PARSE_EXCEPTION_TYPES, TRUE, writerELAPSED_MILLIS, IO_IS, IO_OS, OOM, PARSE_ERR, PARSE_EX, TIMED_OUT| Constructor and Description |
|---|
ExtractComparer(ArrayBlockingQueue<FileResource> queue,
Path inputDir,
Path extractsA,
Path extractsB,
ExtractReader extractReader,
IDBWriter writer) |
| Modifier and Type | Method and Description |
|---|---|
protected void |
compareFiles(org.apache.tika.eval.app.EvalFilePaths fpsA,
org.apache.tika.eval.app.EvalFilePaths fpsB) |
boolean |
processFileResource(FileResource fileResource)
Main piece of code that needs to be implemented.
|
static void |
USAGE() |
calcTextStats, closeWriter, getContent, getFileLength, getPathsFromExtractCrawl, getPathsFromSrcCrawl, getSourceFileLength, loadCommonTokens, setMaxContentLength, setMaxContentLengthForLangId, setMaxTokens, truncateContent, writeContentData, writeExceptionData, writeExtractException, writeProfileDatacall, checkForTimedOutMillis, close, flushAndClose, getCurrentFile, getNumHandledExceptions, getNumResourcesConsumed, getXMLifiedLogMsg, getXMLifiedLogMsg, incrementHandledExceptions, isStillActive, parse, pleaseShutdownpublic static TableInfo REF_PAIR_NAMES
public static TableInfo COMPARISON_CONTAINERS
public static TableInfo CONTENT_COMPARISONS
public static TableInfo PROFILES_A
public static TableInfo PROFILES_B
public static TableInfo EMBEDDED_FILE_PATH_TABLE_A
public static TableInfo EMBEDDED_FILE_PATH_TABLE_B
public static TableInfo CONTENTS_TABLE_A
public static TableInfo CONTENTS_TABLE_B
public static TableInfo TAGS_TABLE_A
public static TableInfo TAGS_TABLE_B
public static TableInfo EXCEPTION_TABLE_A
public static TableInfo EXCEPTION_TABLE_B
public static TableInfo EXTRACT_EXCEPTION_TABLE_A
public static TableInfo EXTRACT_EXCEPTION_TABLE_B
public ExtractComparer(ArrayBlockingQueue<FileResource> queue, Path inputDir, Path extractsA, Path extractsB, ExtractReader extractReader, IDBWriter writer)
public static void USAGE()
public boolean processFileResource(FileResource fileResource)
FileResourceConsumerFileResourceConsumer.incrementHandledExceptions() appropriately in
your implementation of this method.
processFileResource in class FileResourceConsumerfileResource - resource to processprotected void compareFiles(org.apache.tika.eval.app.EvalFilePaths fpsA,
org.apache.tika.eval.app.EvalFilePaths fpsB)
throws IOException
IOExceptionCopyright © 2007–2021 The Apache Software Foundation. All rights reserved.