Class ExtractComparer

    • Field Detail

      • REF_PAIR_NAMES

        public static TableInfo REF_PAIR_NAMES
      • COMPARISON_CONTAINERS

        public static TableInfo COMPARISON_CONTAINERS
      • CONTENT_COMPARISONS

        public static TableInfo CONTENT_COMPARISONS
      • PROFILES_A

        public static TableInfo PROFILES_A
      • PROFILES_B

        public static TableInfo PROFILES_B
      • EMBEDDED_FILE_PATH_TABLE_A

        public static TableInfo EMBEDDED_FILE_PATH_TABLE_A
      • EMBEDDED_FILE_PATH_TABLE_B

        public static TableInfo EMBEDDED_FILE_PATH_TABLE_B
      • CONTENTS_TABLE_A

        public static TableInfo CONTENTS_TABLE_A
      • CONTENTS_TABLE_B

        public static TableInfo CONTENTS_TABLE_B
      • TAGS_TABLE_A

        public static TableInfo TAGS_TABLE_A
      • TAGS_TABLE_B

        public static TableInfo TAGS_TABLE_B
      • EXCEPTION_TABLE_A

        public static TableInfo EXCEPTION_TABLE_A
      • EXCEPTION_TABLE_B

        public static TableInfo EXCEPTION_TABLE_B
      • EXTRACT_EXCEPTION_TABLE_A

        public static TableInfo EXTRACT_EXCEPTION_TABLE_A
      • EXTRACT_EXCEPTION_TABLE_B

        public static TableInfo EXTRACT_EXCEPTION_TABLE_B
    • Method Detail

      • USAGE

        public static void USAGE()
      • processFileResource

        public boolean processFileResource​(FileResource fileResource)
        Description copied from class: FileResourceConsumer
        Main piece of code that needs to be implemented. Clients are responsible for closing streams and handling the exceptions that they'd like to handle.

        Unchecked throwables can be thrown past this, of course. When an unchecked throwable is thrown, this logs the error, and then rethrows the exception. Clients/subclasses should make sure to catch and handle everything they can.

        The design goal is that the whole process should close up and shutdown soon after an unchecked exception or error is thrown.

        Make sure to call FileResourceConsumer.incrementHandledExceptions() appropriately in your implementation of this method.

        Specified by:
        processFileResource in class FileResourceConsumer
        Parameters:
        fileResource - resource to process
        Returns:
        whether or not a file was successfully processed
      • compareFiles

        protected void compareFiles​(org.apache.tika.eval.app.EvalFilePaths fpsA,
                                    org.apache.tika.eval.app.EvalFilePaths fpsB)
                             throws IOException
        Throws:
        IOException