Package org.apache.tika.batch.fs
package org.apache.tika.batch.fs
-
ClassDescriptionBasic FileResourceConsumer that reads files from an input directory and writes content to the output directory.Selector that chooses files based on their file name and their size, as determined by TikaCoreProperties.RESOURCE_NAME_KEY and Metadata.CONTENT_LENGTH.FileSystem(FS)Resource wraps a file name.Class that "crawls" a list of files.Utility class to handle some common issues when reading from and writing to a file system (FS).This runs a RecursiveParserWrapper against an input file and outputs the json metadata to an output file.This uses the
JsonStreamingSerializer
to write out a single metadata object at a time.