Package org.apache.tika.batch.fs
Class RecursiveParserWrapperFSConsumer
java.lang.Object
org.apache.tika.batch.FileResourceConsumer
org.apache.tika.batch.fs.AbstractFSConsumer
org.apache.tika.batch.fs.RecursiveParserWrapperFSConsumer
- All Implemented Interfaces:
Callable<IFileProcessorFutureResult>
This runs a RecursiveParserWrapper against an input file
and outputs the json metadata to an output file.
-
Field Summary
Fields inherited from class org.apache.tika.batch.FileResourceConsumer
ELAPSED_MILLIS, IO_IS, IO_OS, LOG, OOM, PARSE_ERR, PARSE_EX, TIMED_OUT -
Constructor Summary
ConstructorsConstructorDescriptionRecursiveParserWrapperFSConsumer(ArrayBlockingQueue<FileResource> queue, Parser parser, ContentHandlerFactory contentHandlerFactory, OutputStreamFactory fsOSFactory, MetadataFilter metadataFilter) -
Method Summary
Modifier and TypeMethodDescriptionbooleanprocessFileResource(FileResource fileResource) Main piece of code that needs to be implemented.voidsetOutputEncoding(String outputEncoding) Methods inherited from class org.apache.tika.batch.fs.AbstractFSConsumer
getInputStream, getOutputStreamMethods inherited from class org.apache.tika.batch.FileResourceConsumer
call, checkForTimedOutMillis, close, flushAndClose, getCurrentFile, getNumHandledExceptions, getNumResourcesConsumed, getXMLifiedLogMsg, getXMLifiedLogMsg, incrementHandledExceptions, isStillActive, parse, pleaseShutdown
-
Constructor Details
-
RecursiveParserWrapperFSConsumer
public RecursiveParserWrapperFSConsumer(ArrayBlockingQueue<FileResource> queue, Parser parser, ContentHandlerFactory contentHandlerFactory, OutputStreamFactory fsOSFactory, MetadataFilter metadataFilter) - Parameters:
queue-parser- -- must be RecursiveParserWrapper or a ForkParser that wraps a RecursiveParserWrappercontentHandlerFactory-fsOSFactory-
-
-
Method Details
-
processFileResource
Description copied from class:FileResourceConsumerMain piece of code that needs to be implemented. Clients are responsible for closing streams and handling the exceptions that they'd like to handle. Unchecked throwables can be thrown past this, of course. When an unchecked throwable is thrown, this logs the error, and then rethrows the exception. Clients/subclasses should make sure to catch and handle everything they can. The design goal is that the whole process should close up and shutdown soon after an unchecked exception or error is thrown. Make sure to callFileResourceConsumer.incrementHandledExceptions()appropriately in your implementation of this method.- Specified by:
processFileResourcein classFileResourceConsumer- Parameters:
fileResource- resource to process- Returns:
- whether or not a file was successfully processed
-
getOutputEncoding
-
setOutputEncoding
-