Class FileListPipesIterator
java.lang.Object
org.apache.tika.config.ConfigBase
org.apache.tika.pipes.pipesiterator.PipesIterator
org.apache.tika.pipes.pipesiterator.filelist.FileListPipesIterator
- All Implemented Interfaces:
Iterable<FetchEmitTuple>
,Callable<Integer>
,Initializable
Reads a list of file names/relative paths from a UTF-8 file.
One file name/relative path per line. This path is used for the fetch key,
the id and the emit key. If you need more customized control of the keys/ids,
consider using the jdbc pipes iterator or the csv pipes iterator.
Skips empty lines and lines starting with '#'
-
Field Summary
Fields inherited from class org.apache.tika.pipes.pipesiterator.PipesIterator
COMPLETED_SEMAPHORE, DEFAULT_MAX_WAIT_MS, DEFAULT_QUEUE_SIZE
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionvoid
checkInitialization
(InitializableProblemHandler problemHandler) protected void
enqueue()
void
setFileList
(String path) void
setHasHeader
(boolean hasHeader) Methods inherited from class org.apache.tika.pipes.pipesiterator.PipesIterator
build, call, getEmitterName, getFetcherName, getHandlerConfig, getOnParseException, initialize, iterator, setEmitterName, setFetcherName, setHandlerType, setMaxEmbeddedResources, setMaxWaitMs, setOnParseException, setOnParseException, setParseMode, setParseMode, setQueueSize, setThrowOnWriteLimitReached, setWriteLimit, tryToAdd
Methods inherited from class org.apache.tika.config.ConfigBase
buildComposite, buildComposite, buildSingle, buildSingle, configure, handleSettings
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.apache.tika.config.Initializable
initialize
Methods inherited from interface java.lang.Iterable
forEach, spliterator
-
Constructor Details
-
FileListPipesIterator
public FileListPipesIterator()
-
-
Method Details
-
enqueue
- Specified by:
enqueue
in classPipesIterator
- Throws:
IOException
TimeoutException
InterruptedException
-
setFileList
-
setHasHeader
-
checkInitialization
public void checkInitialization(InitializableProblemHandler problemHandler) throws TikaConfigException - Specified by:
checkInitialization
in interfaceInitializable
- Overrides:
checkInitialization
in classPipesIterator
- Parameters:
problemHandler
- if there is a problem and no custom initializableProblemHandler has been configured via Initializable parameters, this is called to respond.- Throws:
TikaConfigException
-