Package org.apache.tika.batch.fs
Class FSDirectoryCrawler
- java.lang.Object
- 
- org.apache.tika.batch.FileResourceCrawler
- 
- org.apache.tika.batch.fs.FSDirectoryCrawler
 
 
- 
- All Implemented Interfaces:
- Callable<IFileProcessorFutureResult>
 
 public class FSDirectoryCrawler extends FileResourceCrawler 
- 
- 
Nested Class SummaryNested Classes Modifier and Type Class Description static classFSDirectoryCrawler.CRAWL_ORDER
 - 
Field Summary- 
Fields inherited from class org.apache.tika.batch.FileResourceCrawlerADDED, LOG, SKIPPED, STOP_NOW
 
- 
 - 
Constructor SummaryConstructors Constructor Description FSDirectoryCrawler(ArrayBlockingQueue<FileResource> fileQueue, int numConsumers, Path root, Path startDirectory, FSDirectoryCrawler.CRAWL_ORDER crawlOrder)FSDirectoryCrawler(ArrayBlockingQueue<FileResource> fileQueue, int numConsumers, Path root, FSDirectoryCrawler.CRAWL_ORDER crawlOrder)
 - 
Method SummaryAll Methods Instance Methods Concrete Methods Modifier and Type Method Description voidhandleFirstFileInDirectory(Path f)Override this if you have any special handling for the first actual file that the crawler comes across in a directory.voidstart()Implement this to control the addition of FileResources.- 
Methods inherited from class org.apache.tika.batch.FileResourceCrawlercall, getAdded, getConsidered, isActive, isQueueEmpty, select, setDocumentSelector, setMaxConsecWaitInMillis, setMaxFilesToAdd, setMaxFilesToConsider, shutDownNoPoison, tryToAdd, wasTimedOut
 
- 
 
- 
- 
- 
Constructor Detail- 
FSDirectoryCrawlerpublic FSDirectoryCrawler(ArrayBlockingQueue<FileResource> fileQueue, int numConsumers, Path root, FSDirectoryCrawler.CRAWL_ORDER crawlOrder) 
 - 
FSDirectoryCrawlerpublic FSDirectoryCrawler(ArrayBlockingQueue<FileResource> fileQueue, int numConsumers, Path root, Path startDirectory, FSDirectoryCrawler.CRAWL_ORDER crawlOrder) 
 
- 
 - 
Method Detail- 
startpublic void start() throws InterruptedExceptionDescription copied from class:FileResourceCrawlerImplement this to control the addition of FileResources. CallFileResourceCrawler.tryToAdd(org.apache.tika.batch.FileResource)to add FileResources to the queue.- Specified by:
- startin class- FileResourceCrawler
- Throws:
- InterruptedException
 
 - 
handleFirstFileInDirectorypublic void handleFirstFileInDirectory(Path f) Override this if you have any special handling for the first actual file that the crawler comes across in a directory. For example, it might be handy to call mkdirs() on an output directory if your FileResourceConsumers are writing to a file.- Parameters:
- f- file to handle
 
 
- 
 
-