| Modifier and Type | Class and Description | 
|---|---|
class  | 
DL4JInceptionV3Net
DL4JInceptionV3Net is an implementation of ObjectRecogniser. | 
class  | 
DL4JVGG16Net  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
RegexCaptureParser  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
TensorflowRESTCaptioner
Tensorflow image captioner. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
ExternalParser
This is a next generation external parser that uses some of the more
 recent additions to Tika. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
TesseractOCRParser
TesseractOCRParser powered by tesseract-ocr engine. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
PDFParser
PDF parser. 
 | 
| Modifier and Type | Interface and Description | 
|---|---|
interface  | 
ObjectRecogniser
This is a contract for object recognisers used by  
ObjectRecognitionParser | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
AgeRecogniser
Parser for extracting features from text. 
 | 
class  | 
ObjectRecognitionParser
This parser recognises objects from Images. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
TensorflowImageRecParser
This is an implementation of  
ObjectRecogniser powered by
  Tensorflow 
 convolutional neural network (CNN). | 
class  | 
TensorflowRESTRecogniser
Tensor Flow image recogniser which has high performance. 
 | 
class  | 
TensorflowRESTVideoRecogniser
Tensor Flow video recogniser which has high performance. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
SentimentAnalysisParser
This parser classifies documents based on the sentiment of document. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
SQLite3Parser
This is the main class for parsing SQLite3 files. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
StringsParser
Parser that uses the "strings" (or strings-alternative) command to find the
 printable strings in a object, or other binary, file
 (application/octet-stream). 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
AmazonTranscribe
Amazon Transcribe
 implementation. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
CompositePipesReporter  | 
class  | 
PipesReporterBase
Base class that includes filtering by  
PipesResult.STATUS | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
AZBlobEmitter
Emit files to Azure blob storage. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
GCSEmitter  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
JDBCEmitter
This is only an initial, basic implementation of an emitter for JDBC. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
KafkaEmitter
Emits the now-parsed documents into a specified Apache Kafka topic. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
OpenSearchEmitter  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
S3Emitter
Emits to existing s3 bucket
  
  <properties>
      <emitters>
          <emitter class="org.apache.tika.pipes.emitter.s3.S3Emitter>
              <params>
                  <!-- required -->
                  <param name="name" type="string">s3e</param>
                  <!-- required -->
                  <param name="region" type="string">us-east-1</param>
                  <!-- required -->
                  <param name="credentialsProvider"
                       type="string">(profile|instance)</param>
                  <!-- required if credentialsProvider=profile-->
                  <param name="profile" type="string">my-profile</param>
                  <!-- required -->
                  <param name="bucket" type="string">my-bucket</param>
                  <!-- optional; prefix to add to the path before emitting;
                       default is no prefix -->
                  <param name="prefix" type="string">my-prefix</param>
                  <!-- optional; default is 'json' this will be added to the SOURCE_PATH
                                    if no emitter key is specified. | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
SolrEmitter  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
AZBlobFetcher
Fetches files from Azure blob storage. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
FileSystemFetcher  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
GCSFetcher
Fetches files from google cloud storage. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
HttpFetcher
Based on Apache httpclient 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
S3Fetcher
Fetches files from s3. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
PipesIterator
Abstract class that handles the testing for timeouts/thread safety
 issues. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
AZBlobPipesIterator  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
CSVPipesIterator
Iterates through a UTF-8 CSV file. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
FileListPipesIterator
Reads a list of file names/relative paths from a UTF-8 file. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
FileSystemPipesIterator  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
GCSPipesIterator  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
JDBCPipesIterator
Iterates through a the results from a sql call via jdbc. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
KafkaPipesIterator  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
S3PipesIterator  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
SolrPipesIterator
Iterates through results from a Solr query. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
FileSystemStatusReporter
This is intended to write summary statistics to disk
 periodically. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
JDBCPipesReporter
This is an initial draft of a JDBCPipesReporter. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
OpenSearchPipesReporter
As of the 2.5.0 release, this is ALPHA version. 
 | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
CompositeRenderer  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
PDFBoxRenderer  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
TikaServerClientConfig  | 
| Modifier and Type | Class and Description | 
|---|---|
class  | 
TlsConfig  | 
Copyright © 2007–2023 The Apache Software Foundation. All rights reserved.