Uses of Class
org.apache.tika.config.Field
Uses of Field in org.apache.tika.detect
Methods in org.apache.tika.detect with annotations of type Field
void FileCommandDetector.setFilePath(String fileCommandPath)
void FileCommandDetector.setMaxBytes(int maxBytes)
    If this is not called on a TikaInputStream, this detector will spool up to this many bytes to a file to be detected by the 'file' command.
void FileCommandDetector.setTimeoutMs(long timeoutMs)
void FileCommandDetector.setUseMime(boolean useMime)
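The setters listed on this page carry the Field annotation so that configuration values can be applied to them by name. The following is a minimal sketch of that general idea using reflection. `ConfigField`, `DemoDetector`, and `FieldInjector` are hypothetical stand-ins written for illustration; they are not Tika classes, and Tika's own loading logic may work differently.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Method;
import java.util.Map;

// Hypothetical stand-in for org.apache.tika.config.Field (assumption, not the real annotation).
@Retention(RetentionPolicy.RUNTIME)
@Target({ElementType.FIELD, ElementType.METHOD})
@interface ConfigField {
}

// Hypothetical class whose setter names mirror those listed for FileCommandDetector.
class DemoDetector {
    private int maxBytes = 1_000_000;
    private long timeoutMs = 6_000L;
    private boolean ignored = false;

    @ConfigField
    public void setMaxBytes(int maxBytes) { this.maxBytes = maxBytes; }

    @ConfigField
    public void setTimeoutMs(long timeoutMs) { this.timeoutMs = timeoutMs; }

    // Not annotated, so the injector below never calls it.
    public void setIgnored(boolean ignored) { this.ignored = ignored; }

    public int getMaxBytes() { return maxBytes; }
    public long getTimeoutMs() { return timeoutMs; }
    public boolean isIgnored() { return ignored; }
}

class FieldInjector {
    // Invokes each annotated single-argument setter whose derived name has a value in params.
    static void apply(Object target, Map<String, Object> params) {
        for (Method m : target.getClass().getMethods()) {
            String n = m.getName();
            if (!m.isAnnotationPresent(ConfigField.class)
                    || !n.startsWith("set") || n.length() <= 3
                    || m.getParameterCount() != 1) {
                continue;
            }
            // setMaxBytes -> maxBytes
            String key = Character.toLowerCase(n.charAt(3)) + n.substring(4);
            if (params.containsKey(key)) {
                try {
                    m.invoke(target, params.get(key));
                } catch (ReflectiveOperationException e) {
                    throw new IllegalStateException("could not set " + key, e);
                }
            }
        }
    }

    public static void main(String[] args) {
        DemoDetector d = new DemoDetector();
        apply(d, Map.of("maxBytes", 4096, "timeoutMs", 2000L, "ignored", true));
        // The unannotated setter is skipped, so "ignored" keeps its default.
        System.out.println(d.getMaxBytes() + " " + d.getTimeoutMs() + " " + d.isIgnored());
    }
}
```

In this sketch only annotated setters are reachable from configuration, which is the practical effect the annotation listing above documents.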
Uses of Field in org.apache.tika.detect.magika
Methods in org.apache.tika.detect.magika with annotations of type Field
void MagikaDetector.setMagikaPath(String fileCommandPath)
void MagikaDetector.setMaxBytes(int maxBytes)
    If this is not called on a TikaInputStream, this detector will spool up to this many bytes to a file to be detected by the 'file' command.
void MagikaDetector.setTimeoutMs(long timeoutMs)
void MagikaDetector.setUseMime(boolean useMime)
    As default behavior, Tika runs magika to add its detection to the metadata, but NOT to use detection in determining parsers etc.
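The setMaxBytes description above says that up to a fixed number of bytes is spooled to a file before an external tool runs on it. A minimal sketch of that capped spooling, using only the JDK, might look like the following. `SpoolSketch` and its method names are hypothetical, and the detectors' real spooling code is not shown in this page.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

class SpoolSketch {
    // Copies at most maxBytes from the stream into a new temp file and returns its path.
    static Path spool(InputStream in, int maxBytes) throws IOException {
        Path tmp = Files.createTempFile("spool-", ".bin");
        try (OutputStream out = Files.newOutputStream(tmp)) {
            out.write(in.readNBytes(maxBytes));
        }
        return tmp;
    }

    // Convenience wrapper: spools, reports the spooled size, and cleans up the temp file.
    static long spooledSize(byte[] data, int maxBytes) {
        try {
            Path tmp = spool(new ByteArrayInputStream(data), maxBytes);
            try {
                return Files.size(tmp);
            } finally {
                Files.deleteIfExists(tmp);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // A 100-byte input capped at 10 bytes yields a 10-byte temp file.
        System.out.println(spooledSize(new byte[100], 10));
    }
}
```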
Uses of Field in org.apache.tika.detect.siegfried
Methods in org.apache.tika.detect.siegfried with annotations of type Field
void SiegfriedDetector.setMaxBytes(int maxBytes)
    If this is not called on a TikaInputStream, this detector will spool up to this many bytes to a file to be detected by the 'file' command.
void SiegfriedDetector.setSiegfriedPath(String fileCommandPath)
void SiegfriedDetector.setTimeoutMs(long timeoutMs)
void SiegfriedDetector.setUseMime(boolean useMime)
    As default behavior, Tika runs Siegfried to add its detection to the metadata, but NOT to use detection in determining parsers etc.
Uses of Field in org.apache.tika.detect.zip
Methods in org.apache.tika.detect.zip with annotations of type Field
void DefaultZipContainerDetector.setMarkLimit(int markLimit)
    If this is less than 0, the file will be spooled to disk, and detection will run on the full file.
Uses of Field in org.apache.tika.extractor
Methods in org.apache.tika.extractor with annotations of type Field
void RUnpackExtractorFactory.setEmbeddedBytesExcludeEmbeddedResourceTypes(List<String> excludeAttachmentTypes)
void RUnpackExtractorFactory.setEmbeddedBytesExcludeMimeTypes(List<String> excludeMimeTypes)
void RUnpackExtractorFactory.setEmbeddedBytesIncludeEmbeddedResourceTypes(List<String> includeAttachmentTypes)
void RUnpackExtractorFactory.setEmbeddedBytesIncludeMimeTypes(List<String> includeMimeTypes)
void RUnpackExtractorFactory.setMaxEmbeddedBytesForExtraction(long maxEmbeddedBytesForExtraction)
    Total number of bytes to write out.
void ParsingEmbeddedDocumentExtractorFactory.setWriteFileNameToContent(boolean writeFileNameToContent)
void RUnpackExtractorFactory.setWriteFileNameToContent(boolean writeFileNameToContent)
Uses of Field in org.apache.tika.langdetect.opennlp.metadatafilter
Methods in org.apache.tika.langdetect.opennlp.metadatafilter with annotations of type Field
void OpenNLPMetadataFilter.setMaxCharsForDetection(int maxCharsForDetection)
Uses of Field in org.apache.tika.langdetect.optimaize.metadatafilter
Methods in org.apache.tika.langdetect.optimaize.metadatafilter with annotations of type Field
void OptimaizeMetadataFilter.setMaxCharsForDetection(int maxCharsForDetection)
Uses of Field in org.apache.tika.metadata.filter
Methods in org.apache.tika.metadata.filter with annotations of type Field
void DateNormalizingMetadataFilter.setDefaultTimeZone(String timeZoneId)
void ExcludeFieldMetadataFilter.setExclude(List<String> exclude)
void FieldNameMappingFilter.setExcludeUnmapped(boolean excludeUnmapped)
    If this is true (default), this means that only the fields that have a "from" value in the mapper will be passed through.
void GeoPointMetadataFilter.setGeoPointFieldName(String geoPointFieldName)
    Set the field for the concatenated LATITUDE,LONGITUDE string.
void IncludeFieldMetadataFilter.setInclude(List<String> include)
void FieldNameMappingFilter.setMappings(Map<String,String> mappings)
void ClearByMimeMetadataFilter.setMimes(List<String> mimes)
void CaptureGroupMetadataFilter.setRegex(String regex)
void CaptureGroupMetadataFilter.setSourceField(String sourceField)
void CaptureGroupMetadataFilter.setTargetField(String targetField)
void ClearByAttachmentTypeMetadataFilter.setTypes(List<String> types)
    For types see TikaCoreProperties.EmbeddedResourceType
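The setExcludeUnmapped description above can be illustrated with a small sketch of a rename-and-filter step. This is a simplified model under stated assumptions: it uses a plain single-valued Map in place of Tika's Metadata object, and `FieldMappingSketch` is a hypothetical class, not the FieldNameMappingFilter implementation.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class FieldMappingSketch {
    // Renames fields according to mappings ("from" name -> "to" name).
    // When excludeUnmapped is true, fields without a "from" entry are dropped;
    // when false, they pass through under their original names.
    static Map<String, String> filter(Map<String, String> metadata,
                                      Map<String, String> mappings,
                                      boolean excludeUnmapped) {
        Map<String, String> out = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : metadata.entrySet()) {
            String to = mappings.get(e.getKey());
            if (to != null) {
                out.put(to, e.getValue());
            } else if (!excludeUnmapped) {
                out.put(e.getKey(), e.getValue());
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, String> metadata = new LinkedHashMap<>();
        metadata.put("dc:title", "Report");
        metadata.put("X-Parsed-By", "Demo");
        Map<String, String> mappings = Map.of("dc:title", "title");

        System.out.println(filter(metadata, mappings, true));
        System.out.println(filter(metadata, mappings, false));
    }
}
```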
Uses of Field in org.apache.tika.parser
Methods in org.apache.tika.parser with annotations of type Field
void RegexCaptureParser.setCaptureMap(Map<String,String> map)
void RegexCaptureParser.setMatchMap(Map<String,String> map)
void RegexCaptureParser.setWriteContent(boolean writeContent)
Uses of Field in org.apache.tika.parser.csv
Methods in org.apache.tika.parser.csv with annotations of type Field
void TextAndCSVParser.setNameToDelimiterMap(Map<String,String> map)
Uses of Field in org.apache.tika.parser.digestutils
Methods in org.apache.tika.parser.digestutils with annotations of type Field
void CommonsDigesterFactory.setAlgorithmString(String algorithmString)
void CommonsDigesterFactory.setMarkLimit(int markLimit)
void CommonsDigesterFactory.setSkipContainerDocument(boolean skipContainerDocument)
Uses of Field in org.apache.tika.parser.dwg
Methods in org.apache.tika.parser.dwg with annotations of type Field
void AbstractDWGParser.setCleanDwgReadOutput(boolean cleanDwgReadOutput)
void AbstractDWGParser.setCleanDwgReadOutputBatchSize(int cleanDwgReadOutputBatchSize)
void AbstractDWGParser.setCleanDwgReadRegexToReplace(String cleanDwgReadRegexToReplace)
void AbstractDWGParser.setCleanDwgReadReplaceWith(String cleanDwgReadReplaceWith)
void AbstractDWGParser.setDwgReadExecutable(String dwgReadExecutable)
void AbstractDWGParser.setDwgReadTimeout(long dwgReadTimeout)
Uses of Field in org.apache.tika.parser.epub
Methods in org.apache.tika.parser.epub with annotations of type Field
void EpubParser.setStreaming(boolean streaming)
Uses of Field in org.apache.tika.parser.external2
Methods in org.apache.tika.parser.external2 with annotations of type Field
void ExternalParser.setCommandLine(List<String> commandLine)
    Use this to specify the full commandLine.
void ExternalParser.setMaxStdErr(int maxStdErr)
void ExternalParser.setMaxStdOut(int maxStdOut)
void ExternalParser.setOutputParser(Parser parser)
    This parser is called on the output of the process.
void ExternalParser.setReturnStderr(boolean returnStderr)
    If set to true, this will return the stderr in the metadata via ExternalProcess.STD_ERR.
void ExternalParser.setReturnStdout(boolean returnStdout)
    If set to true, this will return the stdout in the metadata via ExternalProcess.STD_OUT.
void ExternalParser.setSupportedTypes(List<String> supportedTypes)
    This is set during initialization from a tika-config.
void ExternalParser.setTimeoutMs(long timeoutMs)
Uses of Field in org.apache.tika.parser.gdal
Methods in org.apache.tika.parser.gdal with annotations of type Field
void GDALParser.setMaxStdErr(int maxStdErr)
void GDALParser.setMaxStdOut(int maxStdOut)
void GDALParser.setTimeoutMs(long timeoutMs)
Uses of Field in org.apache.tika.parser.geo.topic
Methods in org.apache.tika.parser.geo.topic with annotations of type Field
void GeoParser.setGazetteerRestEndpoint(String gazetteerRestEndpoint)
void GeoParser.setNerModelUrl(String nerModelUrl)
Uses of Field in org.apache.tika.parser.geopkg
Methods in org.apache.tika.parser.geopkg with annotations of type Field
void GeoPkgParser.setIgnoreBlobColumns(List<String> ignoreBlobColumns)
Uses of Field in org.apache.tika.parser.html
Methods in org.apache.tika.parser.html with annotations of type Field
void JSoupParser.setExtractScripts(boolean extractScripts)
    Whether or not to extract contents in script entities.
void HtmlEncodingDetector.setMarkLimit(int markLimit)
    How far into the stream to read for charset detection.
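The mark limit described above bounds how many bytes a detector may read before rewinding the stream for the parser. A minimal sketch of that peek-then-rewind pattern with the JDK's mark/reset contract follows. `MarkLimitSketch` is a hypothetical class, and this is not the detectors' actual code.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

class MarkLimitSketch {
    // Reads up to markLimit bytes for inspection, rewinds, then reads the whole stream
    // to show that nothing was consumed from the caller's point of view.
    static String peekThenReadAll(byte[] data, int markLimit) {
        try (InputStream in = new BufferedInputStream(new ByteArrayInputStream(data))) {
            in.mark(markLimit);                      // remember the current position
            byte[] head = in.readNBytes(markLimit);  // read no further than the limit
            in.reset();                              // rewind to the marked position
            byte[] all = in.readAllBytes();
            return new String(head, StandardCharsets.US_ASCII)
                    + " | " + new String(all, StandardCharsets.US_ASCII);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        byte[] html = "<html><meta charset=utf-8>".getBytes(StandardCharsets.US_ASCII);
        System.out.println(peekThenReadAll(html, 6));
    }
}
```

Reading past the mark limit before calling reset() is not guaranteed to work, which is why the sketch never reads more than markLimit bytes while peeking.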
Uses of Field in org.apache.tika.parser.html.charsetdetector
Methods in org.apache.tika.parser.html.charsetdetector with annotations of type Field
void StandardHtmlEncodingDetector.setMarkLimit(int markLimit)
    How far into the stream to read for charset detection.
Uses of Field in org.apache.tika.parser.image
Methods in org.apache.tika.parser.image with annotations of type Field
void PSDParser.setMaxDataLengthBytes(int maxDataLengthBytes)
void BPGParser.setMaxRecordLength(int maxRecordLength)
Uses of Field in org.apache.tika.parser.microsoft
Methods in org.apache.tika.parser.microsoft with annotations of type Field
void AbstractOfficeParser.setByteArrayMaxOverride(int maxOverride)
    WARNING: this sets a static variable in POI.
void AbstractOfficeParser.setConcatenatePhoneticRuns(boolean concatenatePhoneticRuns)
void AbstractOfficeParser.setDateFormatOverride(String format)
void AbstractOfficeParser.setExtractAllAlternativesFromMSG(boolean extractAllAlternativesFromMSG)
    Some .msg files can contain body content in html, rtf and/or text.
void AbstractOfficeParser.setExtractMacros(boolean extractMacros)
void AbstractOfficeParser.setIncludeDeletedContent(boolean includeDeletedConent)
void AbstractOfficeParser.setIncludeHeadersAndFooters(boolean includeHeadersAndFooters)
void AbstractOfficeParser.setIncludeMoveFromContent(boolean includeMoveFromContent)
void AbstractOfficeParser.setIncludeShapeBasedContent(boolean includeShapeBasedContent)
void AbstractOfficeParser.setUseSAXDocxExtractor(boolean useSAXDocxExtractor)
void AbstractOfficeParser.setUseSAXPptxExtractor(boolean useSAXPptxExtractor)
void AbstractOfficeParser.setWriteSelectHeadersInBody(boolean val)
    If set to true, this will write the to/from/cc into the body content
Uses of Field in org.apache.tika.parser.microsoft.libpst
Methods in org.apache.tika.parser.microsoft.libpst with annotations of type Field
void LibPstParser.setIncludeDeleted(boolean includeDeleted)
void LibPstParser.setMaxEmails(int maxEmails)
void LibPstParser.setProcessEmailAsMsg(boolean processEmailAsMsg)
void LibPstParser.setReadPstPath(String readPstPath)
    This should include the path up to but not including 'readpst', e.g.
void LibPstParser.setTimeoutSeconds(long timeoutSeconds)
Uses of Field in org.apache.tika.parser.microsoft.rtf
Methods in org.apache.tika.parser.microsoft.rtf with annotations of type Field
void RTFParser.setMemoryLimitInKb(int memoryLimitInKb)
Uses of Field in org.apache.tika.parser.mp3
Methods in org.apache.tika.parser.mp3 with annotations of type Field
void Mp3Parser.setMaxRecordSize(int maxRecordSize)
    This statically sets the max record size in ID3v2Frame
Uses of Field in org.apache.tika.parser.ocr
Methods in org.apache.tika.parser.ocr with annotations of type Field
void TesseractOCRParser.setApplyRotation(boolean applyRotation)
void TesseractOCRParser.setColorspace(String colorspace)
void TesseractOCRParser.setDensity(int density)
void TesseractOCRParser.setDepth(int depth)
void TesseractOCRParser.setEnableImagePreprocessing(boolean enableImagePreprocessing)
void TesseractOCRParser.setFilter(String filter)
void TesseractOCRParser.setImageMagickPath(String imageMagickPath)
    Set the path to the ImageMagick executable directory, needed if it is not on system path.
void TesseractOCRParser.setInlineContent(boolean inlineContent)
void TesseractOCRParser.setLanguage(String language)
void TesseractOCRParser.setMaxFileSizeToOcr(long maxFileSizeToOcr)
void TesseractOCRParser.setMinFileSizeToOcr(long minFileSizeToOcr)
void TesseractOCRParser.setOtherTesseractSettings(List<String> settings)
void TesseractOCRParser.setOutputType(String outputType)
void TesseractOCRParser.setPageSegMode(String pageSegMode)
void TesseractOCRParser.setPreloadLangs(boolean preloadLangs)
    If set to true and if tesseract is found, this will load the langs that result from --list-langs.
void TesseractOCRParser.setPreserveInterwordSpacing(boolean preserveInterwordSpacing)
void TesseractOCRParser.setResize(int resize)
void TesseractOCRParser.setSkipOCR(boolean skipOCR)
void TesseractOCRParser.setTessdataPath(String tessdataPath)
    Set the path to the 'tessdata' folder, which contains language files and config files.
void TesseractOCRParser.setTesseractPath(String tesseractPath)
    Set the path to the Tesseract executable's directory, needed if it is not on system path.
void TesseractOCRParser.setTimeout(int timeout)
    Set default timeout in seconds.
Uses of Field in org.apache.tika.parser.odf
Methods in org.apache.tika.parser.odf with annotations of type Field
void FlatOpenDocumentParser.setExtractMacros(boolean extractMacros)
void OpenDocumentParser.setExtractMacros(boolean extractMacros)
Uses of Field in org.apache.tika.parser.pdf
Methods in org.apache.tika.parser.pdf with annotations of type Field
void PDFParser.setAllowExtractionForAccessibility(boolean allowExtractionForAccessibility)
void PDFParser.setAverageCharTolerance(float averageCharTolerance)
void PDFParser.setCatchIntermediateExceptions(boolean catchIntermediateExceptions)
void PDFParser.setDetectAngles(boolean detectAngles)
void PDFParser.setDropThreshold(float dropThreshold)
void PDFParser.setEnableAutoSpace(boolean v)
    If true (the default), the parser should estimate where spaces should be inserted between words.
void PDFParser.setExtractAcroFormContent(boolean extractAcroFormContent)
void PDFParser.setExtractActions(boolean extractActions)
void PDFParser.setExtractAnnotationText(boolean v)
    If true (the default), text in annotations will be extracted.
void PDFParser.setExtractBookmarksText(boolean extractBookmarksText)
void PDFParser.setExtractFontNames(boolean extractFontNames)
void PDFParser.setExtractIncrementalUpdateInfo(boolean setExtractIncrementalUpdateInfo)
    Whether or not to scan a PDF for incremental updates.
void PDFParser.setExtractInlineImageMetadataOnly(boolean extractInlineImageMetadataOnly)
void PDFParser.setExtractInlineImages(boolean extractInlineImages)
void PDFParser.setExtractMarkedContent(boolean extractMarkedContent)
void PDFParser.setExtractUniqueInlineImagesOnly(boolean extractUniqueInlineImagesOnly)
void PDFParser.setIfXFAExtractOnlyXFA(boolean ifXFAExtractOnlyXFA)
void PDFParser.setIgnoreContentStreamSpaceGlyphs(boolean v)
    If true, the parser should ignore spaces in the content stream and rely purely on the algorithm to determine where word breaks are (PDFBOX-3774).
void PDFParser.setImageGraphicsEngineFactory(ImageGraphicsEngineFactory imageGraphicsEngineFactory)
void PDFParser.setImageStrategy(String imageStrategy)
void PDFParser.setMaxIncrementalUpdates(int maxIncrementalUpdates)
    Set the maximum number of incremental updates to parse
void PDFParser.setMaxMainMemoryBytes(long maxMainMemoryBytes)
void PDFParser.setOcrDPI(int dpi)
void PDFParser.setOcrImageFormatName(String formatName)
void PDFParser.setOcrImageQuality(float imageQuality)
void PDFParser.setOcrImageType(String imageType)
void PDFParser.setOcrRenderingStrategy(String ocrRenderingStrategy)
void PDFParser.setOcrStrategy(String ocrStrategyString)
void PDFParser.setOcrStrategyAuto(String ocrStrategyAuto)
void PDFParser.setParseIncrementalUpdates(boolean parseIncrementalUpdates)
    If set to true, this will parse incremental updates if they exist within a PDF.
void PDFParser.setSetKCMS(boolean setKCMS)
void PDFParser.setSortByPosition(boolean v)
    If true, sort text tokens by their x/y position before extracting text.
void PDFParser.setSpacingTolerance(float spacingTolerance)
void PDFParser.setSuppressDuplicateOverlappingText(boolean v)
    If true, the parser should try to remove duplicated text over the same region.
void PDFParser.setThrowOnEncryptedPayload(boolean throwOnEncryptedPayload)
    If the file is a 'Collection' and contains an embedded file with a defined 'AssociatedFile' value of 'EncryptedPayload', then throw an EncryptedDocumentException.
Uses of Field in org.apache.tika.parser.pkg
Methods in org.apache.tika.parser.pkg with annotations of type Field
void CompressorParser.setDecompressConcatenated(boolean decompressConcatenated)
void PackageParser.setDetectCharsetsInEntryNames(boolean detectCharsetsInEntryNames)
    Whether or not to run the default charset detector against entry names in ZipFiles.
void CompressorParser.setMemoryLimitInKb(int memoryLimitInKb)
Uses of Field in org.apache.tika.parser.recognition
Methods in org.apache.tika.parser.recognition with annotations of type Field
void ObjectRecognitionParser.setRecogniser(String recogniserClass)
Uses of Field in org.apache.tika.parser.recognition.tf
Fields in org.apache.tika.parser.recognition.tf with annotations of type Field
protected URI TensorflowRESTRecogniser.apiBaseUri
protected double TensorflowRESTRecogniser.minConfidence
protected int TensorflowRESTRecogniser.topN
Uses of Field in org.apache.tika.parser.strings
Methods in org.apache.tika.parser.strings with annotations of type Field
void StringsParser.setEncoding(String encoding)
void StringsParser.setMinLength(int minLength)
void StringsParser.setStringsPath(String path)
    Sets the "strings" installation folder.
void StringsParser.setTimeoutSeconds(int timeoutSeconds)
Uses of Field in org.apache.tika.parser.transcribe.aws
Methods in org.apache.tika.parser.transcribe.aws with annotations of type Field
void AmazonTranscribe.setBucket(String bucket)
    Sets the client secret for the transcriber API.
void AmazonTranscribe.setClientId(String id)
    Sets the client Id for the transcriber API.
void AmazonTranscribe.setClientSecret(String secret)
    Sets the client secret for the transcriber API.
void AmazonTranscribe.setRegion(String region)
Uses of Field in org.apache.tika.parser.txt
Methods in org.apache.tika.parser.txt with annotations of type Field
void Icu4jEncodingDetector.setIgnoreCharsets(List<String> charsetsToIgnore)
void Icu4jEncodingDetector.setMarkLimit(int markLimit)
    How far into the stream to read for charset detection.
void UniversalEncodingDetector.setMarkLimit(int markLimit)
    How far into the stream to read for charset detection.
void Icu4jEncodingDetector.setStripMarkup(boolean stripMarkup)
    Whether or not to attempt to strip html-ish markup from the stream before sending it to the underlying detector.
Uses of Field in org.apache.tika.parser.wordperfect
Methods in org.apache.tika.parser.wordperfect with annotations of type Field
void WordPerfectParser.setIncludeDeletedContent(boolean includeDeletedContent)
    Whether or not to include deleted content.
Uses of Field in org.apache.tika.pipes
Methods in org.apache.tika.pipes with annotations of type Field
void CompositePipesReporter.addPipesReporter(PipesReporter pipesReporter)
void PipesReporterBase.setExcludes(List<String> excludes)
void PipesReporterBase.setIncludes(List<String> includes)
Uses of Field in org.apache.tika.pipes.emitter.azblob
Methods in org.apache.tika.pipes.emitter.azblob with annotations of type Field
void AZBlobEmitter.setContainer(String container)
void AZBlobEmitter.setEndpoint(String endpoint)
void AZBlobEmitter.setFileExtension(String fileExtension)
    If you want to customize the output file's file extension.
void AZBlobEmitter.setOverwriteExisting(boolean overwriteExisting)
void AZBlobEmitter.setPrefix(String prefix)
void AZBlobEmitter.setSasToken(String sasToken)
Uses of Field in org.apache.tika.pipes.emitter.fs
Methods in org.apache.tika.pipes.emitter.fs with annotations of type Field
void FileSystemEmitter.setBasePath(String basePath)
void FileSystemEmitter.setFileExtension(String fileExtension)
    If you want to customize the output file's file extension.
void FileSystemEmitter.setOnExists(String onExists)
    What to do if the target file already exists.
void FileSystemEmitter.setPrettyPrint(boolean prettyPrint)
Uses of Field in org.apache.tika.pipes.emitter.gcs
Methods in org.apache.tika.pipes.emitter.gcs with annotations of type Field
void GCSEmitter.setBucket(String bucket)
void GCSEmitter.setFileExtension(String fileExtension)
    If you want to customize the output file's file extension.
void GCSEmitter.setPrefix(String prefix)
void GCSEmitter.setProjectId(String projectId)
Uses of Field in org.apache.tika.pipes.emitter.jdbc
Methods in org.apache.tika.pipes.emitter.jdbc with annotations of type Field
void JDBCEmitter.setAttachmentStrategy(String attachmentStrategy)
void JDBCEmitter.setConnection(String connection)
void JDBCEmitter.setCreateTable(String createTable)
void JDBCEmitter.setInsert(String insert)
void JDBCEmitter.setKeys(Map<String,String> keys)
    The implementation of keys should be a LinkedHashMap because order matters!
void JDBCEmitter.setMaxStringLength(int maxStringLength)
    Set the maximum string length in characters (not bytes).
void JDBCEmitter.setMultivaluedFieldDelimiter(String delimiter)
void JDBCEmitter.setMultivaluedFieldStrategy(String strategy)
    This applies to fields of type 'string' or 'varchar'.
void JDBCEmitter.setPostConnection(String postConnection)
    This sql will be called immediately after the connection is made.
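The setKeys note above says order matters. One plausible reason, assumed here rather than taken from the source, is that insert placeholders are bound positionally, so the key iteration order must match the column order. A minimal sketch with a hypothetical `KeyOrderSketch` helper, a hypothetical table name, and hypothetical column names:

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;

class KeyOrderSketch {
    // Builds an insert statement whose column order follows the map's iteration order.
    // With a LinkedHashMap that is insertion order; a HashMap gives no such guarantee.
    static String insertSql(String table, Map<String, String> keys) {
        String columns = String.join(", ", keys.keySet());
        String placeholders = String.join(", ", Collections.nCopies(keys.size(), "?"));
        return "insert into " + table + " (" + columns + ") values (" + placeholders + ")";
    }

    public static void main(String[] args) {
        // Column name -> column type; both are illustrative values.
        Map<String, String> keys = new LinkedHashMap<>();
        keys.put("path", "varchar(512)");
        keys.put("title", "varchar(256)");
        keys.put("created", "timestamp");
        System.out.println(insertSql("docs", keys));
    }
}
```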
Uses of Field in org.apache.tika.pipes.emitter.kafka
Methods in org.apache.tika.pipes.emitter.kafka with annotations of type Field
void KafkaEmitter.setAcks(String acks)
void KafkaEmitter.setBootstrapServers(String bootstrapServers)
void KafkaEmitter.setBufferMemory(int bufferMemory)
void KafkaEmitter.setClientId(String clientId)
void KafkaEmitter.setCompressionType(String compressionType)
void KafkaEmitter.setConnectionsMaxIdleMs(int connectionsMaxIdleMs)
void KafkaEmitter.setDeliveryTimeoutMs(int deliveryTimeoutMs)
void KafkaEmitter.setEnableIdempotence(boolean enableIdempotence)
void KafkaEmitter.setInterceptorClasses(String interceptorClasses)
void KafkaEmitter.setKeySerializer(String keySerializer)
void KafkaEmitter.setLingerMs(int lingerMs)
void KafkaEmitter.setMaxBlockMs(int maxBlockMs)
void KafkaEmitter.setMaxInFlightRequestsPerConnection(int maxInFlightRequestsPerConnection)
void KafkaEmitter.setMaxRequestSize(int maxRequestSize)
void KafkaEmitter.setMetadataMaxAgeMs(int metadataMaxAgeMs)
void KafkaEmitter.setRequestTimeoutMs(int requestTimeoutMs)
void KafkaEmitter.setRetries(int retries)
void KafkaEmitter.setRetryBackoffMs(int retryBackoffMs)
void KafkaEmitter.setTopic(String topic)
void KafkaEmitter.setTransactionalId(String transactionalId)
void KafkaEmitter.setTransactionTimeoutMs(int transactionTimeoutMs)
void KafkaEmitter.setValueSerializer(String valueSerializer)
Uses of Field in org.apache.tika.pipes.emitter.opensearch
Methods in org.apache.tika.pipes.emitter.opensearch with annotations of type Field
void OpenSearchEmitter.setAttachmentStrategy(String attachmentStrategy)
    Options: SEPARATE_DOCUMENTS, PARENT_CHILD.
void OpenSearchEmitter.setAuthScheme(String authScheme)
void OpenSearchEmitter.setCommitWithin(int commitWithin)
void OpenSearchEmitter.setConnectionTimeout(int connectionTimeout)
void OpenSearchEmitter.setEmbeddedFileFieldName(String embeddedFileFieldName)
    If using the OpenSearchEmitter.AttachmentStrategy.PARENT_CHILD, this is the field name used to store the child documents.
void OpenSearchEmitter.setIdField(String idField)
    Specify the field in the first Metadata that should be used as the id field for the document.
void OpenSearchEmitter.setOpenSearchUrl(String openSearchUrl)
void OpenSearchEmitter.setPassword(String password)
void OpenSearchEmitter.setProxyHost(String proxyHost)
void OpenSearchEmitter.setProxyPort(int proxyPort)
void OpenSearchEmitter.setSocketTimeout(int socketTimeout)
void OpenSearchEmitter.setUserName(String userName)
Uses of Field in org.apache.tika.pipes.emitter.s3
Methods in org.apache.tika.pipes.emitter.s3 with annotations of type Field
void S3Emitter.setAccessKey(String accessKey)
void S3Emitter.setBucket(String bucket)
void S3Emitter.setCredentialsProvider(String credentialsProvider)
void S3Emitter.setEndpointConfigurationService(String endpointConfigurationService)
void S3Emitter.setFileExtension(String fileExtension)
    If you want to customize the output file's file extension.
void S3Emitter.setMaxConnections(int maxConnections)
    maximum number of http connections allowed.
void S3Emitter.setPathStyleAccessEnabled(boolean pathStyleAccessEnabled)
void S3Emitter.setPrefix(String prefix)
void S3Emitter.setProfile(String profile)
void S3Emitter.setRegion(String region)
void S3Emitter.setSecretKey(String secretKey)
void S3Emitter.setSpoolToTemp(boolean spoolToTemp)
    Whether or not to spool the metadatalist to a tmp file before putting object.
Uses of Field in org.apache.tika.pipes.emitter.solr
Methods in org.apache.tika.pipes.emitter.solr with annotations of type Field
void SolrEmitter.setAttachmentStrategy(String attachmentStrategy)
    Options: SKIP, CONCATENATE_CONTENT, PARENT_CHILD.
void SolrEmitter.setAuthScheme(String authScheme)
void SolrEmitter.setCommitWithin(int commitWithin)
void SolrEmitter.setConnectionTimeout(int connectionTimeout)
void SolrEmitter.setEmbeddedFileFieldName(String embeddedFileFieldName)
    If using the SolrEmitter.AttachmentStrategy.PARENT_CHILD, this is the field name used to store the child documents.
void SolrEmitter.setIdField(String idField)
    Specify the field in the first Metadata that should be used as the id field for the document.
void SolrEmitter.setPassword(String password)
void SolrEmitter.setProxyHost(String proxyHost)
void SolrEmitter.setProxyPort(int proxyPort)
void SolrEmitter.setSocketTimeout(int socketTimeout)
void SolrEmitter.setSolrCollection(String solrCollection)
void SolrEmitter.setSolrUrls(List<String> solrUrls)
void SolrEmitter.setSolrZkChroot(String solrZkChroot)
void SolrEmitter.setSolrZkHosts(List<String> solrZkHosts)
void SolrEmitter.setUpdateStrategy(String updateStrategy)
void SolrEmitter.setUserName(String userName)
Uses of Field in org.apache.tika.pipes.fetcher
Methods in org.apache.tika.pipes.fetcher with annotations of type Field
void AbstractFetcher.setName(String name)
Uses of Field in org.apache.tika.pipes.fetcher.azblob
Methods in org.apache.tika.pipes.fetcher.azblob with annotations of type Field
void AZBlobFetcher.setContainer(String container)
void AZBlobFetcher.setEndpoint(String endpoint)
void AZBlobFetcher.setExtractUserMetadata(boolean extractUserMetadata)
    Whether or not to extract user metadata from the blob object
void AZBlobFetcher.setSasToken(String sasToken)
void AZBlobFetcher.setSpoolToTemp(boolean spoolToTemp)
Uses of Field in org.apache.tika.pipes.fetcher.fs
Methods in org.apache.tika.pipes.fetcher.fs with annotations of type Field
void FileSystemFetcher.setBasePath(String basePath)
    Default behavior is that clients will send in relative paths; this must be set to allow this fetcher to fetch the full path.
void FileSystemFetcher.setExtractFileSystemMetadata(boolean extractFileSystemMetadata)
    Extract file system metadata (created, modified, accessed) when fetching file.
Uses of Field in org.apache.tika.pipes.fetcher.gcs
Methods in org.apache.tika.pipes.fetcher.gcs with annotations of type Field
void GCSFetcher.setBucket(String bucket)
void GCSFetcher.setExtractUserMetadata(boolean extractUserMetadata)
    Whether or not to extract user metadata from the S3Object
void GCSFetcher.setProjectId(String projectId)
void GCSFetcher.setSpoolToTemp(boolean spoolToTemp)
Uses of Field in org.apache.tika.pipes.fetcher.http
Methods in org.apache.tika.pipes.fetcher.http with annotations of type Field
void HttpFetcher.setAuthScheme(String authScheme)
void HttpFetcher.setConnectTimeout(int connectTimeout)
void HttpFetcher.setHttpHeaders(List<String> headers)
    Which http headers should we capture in the metadata.
void HttpFetcher.setHttpRequestHeaders(List<String> headers)
    Which http request headers should we send in the http fetch requests.
void HttpFetcher.setJwtExpiresInSeconds(int jwtExpiresInSeconds)
void HttpFetcher.setJwtIssuer(String jwtIssuer)
void HttpFetcher.setJwtPrivateKeyBase64(String jwtPrivateKeyBase64)
void HttpFetcher.setJwtSecret(String jwtSecret)
void HttpFetcher.setJwtSubject(String jwtSubject)
void HttpFetcher.setMaxConnections(int maxConnections)
void HttpFetcher.setMaxConnectionsPerRoute(int maxConnectionsPerRoute)
void HttpFetcher.setMaxErrMsgSize(int maxErrMsgSize)
void HttpFetcher.setMaxRedirects(int maxRedirects)
void HttpFetcher.setMaxSpoolSize(long maxSpoolSize)
    Set the maximum number of bytes to spool to a temp file.
void HttpFetcher.setNtDomain(String domain)
void HttpFetcher.setOverallTimeout(long overallTimeout)
    This sets an overall timeout on the request.
void HttpFetcher.setPassword(String password)
void HttpFetcher.setProxyHost(String proxyHost)
void HttpFetcher.setProxyPort(int proxyPort)
void HttpFetcher.setRequestTimeout(int requestTimeout)
void HttpFetcher.setSocketTimeout(int socketTimeout)
void HttpFetcher.setUserAgent(String userAgent)
    When making the request, what User-Agent is sent in the request.
void HttpFetcher.setUserName(String userName)
Uses of Field in org.apache.tika.pipes.fetcher.s3
Methods in org.apache.tika.pipes.fetcher.s3 with annotations of type Field
void S3Fetcher.setAccessKey(String accessKey)
void S3Fetcher.setBucket(String bucket)
void S3Fetcher.setCredentialsProvider(String credentialsProvider)
void S3Fetcher.setEndpointConfigurationService(String endpointConfigurationService)
void S3Fetcher.setExtractUserMetadata(boolean extractUserMetadata)
    Whether or not to extract user metadata from the S3Object
void S3Fetcher.setMaxConnections(int maxConnections)
void S3Fetcher.setMaxLength(long maxLength)
void S3Fetcher.setPathStyleAccessEnabled(boolean pathStyleAccessEnabled)
void S3Fetcher.setPrefix(String prefix)
    prefix to prepend to the fetch key before fetching.
void S3Fetcher.setProfile(String profile)
void S3Fetcher.setRegion(String region)
void S3Fetcher.setSecretKey(String secretKey)
void S3Fetcher.setSleepBeforeRetryMillis(long sleepBeforeRetryMillis)
    Deprecated.
void S3Fetcher.setSpoolToTemp(boolean spoolToTemp)
void S3Fetcher.setThrottleSeconds(String commaDelimitedLongs)
    Set seconds to throttle retries as a comma-delimited list, e.g.: 30,60,120,600
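The setThrottleSeconds description above takes the retry schedule as a comma-delimited string. A minimal sketch of turning such a string into per-retry waits could look like this. `ThrottleSecondsSketch` is hypothetical; how the fetcher itself parses or applies the values is not stated in this page.

```java
import java.util.Arrays;

class ThrottleSecondsSketch {
    // Parses "30,60,120,600" into {30, 60, 120, 600}; surrounding whitespace is tolerated.
    // Throws NumberFormatException if any entry is not a valid long.
    static long[] parse(String commaDelimitedLongs) {
        String[] parts = commaDelimitedLongs.split(",");
        long[] seconds = new long[parts.length];
        for (int i = 0; i < parts.length; i++) {
            seconds[i] = Long.parseLong(parts[i].trim());
        }
        return seconds;
    }

    public static void main(String[] args) {
        long[] waits = parse("30,60,120,600");
        // Each entry would be the wait, in seconds, before the corresponding retry.
        System.out.println(Arrays.toString(waits));
    }
}
```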
Uses of Field in org.apache.tika.pipes.fetchers.microsoftgraph
Methods in org.apache.tika.pipes.fetchers.microsoftgraph with annotations of type Field
void MicrosoftGraphFetcher.setThrottleSeconds(String commaDelimitedLongs)
    Set seconds to throttle retries as a comma-delimited list, e.g.: 30,60,120,600
Uses of Field in org.apache.tika.pipes.pipesiterator
Methods in org.apache.tika.pipes.pipesiterator with annotations of type Field
void PipesIterator.setEmitterName(String emitterName)
void PipesIterator.setFetcherName(String fetcherName)
void PipesIterator.setHandlerType(String handlerType)
void PipesIterator.setMaxEmbeddedResources(int maxEmbeddedResources)
void PipesIterator.setMaxWaitMs(long maxWaitMs)
void PipesIterator.setOnParseException(String onParseException)
void PipesIterator.setParseMode(String parseModeString)
void PipesIterator.setQueueSize(int queueSize)
void PipesIterator.setThrowOnWriteLimitReached(boolean throwOnWriteLimitReached)
void PipesIterator.setWriteLimit(int writeLimit)
Uses of Field in org.apache.tika.pipes.pipesiterator.azblob
Methods in org.apache.tika.pipes.pipesiterator.azblob with annotations of type Field
Modifier and Type  Method  Description
void  AZBlobPipesIterator.setContainer(String container)
void  AZBlobPipesIterator.setEndpoint(String endpoint)
void  AZBlobPipesIterator.setPrefix(String prefix)
void  AZBlobPipesIterator.setSasToken(String sasToken)
-
Uses of Field in org.apache.tika.pipes.pipesiterator.csv
Methods in org.apache.tika.pipes.pipesiterator.csv with annotations of type Field
Modifier and Type  Method  Description
void  CSVPipesIterator.setCsvPath(String csvPath)
void  CSVPipesIterator.setCsvPath(Path csvPath)
void  CSVPipesIterator.setEmitKeyColumn(String emitKeyColumn)
void  CSVPipesIterator.setFetchKeyColumn(String fetchKeyColumn)
void  CSVPipesIterator.setIdColumn(String idColumn)
-
Uses of Field in org.apache.tika.pipes.pipesiterator.filelist
Methods in org.apache.tika.pipes.pipesiterator.filelist with annotations of type Field
Modifier and Type  Method  Description
void  FileListPipesIterator.setFileList(String path)
void  FileListPipesIterator.setHasHeader(boolean hasHeader)
-
Uses of Field in org.apache.tika.pipes.pipesiterator.fs
Methods in org.apache.tika.pipes.pipesiterator.fs with annotations of type Field
Modifier and Type  Method  Description
void  FileSystemPipesIterator.setBasePath(String basePath)
void  FileSystemPipesIterator.setCountTotal(boolean countTotal)
-
Uses of Field in org.apache.tika.pipes.pipesiterator.gcs
Methods in org.apache.tika.pipes.pipesiterator.gcs with annotations of type Field
Modifier and Type  Method  Description
void  GCSPipesIterator.setBucket(String bucket)
void  GCSPipesIterator.setPrefix(String prefix)
void  GCSPipesIterator.setProjectId(String projectId)
-
Uses of Field in org.apache.tika.pipes.pipesiterator.jdbc
Methods in org.apache.tika.pipes.pipesiterator.jdbc with annotations of type Field
Modifier and Type  Method  Description
void  JDBCPipesIterator.setConnection(String connection)
void  JDBCPipesIterator.setEmitKeyColumn(String fetchKeyColumn)
void  JDBCPipesIterator.setFetchKeyColumn(String fetchKeyColumn)
void  JDBCPipesIterator.setFetchKeyRangeEndColumn(String fetchKeyRangeEndColumn)
void  JDBCPipesIterator.setFetchKeyRangeStartColumn(String fetchKeyRangeStartColumn)
void  JDBCPipesIterator.setFetchSize(int fetchSize)
void  JDBCPipesIterator.setIdColumn(String idColumn)
void  JDBCPipesIterator.setSelect(String select)
-
Uses of Field in org.apache.tika.pipes.pipesiterator.kafka
Methods in org.apache.tika.pipes.pipesiterator.kafka with annotations of type Field
Modifier and Type  Method  Description
void  KafkaPipesIterator.setAutoOffsetReset(String autoOffsetReset)
void  KafkaPipesIterator.setBootstrapServers(String bootstrapServers)
void  KafkaPipesIterator.setEmitMax(int emitMax)
    The Kafka pipes iterator will keep polling for more documents until it returns an empty result.
void  KafkaPipesIterator.setGroupId(String groupId)
void  KafkaPipesIterator.setGroupInitialRebalanceDelayMs(int groupInitialRebalanceDelayMs)
void  KafkaPipesIterator.setKeySerializer(String keySerializer)
void  KafkaPipesIterator.setPollDelayMs(int pollDelayMs)
void  KafkaPipesIterator.setTopic(String topic)
void  KafkaPipesIterator.setValueSerializer(String valueSerializer)
-
Uses of Field in org.apache.tika.pipes.pipesiterator.s3
Methods in org.apache.tika.pipes.pipesiterator.s3 with annotations of type Field
Modifier and Type  Method  Description
void  S3PipesIterator.setAccessKey(String accessKey)
void  S3PipesIterator.setBucket(String bucket)
void  S3PipesIterator.setCredentialsProvider(String credentialsProvider)
void  S3PipesIterator.setEndpointConfigurationService(String endpointConfigurationService)
void  S3PipesIterator.setFileNamePattern(String fileNamePattern)
void  S3PipesIterator.setFileNamePattern(Pattern fileNamePattern)
void  S3PipesIterator.setMaxConnections(int maxConnections)
void  S3PipesIterator.setPathStyleAccessEnabled(boolean pathStyleAccessEnabled)
void  S3PipesIterator.setPrefix(String prefix)
void  S3PipesIterator.setProfile(String profile)
void  S3PipesIterator.setRegion(String region)
void  S3PipesIterator.setSecretKey(String secretKey)
-
Uses of Field in org.apache.tika.pipes.pipesiterator.solr
Methods in org.apache.tika.pipes.pipesiterator.solr with annotations of type Field
Modifier and Type  Method  Description
void  SolrPipesIterator.setAdditionalFields(List<String> additionalFields)
void  SolrPipesIterator.setAuthScheme(String authScheme)
void  SolrPipesIterator.setConnectionTimeout(int connectionTimeout)
void  SolrPipesIterator.setFailCountField(String failCountField)
void  SolrPipesIterator.setFilters(List<String> filters)
void  SolrPipesIterator.setIdField(String idField)
void  SolrPipesIterator.setParsingIdField(String parsingIdField)
void  SolrPipesIterator.setPassword(String password)
void  SolrPipesIterator.setProxyHost(String proxyHost)
void  SolrPipesIterator.setProxyPort(int proxyPort)
void  SolrPipesIterator.setRows(int rows)
void  SolrPipesIterator.setSizeFieldName(String sizeFieldName)
void  SolrPipesIterator.setSocketTimeout(int socketTimeout)
void  SolrPipesIterator.setSolrCollection(String solrCollection)
void  SolrPipesIterator.setSolrUrls(List<String> solrUrls)
void  SolrPipesIterator.setSolrZkChroot(String solrZkChroot)
void  SolrPipesIterator.setSolrZkHosts(List<String> solrZkHosts)
void  SolrPipesIterator.setUserName(String userName)
-
Uses of Field in org.apache.tika.pipes.reporters.fs
Methods in org.apache.tika.pipes.reporters.fs with annotations of type Field
Modifier and Type  Method  Description
void  FileSystemStatusReporter.setReportUpdateMillis(long millis)
void  FileSystemStatusReporter.setStatusFile(String path)
-
Uses of Field in org.apache.tika.pipes.reporters.jdbc
Methods in org.apache.tika.pipes.reporters.jdbc with annotations of type Field
Modifier and Type  Method  Description
void  JDBCPipesReporter.setCacheSize(int cacheSize)
    Commit the reports if the cache is greater than or equal to this size.
void  JDBCPipesReporter.setConnection(String connection)
void  JDBCPipesReporter.setCreateTable(boolean createTable)
    The default is true.
void  JDBCPipesReporter.setPostConnection(String postConnection)
    This SQL will be called immediately after the connection is made.
void  JDBCPipesReporter.setReportSql(String reportSql)
    This is the SQL for the prepared statement to execute to store the report record. The default is: insert into tika_status (id, status, timestamp) values (?
void  JDBCPipesReporter.setReportVariables(List<String> variables)
    ADVANCED: This is used to set the variables in the prepared statement for the report.
void  JDBCPipesReporter.setReportWithinMs(long reportWithinMs)
    Commit the reports if the amount of time elapsed since the last report commit exceeds this value.
void  JDBCPipesReporter.setTableName(String tableName)
    The default is JDBCPipesReporter.TABLE_NAME
-
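The descriptions of setCacheSize and setReportWithinMs together imply a commit rule based on size or elapsed time. The sketch below shows one way such a rule could be expressed. The class ReportFlushPolicySketch and its methods are hypothetical and are not taken from JDBCPipesReporter, and timestamps are passed in explicitly to keep the example deterministic.

```java
/** Hypothetical size-or-time commit rule for illustration; not Tika's code. */
final class ReportFlushPolicySketch {
    private final int cacheSize;
    private final long reportWithinMs;
    private long lastCommitMs;

    ReportFlushPolicySketch(int cacheSize, long reportWithinMs, long startMs) {
        this.cacheSize = cacheSize;
        this.reportWithinMs = reportWithinMs;
        this.lastCommitMs = startMs;
    }

    /**
     * Returns true if the cached reports should be committed: either the
     * cache holds at least cacheSize reports, or more than reportWithinMs
     * milliseconds have elapsed since the last commit.
     */
    boolean shouldCommit(int cachedReports, long nowMs) {
        return cachedReports >= cacheSize
                || nowMs - lastCommitMs > reportWithinMs;
    }

    /** Records the time of a commit so the elapsed-time check restarts. */
    void markCommitted(long nowMs) {
        lastCommitMs = nowMs;
    }

    public static void main(String[] args) {
        ReportFlushPolicySketch policy = new ReportFlushPolicySketch(3, 1000L, 0L);
        System.out.println(policy.shouldCommit(2, 500L));  // below both thresholds
        System.out.println(policy.shouldCommit(3, 500L));  // cache size reached
        System.out.println(policy.shouldCommit(0, 1001L)); // time limit exceeded
    }
}
```

Combining the two conditions with a logical OR means a slow trickle of reports is still committed once the time limit passes, while a burst is committed as soon as the cache fills.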
Uses of Field in org.apache.tika.pipes.reporters.opensearch
Methods in org.apache.tika.pipes.reporters.opensearch with annotations of type Field
Modifier and Type  Method  Description
void  OpenSearchPipesReporter.setAuthScheme(String authScheme)
void  OpenSearchPipesReporter.setConnectionTimeout(int connectionTimeout)
void  OpenSearchPipesReporter.setExcludeStatuses(List<String> statusList)
void  OpenSearchPipesReporter.setIncludeRouting(boolean includeRouting)
void  OpenSearchPipesReporter.setIncludeStatuses(List<String> statusList)
void  OpenSearchPipesReporter.setKeyPrefix(String keyPrefix)
    This prefixes the keys before sending them to OpenSearch.
void  OpenSearchPipesReporter.setOpenSearchUrl(String openSearchUrl)
void  OpenSearchPipesReporter.setPassword(String password)
void  OpenSearchPipesReporter.setProxyHost(String proxyHost)
void  OpenSearchPipesReporter.setProxyPort(int proxyPort)
void  OpenSearchPipesReporter.setSocketTimeout(int socketTimeout)
void  OpenSearchPipesReporter.setUserName(String userName)
-