Uses of Annotation Type
org.apache.tika.config.Field
Packages that use Field
Package
Description
Media type detection.
Extraction of component documents.
Tika parsers.
-
Uses of Field in org.apache.tika.detect
Methods in org.apache.tika.detect with annotations of type FieldModifier and TypeMethodDescriptionvoidFileCommandDetector.setFilePath(String fileCommandPath) voidFileCommandDetector.setMaxBytes(int maxBytes) If this is not called on a TikaInputStream, this detector will spool up to this many bytes to a file to be detected by the 'file' command.voidFileCommandDetector.setTimeoutMs(long timeoutMs) voidFileCommandDetector.setUseMime(boolean useMime) -
Uses of Field in org.apache.tika.detect.magika
Methods in org.apache.tika.detect.magika with annotations of type FieldModifier and TypeMethodDescriptionvoidMagikaDetector.setMagikaPath(String fileCommandPath) voidMagikaDetector.setMaxBytes(int maxBytes) If this is not called on a TikaInputStream, this detector will spool up to this many bytes to a file to be detected by the 'file' command.voidMagikaDetector.setTimeoutMs(long timeoutMs) voidMagikaDetector.setUseMime(boolean useMime) As default behavior, Tika runs magika to add its detection to the metadata, but NOT to use detection in determining parsers etc. -
Uses of Field in org.apache.tika.detect.siegfried
Methods in org.apache.tika.detect.siegfried with annotations of type FieldModifier and TypeMethodDescriptionvoidSiegfriedDetector.setMaxBytes(int maxBytes) If this is not called on a TikaInputStream, this detector will spool up to this many bytes to a file to be detected by the 'file' command.voidSiegfriedDetector.setSiegfriedPath(String fileCommandPath) voidSiegfriedDetector.setTimeoutMs(long timeoutMs) voidSiegfriedDetector.setUseMime(boolean useMime) As default behavior, Tika runs Siegfried to add its detection to the metadata, but NOT to use detection in determining parsers etc. -
Uses of Field in org.apache.tika.detect.zip
Methods in org.apache.tika.detect.zip with annotations of type FieldModifier and TypeMethodDescriptionvoidDefaultZipContainerDetector.setMarkLimit(int markLimit) If this is less than 0 and a TikaInputStream is used, the file will be spooled to disk, and detection will run on the full file. -
Uses of Field in org.apache.tika.extractor
Methods in org.apache.tika.extractor with annotations of type FieldModifier and TypeMethodDescriptionvoidRUnpackExtractorFactory.setEmbeddedBytesExcludeEmbeddedResourceTypes(List<String> excludeAttachmentTypes) voidRUnpackExtractorFactory.setEmbeddedBytesExcludeMimeTypes(List<String> excludeMimeTypes) voidRUnpackExtractorFactory.setEmbeddedBytesIncludeEmbeddedResourceTypes(List<String> includeAttachmentTypes) voidRUnpackExtractorFactory.setEmbeddedBytesIncludeMimeTypes(List<String> includeMimeTypes) voidRUnpackExtractorFactory.setMaxEmbeddedBytesForExtraction(long maxEmbeddedBytesForExtraction) Total number of bytes to write out.voidParsingEmbeddedDocumentExtractorFactory.setWriteFileNameToContent(boolean writeFileNameToContent) voidRUnpackExtractorFactory.setWriteFileNameToContent(boolean writeFileNameToContent) -
Uses of Field in org.apache.tika.langdetect.opennlp.metadatafilter
Methods in org.apache.tika.langdetect.opennlp.metadatafilter with annotations of type FieldModifier and TypeMethodDescriptionvoidOpenNLPMetadataFilter.setMaxCharsForDetection(int maxCharsForDetection) -
Uses of Field in org.apache.tika.langdetect.optimaize.metadatafilter
Methods in org.apache.tika.langdetect.optimaize.metadatafilter with annotations of type FieldModifier and TypeMethodDescriptionvoidOptimaizeMetadataFilter.setMaxCharsForDetection(int maxCharsForDetection) -
Uses of Field in org.apache.tika.metadata.filter
Methods in org.apache.tika.metadata.filter with annotations of type FieldModifier and TypeMethodDescriptionvoidDateNormalizingMetadataFilter.setDefaultTimeZone(String timeZoneId) voidExcludeFieldMetadataFilter.setExclude(List<String> exclude) voidFieldNameMappingFilter.setExcludeUnmapped(boolean excludeUnmapped) If this istrue(default), this means that only the fields that have a "from" value in the mapper will be passed through.voidGeoPointMetadataFilter.setGeoPointFieldName(String geoPointFieldName) Set the field for the concatenated LATITUDE,LONGITUDE string.voidIncludeFieldMetadataFilter.setInclude(List<String> include) voidFieldNameMappingFilter.setMappings(Map<String, String> mappings) voidvoidvoidCaptureGroupMetadataFilter.setSourceField(String sourceField) voidCaptureGroupMetadataFilter.setTargetField(String targetField) voidFor types seeTikaCoreProperties.EmbeddedResourceType -
Uses of Field in org.apache.tika.parser
Methods in org.apache.tika.parser with annotations of type FieldModifier and TypeMethodDescriptionvoidRegexCaptureParser.setCaptureMap(Map<String, String> map) voidRegexCaptureParser.setMatchMap(Map<String, String> map) voidRegexCaptureParser.setWriteContent(boolean writeContent) -
Uses of Field in org.apache.tika.parser.csv
Methods in org.apache.tika.parser.csv with annotations of type FieldModifier and TypeMethodDescriptionvoidTextAndCSVParser.setNameToDelimiterMap(Map<String, String> map) -
Uses of Field in org.apache.tika.parser.digestutils
Methods in org.apache.tika.parser.digestutils with annotations of type FieldModifier and TypeMethodDescriptionvoidCommonsDigesterFactory.setAlgorithmString(String algorithmString) voidCommonsDigesterFactory.setMarkLimit(int markLimit) voidCommonsDigesterFactory.setSkipContainerDocument(boolean skipContainerDocument) -
Uses of Field in org.apache.tika.parser.dwg
Methods in org.apache.tika.parser.dwg with annotations of type FieldModifier and TypeMethodDescriptionvoidAbstractDWGParser.setCleanDwgReadOutput(boolean cleanDwgReadOutput) voidAbstractDWGParser.setCleanDwgReadOutputBatchSize(int cleanDwgReadOutputBatchSize) voidAbstractDWGParser.setCleanDwgReadRegexToReplace(String cleanDwgReadRegexToReplace) voidAbstractDWGParser.setCleanDwgReadReplaceWith(String cleanDwgReadReplaceWith) voidAbstractDWGParser.setDwgReadExecutable(String dwgReadExecutable) voidAbstractDWGParser.setDwgReadTimeout(long dwgReadTimeout) -
Uses of Field in org.apache.tika.parser.epub
Methods in org.apache.tika.parser.epub with annotations of type Field -
Uses of Field in org.apache.tika.parser.external2
Methods in org.apache.tika.parser.external2 with annotations of type FieldModifier and TypeMethodDescriptionvoidExternalParser.setCommandLine(List<String> commandLine) Use this to specify the full commandLine.voidExternalParser.setMaxStdErr(int maxStdErr) voidExternalParser.setMaxStdOut(int maxStdOut) voidExternalParser.setOutputParser(Parser parser) This parser is called on the output of the process.voidExternalParser.setReturnStderr(boolean returnStderr) If set to true, this will return the stderr in the metadata viaExternalProcess.STD_ERR.voidExternalParser.setReturnStdout(boolean returnStdout) If set to true, this will return the stdout in the metadata viaExternalProcess.STD_OUT.voidExternalParser.setSupportedTypes(List<String> supportedTypes) This is set during initialization from a tika-config.voidExternalParser.setTimeoutMs(long timeoutMs) -
Uses of Field in org.apache.tika.parser.gdal
Methods in org.apache.tika.parser.gdal with annotations of type FieldModifier and TypeMethodDescriptionvoidGDALParser.setMaxStdErr(int maxStdErr) voidGDALParser.setMaxStdOut(int maxStdOut) voidGDALParser.setTimeoutMs(long timeoutMs) -
Uses of Field in org.apache.tika.parser.geo.topic
Methods in org.apache.tika.parser.geo.topic with annotations of type FieldModifier and TypeMethodDescriptionvoidGeoParser.setGazetteerRestEndpoint(String gazetteerRestEndpoint) voidGeoParser.setNerModelUrl(String nerModelUrl) -
Uses of Field in org.apache.tika.parser.geopkg
Methods in org.apache.tika.parser.geopkg with annotations of type FieldModifier and TypeMethodDescriptionvoidGeoPkgParser.setIgnoreBlobColumns(List<String> ignoreBlobColumns) -
Uses of Field in org.apache.tika.parser.html
Methods in org.apache.tika.parser.html with annotations of type FieldModifier and TypeMethodDescriptionvoidJSoupParser.setExtractScripts(boolean extractScripts) Whether or not to extract contents in script entities.voidHtmlEncodingDetector.setMarkLimit(int markLimit) How far into the stream to read for charset detection. -
Uses of Field in org.apache.tika.parser.html.charsetdetector
Methods in org.apache.tika.parser.html.charsetdetector with annotations of type FieldModifier and TypeMethodDescriptionvoidStandardHtmlEncodingDetector.setMarkLimit(int markLimit) How far into the stream to read for charset detection. -
Uses of Field in org.apache.tika.parser.image
Methods in org.apache.tika.parser.image with annotations of type FieldModifier and TypeMethodDescriptionvoidPSDParser.setMaxDataLengthBytes(int maxDataLengthBytes) voidBPGParser.setMaxRecordLength(int maxRecordLength) -
Uses of Field in org.apache.tika.parser.microsoft
Methods in org.apache.tika.parser.microsoft with annotations of type FieldModifier and TypeMethodDescriptionvoidAbstractOfficeParser.setByteArrayMaxOverride(int maxOverride) WARNING: this sets a static variable in POI.voidAbstractOfficeParser.setConcatenatePhoneticRuns(boolean concatenatePhoneticRuns) voidAbstractOfficeParser.setDateFormatOverride(String format) voidAbstractOfficeParser.setExtractAllAlternativesFromMSG(boolean extractAllAlternativesFromMSG) Some .msg files can contain body content in html, rtf and/or text.voidAbstractOfficeParser.setExtractMacros(boolean extractMacros) voidAbstractOfficeParser.setIncludeDeletedContent(boolean includeDeletedConent) voidAbstractOfficeParser.setIncludeHeadersAndFooters(boolean includeHeadersAndFooters) voidAbstractOfficeParser.setIncludeMoveFromContent(boolean includeMoveFromContent) voidAbstractOfficeParser.setIncludeShapeBasedContent(boolean includeShapeBasedContent) voidAbstractOfficeParser.setUseSAXDocxExtractor(boolean useSAXDocxExtractor) voidAbstractOfficeParser.setUseSAXPptxExtractor(boolean useSAXPptxExtractor) voidAbstractOfficeParser.setWriteSelectHeadersInBody(boolean val) If set totrue, this will write the to/from/cc into the body content -
Uses of Field in org.apache.tika.parser.microsoft.libpst
Methods in org.apache.tika.parser.microsoft.libpst with annotations of type FieldModifier and TypeMethodDescriptionvoidLibPstParser.setIncludeDeleted(boolean includeDeleted) voidLibPstParser.setMaxEmails(int maxEmails) voidLibPstParser.setProcessEmailAsMsg(boolean processEmailAsMsg) voidLibPstParser.setReadPstPath(String readPstPath) This should include the path up to but not including 'readpst', e.g.voidLibPstParser.setTimeoutSeconds(long timeoutSeconds) -
Uses of Field in org.apache.tika.parser.microsoft.rtf
Methods in org.apache.tika.parser.microsoft.rtf with annotations of type Field -
Uses of Field in org.apache.tika.parser.mp3
Methods in org.apache.tika.parser.mp3 with annotations of type FieldModifier and TypeMethodDescriptionvoidMp3Parser.setMaxRecordSize(int maxRecordSize) This statically sets the max record size inID3v2Frame -
Uses of Field in org.apache.tika.parser.ocr
Methods in org.apache.tika.parser.ocr with annotations of type FieldModifier and TypeMethodDescriptionvoidTesseractOCRParser.setApplyRotation(boolean applyRotation) voidTesseractOCRParser.setColorspace(String colorspace) voidTesseractOCRParser.setDensity(int density) voidTesseractOCRParser.setDepth(int depth) voidTesseractOCRParser.setEnableImagePreprocessing(boolean enableImagePreprocessing) voidvoidTesseractOCRParser.setImageMagickPath(String imageMagickPath) Set the path to the ImageMagick executable directory, needed if it is not on system path.voidTesseractOCRParser.setInlineContent(boolean inlineContent) voidTesseractOCRParser.setLanguage(String language) voidTesseractOCRParser.setMaxFileSizeToOcr(long maxFileSizeToOcr) voidTesseractOCRParser.setMinFileSizeToOcr(long minFileSizeToOcr) voidTesseractOCRParser.setOtherTesseractSettings(List<String> settings) voidTesseractOCRParser.setOutputType(String outputType) voidTesseractOCRParser.setPageSegMode(String pageSegMode) voidTesseractOCRParser.setPreloadLangs(boolean preloadLangs) If set totrueand if tesseract is found, this will load the langs that result from --list-langs.voidTesseractOCRParser.setPreserveInterwordSpacing(boolean preserveInterwordSpacing) voidTesseractOCRParser.setResize(int resize) voidTesseractOCRParser.setSkipOCR(boolean skipOCR) voidTesseractOCRParser.setTessdataPath(String tessdataPath) Set the path to the 'tessdata' folder, which contains language files and config files.voidTesseractOCRParser.setTesseractPath(String tesseractPath) Set the path to the Tesseract executable's directory, needed if it is not on system path.voidTesseractOCRParser.setTimeout(int timeout) Set default timeout in seconds. -
Uses of Field in org.apache.tika.parser.odf
Methods in org.apache.tika.parser.odf with annotations of type FieldModifier and TypeMethodDescriptionvoidFlatOpenDocumentParser.setExtractMacros(boolean extractMacros) voidOpenDocumentParser.setExtractMacros(boolean extractMacros) -
Uses of Field in org.apache.tika.parser.pdf
Methods in org.apache.tika.parser.pdf with annotations of type FieldModifier and TypeMethodDescriptionvoidPDFParser.setAllowExtractionForAccessibility(boolean allowExtractionForAccessibility) voidPDFParser.setAverageCharTolerance(float averageCharTolerance) voidPDFParser.setCatchIntermediateExceptions(boolean catchIntermediateExceptions) voidPDFParser.setDetectAngles(boolean detectAngles) voidPDFParser.setDropThreshold(float dropThreshold) voidPDFParser.setEnableAutoSpace(boolean v) If true (the default), the parser should estimate where spaces should be inserted between words.voidPDFParser.setExtractAcroFormContent(boolean extractAcroFormContent) voidPDFParser.setExtractActions(boolean extractActions) voidPDFParser.setExtractAnnotationText(boolean v) If true (the default), text in annotations will be extracted.voidPDFParser.setExtractBookmarksText(boolean extractBookmarksText) voidPDFParser.setExtractFontNames(boolean extractFontNames) voidPDFParser.setExtractIncrementalUpdateInfo(boolean setExtractIncrementalUpdateInfo) Whether or not to scan a PDF for incremental updates.voidPDFParser.setExtractInlineImageMetadataOnly(boolean extractInlineImageMetadataOnly) voidPDFParser.setExtractInlineImages(boolean extractInlineImages) voidPDFParser.setExtractMarkedContent(boolean extractMarkedContent) voidPDFParser.setExtractUniqueInlineImagesOnly(boolean extractUniqueInlineImagesOnly) voidPDFParser.setIfXFAExtractOnlyXFA(boolean ifXFAExtractOnlyXFA) voidPDFParser.setIgnoreContentStreamSpaceGlyphs(boolean v) If true, the parser should ignore spaces in the content stream and rely purely on the algorithm to determine where word breaks are (PDFBOX-3774).voidPDFParser.setImageGraphicsEngineFactory(ImageGraphicsEngineFactory imageGraphicsEngineFactory) voidPDFParser.setImageStrategy(String imageStrategy) voidPDFParser.setMaxIncrementalUpdates(int maxIncrementalUpdates) Set the maximum number of incremental updates to parsevoidPDFParser.setMaxMainMemoryBytes(long maxMainMemoryBytes) voidPDFParser.setOcrDPI(int dpi) voidPDFParser.setOcrImageFormatName(String formatName) voidPDFParser.setOcrImageQuality(float imageQuality) voidPDFParser.setOcrImageType(String imageType) voidPDFParser.setOcrRenderingStrategy(String ocrRenderingStrategy) voidPDFParser.setOcrStrategy(String ocrStrategyString) voidPDFParser.setOcrStrategyAuto(String ocrStrategyAuto) voidPDFParser.setParseIncrementalUpdates(boolean parseIncrementalUpdates) If set to true, this will parse incremental updates if they exist within a PDF.voidPDFParser.setSetKCMS(boolean setKCMS) voidPDFParser.setSortByPosition(boolean v) If true, sort text tokens by their x/y position before extracting text.voidPDFParser.setSpacingTolerance(float spacingTolerance) voidPDFParser.setSuppressDuplicateOverlappingText(boolean v) If true, the parser should try to remove duplicated text over the same region.voidPDFParser.setThrowOnEncryptedPayload(boolean throwOnEncryptedPayload) If the file is a 'Collection' and contains an embedded file with a defined 'AssociatedFile' value of 'EncryptedPayload', then throw anEncryptedDocumentException. -
Uses of Field in org.apache.tika.parser.pkg
Methods in org.apache.tika.parser.pkg with annotations of type FieldModifier and TypeMethodDescriptionvoidCompressorParser.setDecompressConcatenated(boolean decompressConcatenated) voidPackageParser.setDetectCharsetsInEntryNames(boolean detectCharsetsInEntryNames) Whether or not to run the default charset detector against entry names in ZipFiles.voidCompressorParser.setMemoryLimitInKb(int memoryLimitInKb) -
Uses of Field in org.apache.tika.parser.recognition
Methods in org.apache.tika.parser.recognition with annotations of type FieldModifier and TypeMethodDescriptionvoidObjectRecognitionParser.setRecogniser(String recogniserClass) -
Uses of Field in org.apache.tika.parser.recognition.tf
Fields in org.apache.tika.parser.recognition.tf with annotations of type FieldModifier and TypeFieldDescriptionprotected URITensorflowRESTRecogniser.apiBaseUriprotected doubleTensorflowRESTRecogniser.minConfidenceprotected intTensorflowRESTRecogniser.topN -
Uses of Field in org.apache.tika.parser.strings
Methods in org.apache.tika.parser.strings with annotations of type FieldModifier and TypeMethodDescriptionvoidStringsParser.setEncoding(String encoding) voidStringsParser.setMinLength(int minLength) voidStringsParser.setStringsPath(String path) Sets the "strings" installation folder.voidStringsParser.setTimeoutSeconds(int timeoutSeconds) -
Uses of Field in org.apache.tika.parser.transcribe.aws
Methods in org.apache.tika.parser.transcribe.aws with annotations of type FieldModifier and TypeMethodDescriptionvoidSets the client secret for the transcriber API.voidAmazonTranscribe.setClientId(String id) Sets the client Id for the transcriber API.voidAmazonTranscribe.setClientSecret(String secret) Sets the client secret for the transcriber API.void -
Uses of Field in org.apache.tika.parser.txt
Methods in org.apache.tika.parser.txt with annotations of type FieldModifier and TypeMethodDescriptionvoidIcu4jEncodingDetector.setIgnoreCharsets(List<String> charsetsToIgnore) voidIcu4jEncodingDetector.setMarkLimit(int markLimit) How far into the stream to read for charset detection.voidUniversalEncodingDetector.setMarkLimit(int markLimit) How far into the stream to read for charset detection.voidIcu4jEncodingDetector.setStripMarkup(boolean stripMarkup) Whether or not to attempt to strip html-ish markup from the stream before sending it to the underlying detector. -
Uses of Field in org.apache.tika.parser.wordperfect
Methods in org.apache.tika.parser.wordperfect with annotations of type FieldModifier and TypeMethodDescriptionvoidWordPerfectParser.setIncludeDeletedContent(boolean includeDeletedContent) Whether or not to include deleted content. -
Uses of Field in org.apache.tika.pipes
Methods in org.apache.tika.pipes with annotations of type FieldModifier and TypeMethodDescriptionvoidCompositePipesReporter.addPipesReporter(PipesReporter pipesReporter) voidPipesReporterBase.setExcludes(List<String> excludes) voidPipesReporterBase.setIncludes(List<String> includes) -
Uses of Field in org.apache.tika.pipes.emitter.azblob
Methods in org.apache.tika.pipes.emitter.azblob with annotations of type FieldModifier and TypeMethodDescriptionvoidAZBlobEmitter.setContainer(String container) voidAZBlobEmitter.setEndpoint(String endpoint) voidAZBlobEmitter.setFileExtension(String fileExtension) If you want to customize the output file's file extension.voidAZBlobEmitter.setOverwriteExisting(boolean overwriteExisting) voidvoidAZBlobEmitter.setSasToken(String sasToken) -
Uses of Field in org.apache.tika.pipes.emitter.fs
Methods in org.apache.tika.pipes.emitter.fs with annotations of type FieldModifier and TypeMethodDescriptionvoidFileSystemEmitter.setBasePath(String basePath) voidFileSystemEmitter.setFileExtension(String fileExtension) If you want to customize the output file's file extension.voidFileSystemEmitter.setOnExists(String onExists) What to do if the target file already exists.voidFileSystemEmitter.setPrettyPrint(boolean prettyPrint) -
Uses of Field in org.apache.tika.pipes.emitter.gcs
Methods in org.apache.tika.pipes.emitter.gcs with annotations of type FieldModifier and TypeMethodDescriptionvoidvoidGCSEmitter.setFileExtension(String fileExtension) If you want to customize the output file's file extension.voidvoidGCSEmitter.setProjectId(String projectId) -
Uses of Field in org.apache.tika.pipes.emitter.jdbc
Methods in org.apache.tika.pipes.emitter.jdbc with annotations of type FieldModifier and TypeMethodDescriptionvoidJDBCEmitter.setAttachmentStrategy(String attachmentStrategy) voidJDBCEmitter.setConnection(String connection) voidJDBCEmitter.setCreateTable(String createTable) voidvoidThe implementation of keys should be a LinkedHashMap because order matters!voidJDBCEmitter.setMaxStringLength(int maxStringLength) Set the maximum string length in characters (not bytes).voidJDBCEmitter.setMultivaluedFieldDelimiter(String delimiter) voidJDBCEmitter.setMultivaluedFieldStrategy(String strategy) This applies to fields of type 'string' or 'varchar'.voidJDBCEmitter.setPostConnection(String postConnection) This sql will be called immediately after the connection is made. -
Uses of Field in org.apache.tika.pipes.emitter.kafka
Methods in org.apache.tika.pipes.emitter.kafka with annotations of type FieldModifier and TypeMethodDescriptionvoidvoidKafkaEmitter.setBootstrapServers(String bootstrapServers) voidKafkaEmitter.setBufferMemory(int bufferMemory) voidKafkaEmitter.setClientId(String clientId) voidKafkaEmitter.setCompressionType(String compressionType) voidKafkaEmitter.setConnectionsMaxIdleMs(int connectionsMaxIdleMs) voidKafkaEmitter.setDeliveryTimeoutMs(int deliveryTimeoutMs) voidKafkaEmitter.setEnableIdempotence(boolean enableIdempotence) voidKafkaEmitter.setInterceptorClasses(String interceptorClasses) voidKafkaEmitter.setKeySerializer(String keySerializer) voidKafkaEmitter.setLingerMs(int lingerMs) voidKafkaEmitter.setMaxBlockMs(int maxBlockMs) voidKafkaEmitter.setMaxInFlightRequestsPerConnection(int maxInFlightRequestsPerConnection) voidKafkaEmitter.setMaxRequestSize(int maxRequestSize) voidKafkaEmitter.setMetadataMaxAgeMs(int metadataMaxAgeMs) voidKafkaEmitter.setRequestTimeoutMs(int requestTimeoutMs) voidKafkaEmitter.setRetries(int retries) voidKafkaEmitter.setRetryBackoffMs(int retryBackoffMs) voidvoidKafkaEmitter.setTransactionalId(String transactionalId) voidKafkaEmitter.setTransactionTimeoutMs(int transactionTimeoutMs) voidKafkaEmitter.setValueSerializer(String valueSerializer) -
Uses of Field in org.apache.tika.pipes.emitter.opensearch
Methods in org.apache.tika.pipes.emitter.opensearch with annotations of type FieldModifier and TypeMethodDescriptionvoidOpenSearchEmitter.setAttachmentStrategy(String attachmentStrategy) Options: SEPARATE_DOCUMENTS, PARENT_CHILD.voidOpenSearchEmitter.setAuthScheme(String authScheme) voidOpenSearchEmitter.setCommitWithin(int commitWithin) voidOpenSearchEmitter.setConnectionTimeout(int connectionTimeout) voidOpenSearchEmitter.setEmbeddedFileFieldName(String embeddedFileFieldName) If using theOpenSearchEmitter.AttachmentStrategy.PARENT_CHILD, this is the field name used to store the child documents.voidOpenSearchEmitter.setIdField(String idField) Specify the field in the first Metadata that should be used as the id field for the document.voidOpenSearchEmitter.setOpenSearchUrl(String openSearchUrl) voidOpenSearchEmitter.setPassword(String password) voidOpenSearchEmitter.setProxyHost(String proxyHost) voidOpenSearchEmitter.setProxyPort(int proxyPort) voidOpenSearchEmitter.setSocketTimeout(int socketTimeout) voidOpenSearchEmitter.setUserName(String userName) -
Uses of Field in org.apache.tika.pipes.emitter.s3
Methods in org.apache.tika.pipes.emitter.s3 with annotations of type FieldModifier and TypeMethodDescriptionvoidS3Emitter.setAccessKey(String accessKey) voidvoidS3Emitter.setCredentialsProvider(String credentialsProvider) voidS3Emitter.setEndpointConfigurationService(String endpointConfigurationService) voidS3Emitter.setFileExtension(String fileExtension) If you want to customize the output file's file extension.voidS3Emitter.setMaxConnections(int maxConnections) maximum number of http connections allowed.voidS3Emitter.setPathStyleAccessEnabled(boolean pathStyleAccessEnabled) voidvoidS3Emitter.setProfile(String profile) voidvoidS3Emitter.setSecretKey(String secretKey) voidS3Emitter.setSpoolToTemp(boolean spoolToTemp) Whether or not to spool the metadatalist to a tmp file before putting object. -
Uses of Field in org.apache.tika.pipes.emitter.solr
Methods in org.apache.tika.pipes.emitter.solr with annotations of type FieldModifier and TypeMethodDescriptionvoidSolrEmitter.setAttachmentStrategy(String attachmentStrategy) Options: SKIP, CONCATENATE_CONTENT, PARENT_CHILD.voidSolrEmitter.setAuthScheme(String authScheme) voidSolrEmitter.setCommitWithin(int commitWithin) voidSolrEmitter.setConnectionTimeout(int connectionTimeout) voidSolrEmitter.setEmbeddedFileFieldName(String embeddedFileFieldName) If using theSolrEmitter.AttachmentStrategy.PARENT_CHILD, this is the field name used to store the child documents.voidSolrEmitter.setIdField(String idField) Specify the field in the first Metadata that should be used as the id field for the document.voidSolrEmitter.setPassword(String password) voidSolrEmitter.setProxyHost(String proxyHost) voidSolrEmitter.setProxyPort(int proxyPort) voidSolrEmitter.setSocketTimeout(int socketTimeout) voidSolrEmitter.setSolrCollection(String solrCollection) voidSolrEmitter.setSolrUrls(List<String> solrUrls) voidSolrEmitter.setSolrZkChroot(String solrZkChroot) voidSolrEmitter.setSolrZkHosts(List<String> solrZkHosts) voidSolrEmitter.setUpdateStrategy(String updateStrategy) voidSolrEmitter.setUserName(String userName) -
Uses of Field in org.apache.tika.pipes.fetcher
Methods in org.apache.tika.pipes.fetcher with annotations of type Field -
Uses of Field in org.apache.tika.pipes.fetcher.azblob
Methods in org.apache.tika.pipes.fetcher.azblob with annotations of type FieldModifier and TypeMethodDescriptionvoidAZBlobFetcher.setContainer(String container) voidAZBlobFetcher.setEndpoint(String endpoint) voidAZBlobFetcher.setExtractUserMetadata(boolean extractUserMetadata) Whether or not to extract user metadata from the blob objectvoidAZBlobFetcher.setSasToken(String sasToken) voidAZBlobFetcher.setSpoolToTemp(boolean spoolToTemp) -
Uses of Field in org.apache.tika.pipes.fetcher.fs
Methods in org.apache.tika.pipes.fetcher.fs with annotations of type FieldModifier and TypeMethodDescriptionvoidFileSystemFetcher.setBasePath(String basePath) Default behavior si that clients will send in relative paths, this must be set to allow this fetcher to fetch the full path.voidFileSystemFetcher.setExtractFileSystemMetadata(boolean extractFileSystemMetadata) Extract file system metadata (created, modified, accessed) when fetching file. -
Uses of Field in org.apache.tika.pipes.fetcher.gcs
Methods in org.apache.tika.pipes.fetcher.gcs with annotations of type FieldModifier and TypeMethodDescriptionvoidvoidGCSFetcher.setExtractUserMetadata(boolean extractUserMetadata) Whether or not to extract user metadata from the S3ObjectvoidGCSFetcher.setProjectId(String projectId) voidGCSFetcher.setSpoolToTemp(boolean spoolToTemp) -
Uses of Field in org.apache.tika.pipes.fetcher.http
Methods in org.apache.tika.pipes.fetcher.http with annotations of type FieldModifier and TypeMethodDescriptionvoidHttpFetcher.setAuthScheme(String authScheme) voidHttpFetcher.setConnectTimeout(int connectTimeout) voidHttpFetcher.setHttpHeaders(List<String> headers) Which http headers should we capture in the metadata.voidHttpFetcher.setHttpRequestHeaders(List<String> headers) Which http request headers should we send in the http fetch requests.voidHttpFetcher.setJwtExpiresInSeconds(int jwtExpiresInSeconds) voidHttpFetcher.setJwtIssuer(String jwtIssuer) voidHttpFetcher.setJwtPrivateKeyBase64(String jwtPrivateKeyBase64) voidHttpFetcher.setJwtSecret(String jwtSecret) voidHttpFetcher.setJwtSubject(String jwtSubject) voidHttpFetcher.setMaxConnections(int maxConnections) voidHttpFetcher.setMaxConnectionsPerRoute(int maxConnectionsPerRoute) voidHttpFetcher.setMaxErrMsgSize(int maxErrMsgSize) voidHttpFetcher.setMaxRedirects(int maxRedirects) voidHttpFetcher.setMaxSpoolSize(long maxSpoolSize) Set the maximum number of bytes to spool to a temp file.voidHttpFetcher.setNtDomain(String domain) voidHttpFetcher.setOverallTimeout(long overallTimeout) This sets an overall timeout on the request.voidHttpFetcher.setPassword(String password) voidHttpFetcher.setProxyHost(String proxyHost) voidHttpFetcher.setProxyPort(int proxyPort) voidHttpFetcher.setRequestTimeout(int requestTimeout) voidHttpFetcher.setSocketTimeout(int socketTimeout) voidHttpFetcher.setUserAgent(String userAgent) When making the request, what User-Agent is sent in the request.voidHttpFetcher.setUserName(String userName) -
Uses of Field in org.apache.tika.pipes.fetcher.s3
Methods in org.apache.tika.pipes.fetcher.s3 with annotations of type FieldModifier and TypeMethodDescriptionvoidS3Fetcher.setAccessKey(String accessKey) voidvoidS3Fetcher.setCredentialsProvider(String credentialsProvider) voidS3Fetcher.setEndpointConfigurationService(String endpointConfigurationService) voidS3Fetcher.setExtractUserMetadata(boolean extractUserMetadata) Whether or not to extract user metadata from the S3ObjectvoidS3Fetcher.setMaxConnections(int maxConnections) voidS3Fetcher.setMaxLength(long maxLength) voidS3Fetcher.setPathStyleAccessEnabled(boolean pathStyleAccessEnabled) voidprefix to prepend to the fetch key before fetching.voidS3Fetcher.setProfile(String profile) voidvoidS3Fetcher.setSecretKey(String secretKey) voidS3Fetcher.setSleepBeforeRetryMillis(long sleepBeforeRetryMillis) Deprecated.voidS3Fetcher.setSpoolToTemp(boolean spoolToTemp) voidS3Fetcher.setThrottleSeconds(String commaDelimitedLongs) Set seconds to throttle retries as a comma-delimited list, e.g.: 30,60,120,600 -
Uses of Field in org.apache.tika.pipes.fetchers.microsoftgraph
Methods in org.apache.tika.pipes.fetchers.microsoftgraph with annotations of type FieldModifier and TypeMethodDescriptionvoidMicrosoftGraphFetcher.setThrottleSeconds(String commaDelimitedLongs) Set seconds to throttle retries as a comma-delimited list, e.g.: 30,60,120,600 -
Uses of Field in org.apache.tika.pipes.pipesiterator
Methods in org.apache.tika.pipes.pipesiterator with annotations of type FieldModifier and TypeMethodDescriptionvoidPipesIterator.setEmitterName(String emitterName) voidPipesIterator.setFetcherName(String fetcherName) voidPipesIterator.setHandlerType(String handlerType) voidPipesIterator.setMaxEmbeddedResources(int maxEmbeddedResources) voidPipesIterator.setMaxWaitMs(long maxWaitMs) voidPipesIterator.setOnParseException(String onParseException) voidPipesIterator.setParseMode(String parseModeString) voidPipesIterator.setQueueSize(int queueSize) voidPipesIterator.setThrowOnWriteLimitReached(boolean throwOnWriteLimitReached) voidPipesIterator.setWriteLimit(int writeLimit) -
Uses of Field in org.apache.tika.pipes.pipesiterator.azblob
Methods in org.apache.tika.pipes.pipesiterator.azblob with annotations of type FieldModifier and TypeMethodDescriptionvoidAZBlobPipesIterator.setContainer(String container) voidAZBlobPipesIterator.setEndpoint(String endpoint) voidvoidAZBlobPipesIterator.setSasToken(String sasToken) -
Uses of Field in org.apache.tika.pipes.pipesiterator.csv
Methods in org.apache.tika.pipes.pipesiterator.csv with annotations of type FieldModifier and TypeMethodDescriptionvoidCSVPipesIterator.setCsvPath(String csvPath) voidCSVPipesIterator.setCsvPath(Path csvPath) voidCSVPipesIterator.setEmitKeyColumn(String emitKeyColumn) voidCSVPipesIterator.setFetchKeyColumn(String fetchKeyColumn) voidCSVPipesIterator.setIdColumn(String idColumn) -
Uses of Field in org.apache.tika.pipes.pipesiterator.filelist
Methods in org.apache.tika.pipes.pipesiterator.filelist with annotations of type FieldModifier and TypeMethodDescriptionvoidFileListPipesIterator.setFileList(String path) voidFileListPipesIterator.setHasHeader(boolean hasHeader) -
Uses of Field in org.apache.tika.pipes.pipesiterator.fs
Methods in org.apache.tika.pipes.pipesiterator.fs with annotations of type FieldModifier and TypeMethodDescriptionvoidFileSystemPipesIterator.setBasePath(String basePath) voidFileSystemPipesIterator.setCountTotal(boolean countTotal) -
Uses of Field in org.apache.tika.pipes.pipesiterator.gcs
Methods in org.apache.tika.pipes.pipesiterator.gcs with annotations of type Field -
Uses of Field in org.apache.tika.pipes.pipesiterator.jdbc
Methods in org.apache.tika.pipes.pipesiterator.jdbc with annotations of type FieldModifier and TypeMethodDescriptionvoidJDBCPipesIterator.setConnection(String connection) voidJDBCPipesIterator.setEmitKeyColumn(String fetchKeyColumn) voidJDBCPipesIterator.setFetchKeyColumn(String fetchKeyColumn) voidJDBCPipesIterator.setFetchKeyRangeEndColumn(String fetchKeyRangeEndColumn) voidJDBCPipesIterator.setFetchKeyRangeStartColumn(String fetchKeyRangeStartColumn) voidJDBCPipesIterator.setFetchSize(int fetchSize) voidJDBCPipesIterator.setIdColumn(String idColumn) void -
Uses of Field in org.apache.tika.pipes.pipesiterator.kafka
Methods in org.apache.tika.pipes.pipesiterator.kafka with annotations of type FieldModifier and TypeMethodDescriptionvoidKafkaPipesIterator.setAutoOffsetReset(String autoOffsetReset) voidKafkaPipesIterator.setBootstrapServers(String bootstrapServers) voidKafkaPipesIterator.setEmitMax(int emitMax) If the kafka pipe iterator will keep polling for more documents until it returns an empty result.voidKafkaPipesIterator.setGroupId(String groupId) voidKafkaPipesIterator.setGroupInitialRebalanceDelayMs(int groupInitialRebalanceDelayMs) voidKafkaPipesIterator.setKeySerializer(String keySerializer) voidKafkaPipesIterator.setPollDelayMs(int pollDelayMs) voidvoidKafkaPipesIterator.setValueSerializer(String valueSerializer) -
Uses of Field in org.apache.tika.pipes.pipesiterator.s3
Methods in org.apache.tika.pipes.pipesiterator.s3 with annotations of type FieldModifier and TypeMethodDescriptionvoidS3PipesIterator.setAccessKey(String accessKey) voidvoidS3PipesIterator.setCredentialsProvider(String credentialsProvider) voidS3PipesIterator.setEndpointConfigurationService(String endpointConfigurationService) voidS3PipesIterator.setFileNamePattern(String fileNamePattern) voidS3PipesIterator.setFileNamePattern(Pattern fileNamePattern) voidS3PipesIterator.setMaxConnections(int maxConnections) voidS3PipesIterator.setPathStyleAccessEnabled(boolean pathStyleAccessEnabled) voidvoidS3PipesIterator.setProfile(String profile) voidvoidS3PipesIterator.setSecretKey(String secretKey) -
Uses of Field in org.apache.tika.pipes.pipesiterator.solr
Methods in org.apache.tika.pipes.pipesiterator.solr with annotations of type FieldModifier and TypeMethodDescriptionvoidSolrPipesIterator.setAdditionalFields(List<String> additionalFields) voidSolrPipesIterator.setAuthScheme(String authScheme) voidSolrPipesIterator.setConnectionTimeout(int connectionTimeout) voidSolrPipesIterator.setFailCountField(String failCountField) voidSolrPipesIterator.setFilters(List<String> filters) voidSolrPipesIterator.setIdField(String idField) voidSolrPipesIterator.setParsingIdField(String parsingIdField) voidSolrPipesIterator.setPassword(String password) voidSolrPipesIterator.setProxyHost(String proxyHost) voidSolrPipesIterator.setProxyPort(int proxyPort) voidSolrPipesIterator.setRows(int rows) voidSolrPipesIterator.setSizeFieldName(String sizeFieldName) voidSolrPipesIterator.setSocketTimeout(int socketTimeout) voidSolrPipesIterator.setSolrCollection(String solrCollection) voidSolrPipesIterator.setSolrUrls(List<String> solrUrls) voidSolrPipesIterator.setSolrZkChroot(String solrZkChroot) voidSolrPipesIterator.setSolrZkHosts(List<String> solrZkHosts) voidSolrPipesIterator.setUserName(String userName) -
Uses of Field in org.apache.tika.pipes.reporters.fs
Methods in org.apache.tika.pipes.reporters.fs with annotations of type FieldModifier and TypeMethodDescriptionvoidFileSystemStatusReporter.setReportUpdateMillis(long millis) voidFileSystemStatusReporter.setStatusFile(String path) -
Uses of Field in org.apache.tika.pipes.reporters.jdbc
Methods in org.apache.tika.pipes.reporters.jdbc with annotations of type FieldModifier and TypeMethodDescriptionvoidJDBCPipesReporter.setCacheSize(int cacheSize) Commit the reports if the cache is greater than or equal to this size.voidJDBCPipesReporter.setConnection(String connection) voidJDBCPipesReporter.setCreateTable(boolean createTable) The default is true.voidJDBCPipesReporter.setPostConnection(String postConnection) This sql will be called immediately after the connection is made.voidJDBCPipesReporter.setReportSql(String reportSql) This is the sql for the prepared statement to execute to store the report record. the default is:insert into tika_status (id, status, timestamp) values (?voidJDBCPipesReporter.setReportVariables(List<String> variables) ADVANCED: This is used to set the variables in the prepared statement for the report.voidJDBCPipesReporter.setReportWithinMs(long reportWithinMs) Commit the reports if the amount of time elapsed since the last report commit exceeds this value.voidJDBCPipesReporter.setTableName(String tableName) The default isJDBCPipesReporter.TABLE_NAME -
Uses of Field in org.apache.tika.pipes.reporters.opensearch
Methods in org.apache.tika.pipes.reporters.opensearch with annotations of type FieldModifier and TypeMethodDescriptionvoidOpenSearchPipesReporter.setAuthScheme(String authScheme) voidOpenSearchPipesReporter.setConnectionTimeout(int connectionTimeout) voidOpenSearchPipesReporter.setExcludeStatuses(List<String> statusList) voidOpenSearchPipesReporter.setIncludeRouting(boolean includeRouting) voidOpenSearchPipesReporter.setIncludeStatuses(List<String> statusList) voidOpenSearchPipesReporter.setKeyPrefix(String keyPrefix) This prefixes the keys before sending them to OpenSearch.voidOpenSearchPipesReporter.setOpenSearchUrl(String openSearchUrl) voidOpenSearchPipesReporter.setPassword(String password) voidOpenSearchPipesReporter.setProxyHost(String proxyHost) voidOpenSearchPipesReporter.setProxyPort(int proxyPort) voidOpenSearchPipesReporter.setSocketTimeout(int socketTimeout) voidOpenSearchPipesReporter.setUserName(String userName)
S3Fetcher.setThrottleSeconds(String)