Class UnpackConfig
java.lang.Object
org.apache.tika.pipes.core.extractor.UnpackConfig
- All Implemented Interfaces:
Serializable
- See Also:
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic enumstatic enumOutput format for UNPACK mode.static enumOutput mode for how embedded files are delivered.static enum -
Field Summary
FieldsModifier and TypeFieldDescriptionstatic final longDefault maximum bytes to unpack per file: 10 GB. -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionfinal booleanlongMaximum total bytes to unpack per file.Get the output format for UNPACK mode.Get the output mode for how embedded files are delivered.intinthashCode()booleanWhether to include full RMETA-style metadata in metadata.json.booleanWhether to include the metadata JSON for each embedded document in the zip file.booleanbooleanWhether to zip all embedded files into a single archive before emitting.voidsetEmbeddedIdPrefix(String embeddedIdPrefix) voidsetEmitKeyBase(String emitKeyBase) voidsetEmitter(String emitter) voidsetIncludeFullMetadata(boolean includeFullMetadata) voidsetIncludeMetadataInZip(boolean includeMetadataInZip) voidsetIncludeOriginal(boolean includeOriginal) voidsetKeyBaseStrategy(String keyBaseStrategy) voidsetKeyBaseStrategy(UnpackConfig.KEY_BASE_STRATEGY keyBaseStrategy) voidsetMaxUnpackBytes(long maxUnpackBytes) voidsetOutputFormat(String outputFormat) voidsetOutputFormat(UnpackConfig.OUTPUT_FORMAT outputFormat) voidsetOutputMode(String outputMode) voidsetOutputMode(UnpackConfig.OUTPUT_MODE outputMode) voidsetSuffixStrategy(String suffixStrategy) voidsetSuffixStrategy(UnpackConfig.SUFFIX_STRATEGY suffixStrategy) voidsetZeroPadName(int zeroPadName) voidsetZipEmbeddedFiles(boolean zipEmbeddedFiles) toString()
-
Field Details
-
DEFAULT_MAX_UNPACK_BYTES
public static final long DEFAULT_MAX_UNPACK_BYTESDefault maximum bytes to unpack per file: 10 GB. Use -1 to disable the limit (not recommended).- See Also:
-
-
Constructor Details
-
UnpackConfig
public UnpackConfig()Create an UnpackConfig with default settings.
-
-
Method Details
-
getZeroPadName
public int getZeroPadName() -
getSuffixStrategy
-
getKeyBaseStrategy
-
getEmbeddedIdPrefix
-
getEmitter
-
isIncludeOriginal
public boolean isIncludeOriginal() -
setZeroPadName
public void setZeroPadName(int zeroPadName) -
setSuffixStrategy
-
setSuffixStrategy
-
setKeyBaseStrategy
-
setKeyBaseStrategy
-
setEmbeddedIdPrefix
-
setEmitter
-
setIncludeOriginal
public void setIncludeOriginal(boolean includeOriginal) -
setEmitKeyBase
-
getEmitKeyBase
-
isZipEmbeddedFiles
public boolean isZipEmbeddedFiles()Whether to zip all embedded files into a single archive before emitting. When true, embedded files are collected during parsing and then zipped and emitted as a single archive after parsing completes. -
setZipEmbeddedFiles
public void setZipEmbeddedFiles(boolean zipEmbeddedFiles) -
isIncludeMetadataInZip
public boolean isIncludeMetadataInZip()Whether to include the metadata JSON for each embedded document in the zip file. Only applicable whenisZipEmbeddedFiles()is true. -
setIncludeMetadataInZip
public void setIncludeMetadataInZip(boolean includeMetadataInZip) -
getMaxUnpackBytes
public long getMaxUnpackBytes()Maximum total bytes to unpack per file. Default is 10GB. Set to -1 to disable the limit (not recommended).- Returns:
- max bytes to unpack, or -1 if no limit
-
setMaxUnpackBytes
public void setMaxUnpackBytes(long maxUnpackBytes) -
getOutputFormat
Get the output format for UNPACK mode. REGULAR is the default (existing behavior). FRICTIONLESS creates a Frictionless Data Package with datapackage.json manifest. -
setOutputFormat
-
setOutputFormat
-
getOutputMode
Get the output mode for how embedded files are delivered. ZIPPED packages all files into a single zip archive. DIRECTORY emits files directly to the configured emitter. -
setOutputMode
-
setOutputMode
-
isIncludeFullMetadata
public boolean isIncludeFullMetadata()Whether to include full RMETA-style metadata in metadata.json. Only applicable when outputFormat is FRICTIONLESS. -
setIncludeFullMetadata
public void setIncludeFullMetadata(boolean includeFullMetadata) -
toString
-
equals
-
hashCode
public int hashCode()
-