Package org.apache.tika.extractor
Class RUnpackExtractorFactory
java.lang.Object
org.apache.tika.extractor.RUnpackExtractorFactory
- All Implemented Interfaces:
Serializable
,EmbeddedDocumentByteStoreExtractorFactory
,EmbeddedDocumentExtractorFactory
public class RUnpackExtractorFactory
extends Object
implements EmbeddedDocumentByteStoreExtractorFactory
- See Also:
-
Field Summary
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionnewInstance
(Metadata metadata, ParseContext parseContext) void
setEmbeddedBytesExcludeEmbeddedResourceTypes
(List<String> excludeAttachmentTypes) void
setEmbeddedBytesExcludeMimeTypes
(List<String> excludeMimeTypes) void
setEmbeddedBytesIncludeEmbeddedResourceTypes
(List<String> includeAttachmentTypes) void
setEmbeddedBytesIncludeMimeTypes
(List<String> includeMimeTypes) void
setMaxEmbeddedBytesForExtraction
(long maxEmbeddedBytesForExtraction) Total number of bytes to write out.void
setWriteFileNameToContent
(boolean writeFileNameToContent)
-
Field Details
-
DEFAULT_MAX_EMBEDDED_BYTES_FOR_EXTRACTION
public static long DEFAULT_MAX_EMBEDDED_BYTES_FOR_EXTRACTION
-
-
Constructor Details
-
RUnpackExtractorFactory
public RUnpackExtractorFactory()
-
-
Method Details
-
setWriteFileNameToContent
-
setEmbeddedBytesIncludeMimeTypes
-
setEmbeddedBytesExcludeMimeTypes
-
setEmbeddedBytesIncludeEmbeddedResourceTypes
-
setEmbeddedBytesExcludeEmbeddedResourceTypes
-
setMaxEmbeddedBytesForExtraction
@Field public void setMaxEmbeddedBytesForExtraction(long maxEmbeddedBytesForExtraction) throws TikaConfigException Total number of bytes to write out. A good zip bomb may contain petabytes compressed into a few kb. Make sure that you can't fill up a disk! This does not include the container file in the count of bytes written out. This only counts the lengths of the embedded files.- Parameters:
maxEmbeddedBytesForExtraction
-- Throws:
TikaConfigException
-
newInstance
- Specified by:
newInstance
in interfaceEmbeddedDocumentExtractorFactory
-