Package org.apache.tika.extractor
Class RUnpackExtractorFactory
- java.lang.Object
-
- org.apache.tika.extractor.RUnpackExtractorFactory
-
- All Implemented Interfaces:
Serializable
,EmbeddedDocumentByteStoreExtractorFactory
,EmbeddedDocumentExtractorFactory
public class RUnpackExtractorFactory extends Object implements EmbeddedDocumentByteStoreExtractorFactory
- See Also:
- Serialized Form
-
-
Field Summary
Fields Modifier and Type Field Description static long
DEFAULT_MAX_EMBEDDED_BYTES_FOR_EXTRACTION
-
Constructor Summary
Constructors Constructor Description RUnpackExtractorFactory()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description EmbeddedDocumentExtractor
newInstance(Metadata metadata, ParseContext parseContext)
void
setEmbeddedBytesExcludeEmbeddedResourceTypes(List<String> excludeAttachmentTypes)
void
setEmbeddedBytesExcludeMimeTypes(List<String> excludeMimeTypes)
void
setEmbeddedBytesIncludeEmbeddedResourceTypes(List<String> includeAttachmentTypes)
void
setEmbeddedBytesIncludeMimeTypes(List<String> includeMimeTypes)
void
setMaxEmbeddedBytesForExtraction(long maxEmbeddedBytesForExtraction)
Total number of bytes to write out.void
setWriteFileNameToContent(boolean writeFileNameToContent)
-
-
-
Method Detail
-
setWriteFileNameToContent
@Field public void setWriteFileNameToContent(boolean writeFileNameToContent)
-
setEmbeddedBytesIncludeMimeTypes
@Field public void setEmbeddedBytesIncludeMimeTypes(List<String> includeMimeTypes)
-
setEmbeddedBytesExcludeMimeTypes
@Field public void setEmbeddedBytesExcludeMimeTypes(List<String> excludeMimeTypes)
-
setEmbeddedBytesIncludeEmbeddedResourceTypes
@Field public void setEmbeddedBytesIncludeEmbeddedResourceTypes(List<String> includeAttachmentTypes)
-
setEmbeddedBytesExcludeEmbeddedResourceTypes
@Field public void setEmbeddedBytesExcludeEmbeddedResourceTypes(List<String> excludeAttachmentTypes)
-
setMaxEmbeddedBytesForExtraction
@Field public void setMaxEmbeddedBytesForExtraction(long maxEmbeddedBytesForExtraction) throws TikaConfigException
Total number of bytes to write out. A good zip bomb may contain petabytes compressed into a few kb. Make sure that you can't fill up a disk! This does not include the container file in the count of bytes written out. This only counts the lengths of the embedded files.- Parameters:
maxEmbeddedBytesForExtraction
-- Throws:
TikaConfigException
-
newInstance
public EmbeddedDocumentExtractor newInstance(Metadata metadata, ParseContext parseContext)
- Specified by:
newInstance
in interfaceEmbeddedDocumentExtractorFactory
-
-