Package org.apache.tika.extractor
Class RUnpackExtractorFactory
- java.lang.Object
-
- org.apache.tika.extractor.RUnpackExtractorFactory
-
- All Implemented Interfaces:
Serializable,EmbeddedDocumentByteStoreExtractorFactory,EmbeddedDocumentExtractorFactory
public class RUnpackExtractorFactory extends Object implements EmbeddedDocumentByteStoreExtractorFactory
- See Also:
- Serialized Form
-
-
Field Summary
Fields Modifier and Type Field Description static longDEFAULT_MAX_EMBEDDED_BYTES_FOR_EXTRACTION
-
Constructor Summary
Constructors Constructor Description RUnpackExtractorFactory()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description EmbeddedDocumentExtractornewInstance(Metadata metadata, ParseContext parseContext)voidsetEmbeddedBytesExcludeEmbeddedResourceTypes(List<String> excludeAttachmentTypes)voidsetEmbeddedBytesExcludeMimeTypes(List<String> excludeMimeTypes)voidsetEmbeddedBytesIncludeEmbeddedResourceTypes(List<String> includeAttachmentTypes)voidsetEmbeddedBytesIncludeMimeTypes(List<String> includeMimeTypes)voidsetMaxEmbeddedBytesForExtraction(long maxEmbeddedBytesForExtraction)Total number of bytes to write out.voidsetWriteFileNameToContent(boolean writeFileNameToContent)
-
-
-
Method Detail
-
setWriteFileNameToContent
@Field public void setWriteFileNameToContent(boolean writeFileNameToContent)
-
setEmbeddedBytesIncludeMimeTypes
@Field public void setEmbeddedBytesIncludeMimeTypes(List<String> includeMimeTypes)
-
setEmbeddedBytesExcludeMimeTypes
@Field public void setEmbeddedBytesExcludeMimeTypes(List<String> excludeMimeTypes)
-
setEmbeddedBytesIncludeEmbeddedResourceTypes
@Field public void setEmbeddedBytesIncludeEmbeddedResourceTypes(List<String> includeAttachmentTypes)
-
setEmbeddedBytesExcludeEmbeddedResourceTypes
@Field public void setEmbeddedBytesExcludeEmbeddedResourceTypes(List<String> excludeAttachmentTypes)
-
setMaxEmbeddedBytesForExtraction
@Field public void setMaxEmbeddedBytesForExtraction(long maxEmbeddedBytesForExtraction) throws TikaConfigException
Total number of bytes to write out. A good zip bomb may contain petabytes compressed into a few kb. Make sure that you can't fill up a disk! This does not include the container file in the count of bytes written out. This only counts the lengths of the embedded files.- Parameters:
maxEmbeddedBytesForExtraction-- Throws:
TikaConfigException
-
newInstance
public EmbeddedDocumentExtractor newInstance(Metadata metadata, ParseContext parseContext)
- Specified by:
newInstancein interfaceEmbeddedDocumentExtractorFactory
-
-