Class PDFBoxRenderer
- java.lang.Object
-
- org.apache.tika.renderer.pdf.pdfbox.PDFBoxRenderer
-
- All Implemented Interfaces:
Serializable,Initializable,PDDocumentRenderer,Renderer
public class PDFBoxRenderer extends Object implements PDDocumentRenderer, Initializable
- See Also:
- Serialized Form
-
-
Field Summary
Fields Modifier and Type Field Description protected static org.slf4j.LoggerLOGstatic PropertyPDFBOX_IMAGE_WRITING_TIME_MSThis is the amount of time it takes for PDFBox/java to write the image after it has been rendered into a BufferedImage.static PropertyPDFBOX_RENDERING_TIME_MSThis is the amount of time it takes for PDFBox to render the page to a BufferedImage
-
Constructor Summary
Constructors Constructor Description PDFBoxRenderer()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description voidcheckInitialization(InitializableProblemHandler problemHandler)protected intgetDPI(ParseContext parseContext)protected StringgetImageFormatName(ParseContext parseContext)protected org.apache.pdfbox.rendering.ImageTypegetImageType(ParseContext parseContext)Set<MediaType>getSupportedTypes(ParseContext context)Returns the set of media types supported by this renderer when used with the given parse context.voidinitialize(Map<String,Param> params)RenderResultsrender(InputStream is, Metadata metadata, ParseContext parseContext, RenderRequest... requests)protected RenderResultrenderPage(org.apache.pdfbox.rendering.PDFRenderer renderer, int id, int pageNumber, Metadata metadata, ParseContext parseContext)voidsetDPI(int dpi)voidsetImageFormatName(String imageFormatName)voidsetImageType(org.apache.pdfbox.rendering.ImageType imageType)
-
-
-
Field Detail
-
LOG
protected static final org.slf4j.Logger LOG
-
PDFBOX_RENDERING_TIME_MS
public static Property PDFBOX_RENDERING_TIME_MS
This is the amount of time it takes for PDFBox to render the page to a BufferedImage
-
PDFBOX_IMAGE_WRITING_TIME_MS
public static Property PDFBOX_IMAGE_WRITING_TIME_MS
This is the amount of time it takes for PDFBox/java to write the image after it has been rendered into a BufferedImage. Some formats take much longer to encode than others.
-
-
Method Detail
-
getSupportedTypes
public Set<MediaType> getSupportedTypes(ParseContext context)
Description copied from interface:RendererReturns the set of media types supported by this renderer when used with the given parse context.- Specified by:
getSupportedTypesin interfaceRenderer- Parameters:
context- parse context- Returns:
- immutable set of media types
-
render
public RenderResults render(InputStream is, Metadata metadata, ParseContext parseContext, RenderRequest... requests) throws IOException, TikaException
- Specified by:
renderin interfaceRenderer- Throws:
IOExceptionTikaException
-
renderPage
protected RenderResult renderPage(org.apache.pdfbox.rendering.PDFRenderer renderer, int id, int pageNumber, Metadata metadata, ParseContext parseContext) throws IOException
- Throws:
IOException
-
initialize
public void initialize(Map<String,Param> params) throws TikaConfigException
- Specified by:
initializein interfaceInitializable- Parameters:
params- params to use for initialization- Throws:
TikaConfigException
-
checkInitialization
public void checkInitialization(InitializableProblemHandler problemHandler) throws TikaConfigException
- Specified by:
checkInitializationin interfaceInitializable- Parameters:
problemHandler- if there is a problem and no custom initializableProblemHandler has been configured via Initializable parameters, this is called to respond.- Throws:
TikaConfigException
-
setDPI
public void setDPI(int dpi)
-
setImageType
public void setImageType(org.apache.pdfbox.rendering.ImageType imageType)
-
setImageFormatName
public void setImageFormatName(String imageFormatName)
-
getDPI
protected int getDPI(ParseContext parseContext)
-
getImageType
protected org.apache.pdfbox.rendering.ImageType getImageType(ParseContext parseContext)
-
getImageFormatName
protected String getImageFormatName(ParseContext parseContext)
-
-