Class GeminiVLMParser

java.lang.Object
org.apache.tika.parser.vlm.AbstractVLMParser
org.apache.tika.parser.vlm.GeminiVLMParser
All Implemented Interfaces:
Serializable, Initializable, SelfConfiguring, Parser

public class GeminiVLMParser extends AbstractVLMParser
VLM parser for the Google Gemini generateContent API.

Supports both images and PDFs natively (Gemini processes PDFs with native vision, understanding layout, charts, tables, and diagrams — not just extracting text).

The API key is sent as a key query parameter (not a Bearer header).

Default base URL points to the public Gemini API; change it for Vertex AI or a proxy.

Configuration key: "gemini-vlm-parser"

Since:
Apache Tika 4.0
See Also: