Advanced Topics

Table of Contents

This section covers advanced usage and internals of Apache Tika.

Most pages here are written from a Java-API perspective. Where a topic has a JSON-config or CLI equivalent, look first under Configuration (per-parser options), Tika Pipes (pipeline + Pipes-mode tuning), Tika Server (REST + server config), or Tika CLI (tika-app flags). The Setting Limits page is the model — it covers Java, JSON, and CLI side by side. Filing issues against specific advanced pages where the JSON/CLI equivalent isn’t documented yet helps us prioritize the gap.

Topics

Integration Testing