JSON Plugin

The JSON plugin (tika-pipes-json) provides an iterator that reads work items from a JSON-lines file (one JSON object per line). It is iterator-only — pair it with a fetcher and emitter.

Interface Component name Class

Iterator

json-pipes-iterator

JsonPipesIterator

JSON Iterator (json-pipes-iterator)

Reads each line of a JSONL file as a work item and emits one FetchEmitTuple per object.

{
  "pipes-iterator": {
    "json-pipes-iterator": {
      "jsonPath": "/data/work-items.jsonl",
      "fetcherId": "fsf",
      "emitterId": "fse"
    }
  }
}

Configuration

Field Default Description

jsonPath

required

Path to the JSONL file on disk.

fetcherId / emitterId

required

IDs of the fetcher and emitter to bind to each emitted tuple. See Pipes Iterators for the shared iterator contract.

Notes

  • The file format is JSON-lines (also called NDJSON) — one valid JSON object per line, no surrounding array brackets.

  • Each line’s JSON object is parsed and its fields become the metadata of the emitted FetchEmitTuple. The keys used for fetch / emit identification come from the tuple-shaped fields (e.g., fetchKey, emitKey).

  • For columnar work items in a CSV, use the CSV iterator instead.