Releasing Tika Docker Images
Table of Contents
This guide covers the process for releasing Apache Tika Docker images.
Image Types
The tika-docker repository produces two types of images:
- Minimal
-
Apache Tika with base dependencies (Java only)
- Full
-
Apache Tika plus Tesseract OCR and GDAL
Helper Tools
- docker-tool.sh
-
Automates building, testing, and publishing Docker images
- republish-images.sh
-
Legacy script for batch republishing images
| The repository also contains Docker Compose files for advanced scenarios (Vision, Grobid, OCR, NER), but these are not used for official releases. |
Release Process
Step 4: Test Locally
Test the release locally before publishing:
./docker-tool.sh build <docker-version> <tika-version>
./docker-tool.sh test <docker-version>
Step 5: Commit Changes
Commit all changes:
git add README.md .env CHANGES.md
git commit -m "Prepare for Docker release <docker-version>"
git push
Step 6: Build and Publish
Build and publish the images using the docker-tool script.
Example for version 3.1.0.0 based on Tika 3.1.0:
# Build the images
./docker-tool.sh build 3.1.0.0 3.1.0
# Test the images
./docker-tool.sh test 3.1.0.0
# Publish to DockerHub
./docker-tool.sh publish 3.1.0.0 3.1.0
Multi-architecture building takes time. The publish step automatically
updates the -latest tag on DockerHub.
|
Post-Release
After publishing the Docker images:
-
Verify the images are available on DockerHub at https://hub.docker.com/r/apache/tika
-
Test pulling and running the new images
-
Update the main Tika website if needed
-
Proceed to release the Helm charts if applicable