Apache Tika 2.6.0

The most notable changes in Tika 2.6.0 over the previous release are:

  • Add an optional Siegfried detector (TIKA-3901).
  • Move OverrideDetector's functionality to the CompositeDetector (TIKA-3904).
  • Refactor the FileCommandDetector to behave the same as the Siegfried detector; see setUseMime in the javadoc (TIKA-3902).
  • Fix a bug in the OpenSearch emitter that prevented upserts on documents with embedded files (TIKA-3882).
  • Extract PDF actions and triggers into the file's metadata (TIKA-3887).
  • Add a tika-async-cli module (TIKA-3885).

The following people have contributed to Tika 2.6.0 by submitting or commenting on the issues resolved in this release:

  • Dave Meikle
  • Ethan Wilansky
  • Luca Perico
  • Tilman Hausherr
  • Tim Allison
  • Tong Wang

See https://s.apache.org/zrcax for more details on these contributions.