Apache Tika 3.2.0

The most notable changes in Tika 3.2.0 over the previous release are:

  • Detect inline images in MSG files (TIKA-4391).
  • Improve extraction of metadata in MSG files (TIKA-4381).
  • Fix concurrency bug in TikaToXMP (TIKA-4393).
  • Fix potential GDAL deadlock (TIKA-4385).
  • Include internal attachment path in tika-eval reports (TIKA-4374).
  • Upgrade jsoupt to 1.20.1 with workaround for change in self-closing tag behavior (TIKA-4419).
  • Upgrade dependencies (TIKA-4379).
  • Allow users to turn off the injection of some headers into the content stream of MSG files (TIKA-4345).

The following people have contributed to Tika 3.0.0 by submitting or commenting on the issues resolved in this release:

  • Alexander Veit
  • David Frizelle
  • Ghiles OUAREZKI
  • james
  • Leszek Sliwko
  • Subbu
  • Tilman Hausherr
  • Tim Allison

See https://s.apache.org/sxih8 for more details on these contributions.