Apache Tika 3.0.0-BETA2

The most notable changes in Tika 3.0.0-BETA2 over the previous release are:

BREAKING CHANGES

  • Updated PST parser to use standard Message metadata keys and improved handling of embedded files (TIKA-4248).
  • Convenience methods for XML readers were moved from ParseContext to XMLReaderUtils (TIKA-4259).

    Other Changes

  • Add GRPC server (TIKA-4181).
  • Improved configurability in tika-pipes (TIKA-4243).
  • Add optional PST parser based on libpst/readpst (TIKA-4126).
    • Fix bug in DateUtils that stripped timezone information fromincoming Calendar objects (TIKA-4126).

The following people have contributed to Tika 3.0.0-BETA2 by submitting or commenting on the issues resolved in this release:

  • Alexander Veit
  • Bartek Ciszkowski
  • Gregory Lepore
  • Kartik Jain
  • Manfred Baedke
  • Matthias Juchmes
  • Nicholas DiPiazza
  • Nicolas Daniels
  • Nissim Shiman
  • Robert Fromholz
  • Robin Schimpf
  • Tika User
  • Tilman Hausherr
  • Tim Allison
  • Xiaohong Yang

See https://s.apache.org/3vk0l for more details on these contributions.