Apache Tika 3.0.0-BETA2
The most notable changes in Tika 3.0.0-BETA2 over the previous release are:
BREAKING CHANGES
- Updated PST parser to use standard Message metadata keys and improved handling of embedded files (TIKA-4248).
- Convenience methods for XML readers were moved from ParseContext to XMLReaderUtils (TIKA-4259).
Other Changes
- Add GRPC server (TIKA-4181).
- Improved configurability in tika-pipes (TIKA-4243).
- Add optional PST parser based on libpst/readpst (TIKA-4126).
- Fix bug in DateUtils that stripped timezone information fromincoming Calendar objects (TIKA-4126).
The following people have contributed to Tika 3.0.0-BETA2 by submitting or commenting on the issues resolved in this release:
- Alexander Veit
- Bartek Ciszkowski
- Gregory Lepore
- Kartik Jain
- Manfred Baedke
- Matthias Juchmes
- Nicholas DiPiazza
- Nicolas Daniels
- Nissim Shiman
- Robert Fromholz
- Robin Schimpf
- Tika User
- Tilman Hausherr
- Tim Allison
- Xiaohong Yang
See https://s.apache.org/3vk0l for more details on these contributions.