Package org.apache.tika.parser.dwg
Class DWGReadFormatRemover
java.lang.Object
org.apache.tika.parser.dwg.DWGReadFormatRemover
DWGReadFormatRemover removes the formatting from the text from libredwg files so only
the raw text remains.
What needs to be cleaned has been found on the following websites:
https://www.cadforum.cz/en/text-formatting-codes-in-mtext-objects-tip8640
https://adndevblog.typepad.com/autocad/2017/09/dissecting-mtext-format-codes.html
-
Constructor Summary
-
Method Summary
-
Constructor Details
-
DWGReadFormatRemover
public DWGReadFormatRemover()
-
-
Method Details
-
cleanupDwgString
-