Package org.apache.tika.parser.microsoft.rtf.jflex
package org.apache.tika.parser.microsoft.rtf.jflex
-
ClassDescriptionShared charset maps for RTF parsing.Handles embedded objects and pictures within the JFlex-based RTF token stream.State associated with a single RTF group (
\{ ... \}).Extracts the original HTML from an RTF document that contains encapsulated HTML (as indicated by the\fromhtml1control word), using a JFlex-based tokenizer and sharedRTFStatefor font/codepage tracking.Parses OLE objdata from an RTF stream inline, byte by byte.Streams decoded bytes from an RTF\pictgroup to a temp file.Shared RTF parsing state: group stack, font table, codepage tracking, and unicode skip handling.A single token produced by the RTF tokenizer.