Package org.apache.tika.parser.html
package org.apache.tika.parser.html
-
ClassDescriptionNot thread safe.The default HTML mapping rules in Tika.Character encoding detector for determining the character encoding of a HTML document based on the potential charset parameter found in a Content-Type http-equiv meta tag somewhere near the beginning.Configuration class for JSON deserialization.HTML mapper used to make incoming HTML documents easier to handle by Tika clients.Alternative HTML mapping rules that pass the input HTML as-is without any modifications.HTML parser.Configuration class for JSON deserialization.