Interface | Description |
---|---|
HtmlMapper |
HTML mapper used to make incoming HTML documents easier to handle by
Tika clients.
|
Class | Description |
---|---|
DataURIScheme | |
DataURISchemeUtil |
Not thread safe.
|
DefaultHtmlMapper |
The default HTML mapping rules in Tika.
|
HtmlEncodingDetector |
Character encoding detector for determining the character encoding of a
HTML document based on the potential charset parameter found in a
Content-Type http-equiv meta tag somewhere near the beginning.
|
HtmlParser |
HTML parser.
|
IdentityHtmlMapper |
Alternative HTML mapping rules that pass the input HTML as-is without any
modifications.
|
Exception | Description |
---|---|
DataURISchemeParseException |
Copyright © 2007–2021 The Apache Software Foundation. All rights reserved.