| Interface | Description |
|---|---|
| HtmlMapper | HTML mapper used to make incoming HTML documents easier to handle by Tika clients. |

| Class | Description |
|---|---|
| DataURIScheme | |
| DataURISchemeUtil | Not thread safe. |
| DefaultHtmlMapper | The default HTML mapping rules in Tika. |
| HtmlEncodingDetector | Character encoding detector that determines the encoding of an HTML document from the charset parameter, if any, of a Content-Type http-equiv meta tag near the beginning of the document. |
| HtmlParser | HTML parser. |
| IdentityHtmlMapper | Alternative HTML mapping rules that pass the input HTML as-is without any modifications. |

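
The HtmlEncodingDetector description above refers to a charset parameter in a Content-Type http-equiv meta tag near the start of the document. The following is a minimal, standard-library-only sketch of that idea. It is not Tika's implementation: the class name `MetaCharsetSniffer`, the 8 KiB prefix limit, and both regular expressions are assumptions made for this illustration.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of Tika.
final class MetaCharsetSniffer {

    // Assumption: only the first 8 KiB of the document is inspected.
    private static final int PREFIX_LENGTH = 8192;

    // Matches a <meta> tag that declares http-equiv="Content-Type".
    private static final Pattern META_CONTENT_TYPE = Pattern.compile(
            "<meta\\s[^>]*http-equiv\\s*=\\s*[\"']?content-type[\"']?[^>]*>",
            Pattern.CASE_INSENSITIVE);

    // Extracts the value of the charset parameter inside that tag.
    private static final Pattern CHARSET_PARAM = Pattern.compile(
            "charset\\s*=\\s*[\"']?([A-Za-z0-9._:-]+)",
            Pattern.CASE_INSENSITIVE);

    private MetaCharsetSniffer() {
    }

    // Returns the declared charset, or empty if none is declared or supported.
    static Optional<Charset> sniff(byte[] document) {
        int length = Math.min(document.length, PREFIX_LENGTH);
        // ISO-8859-1 maps every byte to one char, so decoding never fails
        // and ASCII markup stays readable.
        String prefix = new String(document, 0, length, StandardCharsets.ISO_8859_1);

        Matcher meta = META_CONTENT_TYPE.matcher(prefix);
        while (meta.find()) {
            Matcher charset = CHARSET_PARAM.matcher(meta.group());
            if (charset.find()) {
                try {
                    return Optional.of(Charset.forName(charset.group(1)));
                } catch (IllegalArgumentException e) {
                    // Malformed or unsupported charset name: keep scanning.
                }
            }
        }
        return Optional.empty();
    }
}
```

Calling `MetaCharsetSniffer.sniff(bytes)` on a document whose head contains `<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">` yields the ISO-8859-1 charset. A document with no such tag yields an empty `Optional`, at which point a caller would fall back to some other detection strategy. This sketch deliberately ignores the shorter `<meta charset="...">` form.
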
| Exception | Description |
|---|---|
| DataURISchemeParseException | |

Copyright © 2007–2022 The Apache Software Foundation. All rights reserved.