Package org.apache.tika.parser.html
Class IdentityHtmlMapper
- java.lang.Object
  - org.apache.tika.parser.html.IdentityHtmlMapper
 
All Implemented Interfaces:
- HtmlMapper
 
public class IdentityHtmlMapper extends Object implements HtmlMapper

Alternative HTML mapping rules that pass the input HTML as-is without any modifications.

Since:
- Apache Tika 0.8
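
To make "as-is" concrete, here is a minimal, self-contained sketch of what identity mapping rules could look like. The nested `Mapper` interface is a hypothetical stand-in that mirrors the three methods this page lists for `HtmlMapper`. The lower-casing is an assumption based on the documented return values ("XHTML element name (lower case)"), not a copy of Tika's source.

```java
import java.util.Locale;

// Illustrative sketch only: Mapper is a local stand-in for Tika's HtmlMapper,
// and IdentityMapper is an assumed reading of "pass the input HTML as-is".
class IdentityMappingSketch {

    /** Hypothetical mirror of the three HtmlMapper methods documented here. */
    interface Mapper {
        String mapSafeElement(String name);
        String mapSafeAttribute(String elementName, String attributeName);
        boolean isDiscardElement(String name);
    }

    /** Keeps every element and attribute; only normalises names to lower case. */
    static final class IdentityMapper implements Mapper {
        // Stateless, so one shared instance is enough (compare the INSTANCE field).
        static final Mapper INSTANCE = new IdentityMapper();

        @Override
        public String mapSafeElement(String name) {
            return name.toLowerCase(Locale.ROOT); // never null: no element is dropped
        }

        @Override
        public String mapSafeAttribute(String elementName, String attributeName) {
            return attributeName.toLowerCase(Locale.ROOT); // never null: no attribute is dropped
        }

        @Override
        public boolean isDiscardElement(String name) {
            return false; // no element content is discarded
        }
    }

    public static void main(String[] args) {
        Mapper mapper = IdentityMapper.INSTANCE;
        System.out.println(mapper.mapSafeElement("SCRIPT"));
        System.out.println(mapper.mapSafeAttribute("a", "onclick"));
        System.out.println(mapper.isDiscardElement("STYLE"));
    }
}
```

With Tika itself on the classpath, the real class would typically be handed to the HTML parser through the parse context, for example `context.set(HtmlMapper.class, IdentityHtmlMapper.INSTANCE)`. Check the `ParseContext` and `HtmlParser` documentation for your Tika version before relying on this.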
 
Field Summary
- static HtmlMapper INSTANCE

Constructor Summary
- IdentityHtmlMapper()

Method Summary
- boolean isDiscardElement(String name): Checks whether all content within the given HTML element should be discarded instead of including it in the parse output.
- String mapSafeAttribute(String elementName, String attributeName): Maps "safe" HTML attribute names to semantic XHTML equivalents.
- String mapSafeElement(String name): Maps "safe" HTML element names to semantic XHTML equivalents.
 
Field Detail

INSTANCE
public static final HtmlMapper INSTANCE
 
Method Detail

isDiscardElement
public boolean isDiscardElement(String name)

Description copied from interface: HtmlMapper
Checks whether all content within the given HTML element should be discarded instead of including it in the parse output.

Specified by:
- isDiscardElement in interface HtmlMapper
Parameters:
- name - HTML element name (upper case)
Returns:
- true if content inside the named element should be ignored, false otherwise
 
mapSafeAttribute
public String mapSafeAttribute(String elementName, String attributeName)

Description copied from interface: HtmlMapper
Maps "safe" HTML attribute names to semantic XHTML equivalents. If the given attribute is unknown or deemed unsafe for inclusion in the parse output, then this method returns null and the attribute will be ignored. This method assumes that the element name is valid and normalised.

Specified by:
- mapSafeAttribute in interface HtmlMapper
Parameters:
- elementName - HTML element name (lower case)
- attributeName - HTML attribute name (lower case)
Returns:
- XHTML attribute name (lower case), or null if the attribute is unsafe
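
The null-return contract described above is easiest to see by applying a mapper to a set of attributes. The sketch below is an illustration under stated assumptions: `AttributeMapper` and `filterAttributes` are hypothetical names, the "restrictive" rule set is invented purely for contrast, and the identity rule assumes attribute names are only lower-cased.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Illustrative sketch only; none of these types are part of Tika.
class AttributeMappingSketch {

    /** Hypothetical single-method view of HtmlMapper.mapSafeAttribute. */
    interface AttributeMapper {
        String mapSafeAttribute(String elementName, String attributeName);
    }

    /** Assumed identity rule: every attribute is kept under its lower-case name. */
    static final AttributeMapper IDENTITY =
            (element, attribute) -> attribute.toLowerCase(Locale.ROOT);

    /** Invented restrictive rule, for contrast: only a small allow-list survives. */
    static final AttributeMapper RESTRICTIVE = (element, attribute) -> {
        String name = attribute.toLowerCase(Locale.ROOT);
        return Set.of("href", "title").contains(name) ? name : null;
    };

    /** Copies attributes whose mapped name is non-null; null means "ignore". */
    static Map<String, String> filterAttributes(
            AttributeMapper mapper, String element, Map<String, String> attributes) {
        Map<String, String> kept = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : attributes.entrySet()) {
            String mapped = mapper.mapSafeAttribute(element, entry.getKey());
            if (mapped != null) {
                kept.put(mapped, entry.getValue());
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        Map<String, String> attributes = new LinkedHashMap<>();
        attributes.put("href", "/docs");
        attributes.put("onclick", "track()");

        System.out.println(filterAttributes(IDENTITY, "a", attributes));
        System.out.println(filterAttributes(RESTRICTIVE, "a", attributes));
    }
}
```

In this sketch the identity rule keeps both `href` and `onclick`, while the restrictive rule keeps only `href`. That difference is why identity-style rules are probably best reserved for input you trust or output you sanitise later.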
 
mapSafeElement
public String mapSafeElement(String name)

Description copied from interface: HtmlMapper
Maps "safe" HTML element names to semantic XHTML equivalents. If the given element is unknown or deemed unsafe for inclusion in the parse output, then this method returns null and the element will be ignored but the content inside it is still processed. See the HtmlMapper.isDiscardElement(String) method for a way to discard the entire contents of an element.

Specified by:
- mapSafeElement in interface HtmlMapper
Parameters:
- name - HTML element name (upper case)
Returns:
- XHTML element name (lower case), or null if the element is unsafe
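
The difference between a null result from `mapSafeElement` (tag dropped, content kept) and a true result from `isDiscardElement` (content dropped as well) can be sketched with a toy event stream. Everything here is hypothetical: the `ElementMapper` interface, the string-encoded events, and both rule sets are invented for illustration and say nothing about how Tika's parser is actually implemented.

```java
import java.util.List;
import java.util.Locale;

// Illustrative sketch only; the event model and both rule sets are invented.
class ElementMappingSketch {

    /** Hypothetical mirror of the two element-related HtmlMapper methods. */
    interface ElementMapper {
        String mapSafeElement(String name);
        boolean isDiscardElement(String name);
    }

    /** Assumed identity rules: keep every element, discard nothing. */
    static final ElementMapper IDENTITY = new ElementMapper() {
        @Override
        public String mapSafeElement(String name) {
            return name.toLowerCase(Locale.ROOT);
        }

        @Override
        public boolean isDiscardElement(String name) {
            return false;
        }
    };

    /** Invented restrictive rules: only P is safe, SCRIPT content is discarded. */
    static final ElementMapper RESTRICTIVE = new ElementMapper() {
        @Override
        public String mapSafeElement(String name) {
            return name.equals("P") ? "p" : null;
        }

        @Override
        public boolean isDiscardElement(String name) {
            return name.equals("SCRIPT");
        }
    };

    /** Toy events: "<NAME" opens an element, ">NAME" closes it, anything else is text. */
    static String render(ElementMapper mapper, List<String> events) {
        StringBuilder out = new StringBuilder();
        int discardDepth = 0; // greater than 0 while inside a discarded element
        for (String event : events) {
            boolean open = event.startsWith("<");
            boolean close = event.startsWith(">");
            if (!open && !close) {
                if (discardDepth == 0) {
                    out.append(event); // text survives unless it is being discarded
                }
                continue;
            }
            String name = event.substring(1);
            if (mapper.isDiscardElement(name)) {
                discardDepth += open ? 1 : -1; // drop the tag and everything inside it
                continue;
            }
            String mapped = mapper.mapSafeElement(name);
            if (discardDepth == 0 && mapped != null) {
                out.append(open ? "<" : "</").append(mapped).append(">");
            }
            // mapped == null: the tag is skipped, but nested content is still processed
        }
        return out.toString();
    }

    public static void main(String[] args) {
        List<String> events = List.of(
                "<DIV", "<P", "Hello", ">P", "<SCRIPT", "alert(1)", ">SCRIPT", ">DIV");
        System.out.println(render(IDENTITY, events));
        System.out.println(render(RESTRICTIVE, events));
    }
}
```

Under the assumed identity rules every tag and all text pass through. Under the invented restrictive rules the `DIV` tags disappear while the paragraph inside them survives, and the `SCRIPT` element vanishes together with its content.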
 
 