Package org.apache.tika.sax
Class LinkContentHandler
java.lang.Object
org.xml.sax.helpers.DefaultHandler
org.apache.tika.sax.LinkContentHandler
- All Implemented Interfaces:
ContentHandler,DTDHandler,EntityResolver,ErrorHandler
Content handler that collects links from an XHTML document.
-
Constructor Summary
ConstructorsConstructorDescriptionDefault constructorLinkContentHandler(boolean collapseWhitespaceInAnchor) Default constructor -
Method Summary
Modifier and TypeMethodDescriptionvoidcharacters(char[] ch, int start, int length) voidendElement(String uri, String local, String name) getLinks()Returns the list of collected links.voidignorableWhitespace(char[] ch, int start, int length) voidstartElement(String uri, String local, String name, Attributes attributes) Methods inherited from class org.xml.sax.helpers.DefaultHandler
endDocument, endPrefixMapping, error, fatalError, notationDecl, processingInstruction, resolveEntity, setDocumentLocator, skippedEntity, startDocument, startPrefixMapping, unparsedEntityDecl, warning
-
Constructor Details
-
LinkContentHandler
public LinkContentHandler()Default constructor -
LinkContentHandler
public LinkContentHandler(boolean collapseWhitespaceInAnchor) Default constructor- Parameters:
collapseWhitespaceInAnchor-
-
-
Method Details
-
getLinks
Returns the list of collected links.- Returns:
- collected links
-
startElement
- Specified by:
startElementin interfaceContentHandler- Overrides:
startElementin classDefaultHandler
-
characters
public void characters(char[] ch, int start, int length) - Specified by:
charactersin interfaceContentHandler- Overrides:
charactersin classDefaultHandler
-
ignorableWhitespace
public void ignorableWhitespace(char[] ch, int start, int length) - Specified by:
ignorableWhitespacein interfaceContentHandler- Overrides:
ignorableWhitespacein classDefaultHandler
-
endElement
- Specified by:
endElementin interfaceContentHandler- Overrides:
endElementin classDefaultHandler
-