Package org.apache.tika.utils
Class RegexUtils
java.lang.Object
org.apache.tika.utils.RegexUtils
Inspired from Nutch code class OutlinkExtractor. Apply regex to extract
content
-
Constructor Summary
-
Method Summary
-
Constructor Details
-
RegexUtils
public RegexUtils()
-
-
Method Details
-
extractLinks
Extract urls from plain text.- Parameters:
content
- The plain text content to examine- Returns:
- List of urls within found in the plain text
-