Class RegexUtils


  • public class RegexUtils
    extends Object
    Inspired from Nutch code class OutlinkExtractor. Apply regex to extract content
    • Constructor Detail

      • RegexUtils

        public RegexUtils()
    • Method Detail

      • extractLinks

        public static List<String> extractLinks​(String content)
        Extract urls from plain text.
        Parameters:
        content - The plain text content to examine
        Returns:
        List of urls within found in the plain text