Class JSParseFilter

  • All Implemented Interfaces:
    Configurable, HtmlParseFilter, Parser, Pluggable

    public class JSParseFilter
    extends Object
    implements HtmlParseFilter, Parser
    This class is a heuristic link extractor for JavaScript files and code snippets. The general idea of a two-pass regex matching comes from Heritrix. Parts of the code come from OutlinkExtractor.java