Class CrawlDbFilter


  • public class CrawlDbFilter
    extends Mapper<Text,CrawlDatum,Text,CrawlDatum>
    This class provides a way to separate the URL normalization and filtering steps from the rest of the CrawlDb manipulation code.
    Author:
    Andrzej Bialecki
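
The separation described above can be illustrated with a minimal, dependency-free sketch. It is not the Nutch implementation: the class name UrlCleanupSketch, its methods, and the specific normalization and filtering rules are assumptions made for illustration, and plain strings stand in for the Text keys and CrawlDatum values. The idea shown is only that each entry's URL is normalized first, then filtered, and that entries whose URL is rejected are dropped before any other CrawlDb logic sees them.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

// Hypothetical stand-in, not Nutch code: String replaces Text and CrawlDatum.
final class UrlCleanupSketch {

    // Normalization step (assumed rules): lowercase scheme and host, drop the fragment.
    static String normalize(String url) throws URISyntaxException {
        URI u = new URI(url.trim());
        String scheme = u.getScheme() == null ? null : u.getScheme().toLowerCase(Locale.ROOT);
        String host = u.getHost() == null ? null : u.getHost().toLowerCase(Locale.ROOT);
        return new URI(scheme, u.getUserInfo(), host, u.getPort(),
                       u.getPath(), u.getQuery(), null).toString();
    }

    // Filtering step (assumed rule): keep only http and https URLs.
    static boolean accept(String url) {
        return url.startsWith("http://") || url.startsWith("https://");
    }

    // Normalize, then filter; an empty result means the entry should be dropped.
    static Optional<String> clean(String url) {
        try {
            String normalized = normalize(url);
            return accept(normalized) ? Optional.of(normalized) : Optional.empty();
        } catch (URISyntaxException e) {
            return Optional.empty(); // unparseable URLs are treated as rejected
        }
    }

    // Mapper-like pass: re-key surviving entries by their cleaned URL, values untouched.
    static Map<String, String> cleanAll(Map<String, String> entries) {
        Map<String, String> out = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : entries.entrySet()) {
            clean(e.getKey()).ifPresent(k -> out.put(k, e.getValue()));
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, String> in = new LinkedHashMap<>();
        in.put("HTTP://Example.COM/a?x=1#frag", "datum-1");
        in.put("ftp://example.com/file", "datum-2");
        System.out.println(cleanAll(in));
    }
}
```

Keeping the URL cleanup in its own pass means the downstream code can assume every key is already normalized and accepted, which is the kind of separation the class description refers to.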