Package org.apache.nutch.crawl
Class LinkDbFilter
- java.lang.Object
-
- org.apache.hadoop.mapreduce.Mapper<Text,Inlinks,Text,Inlinks>
-
- org.apache.nutch.crawl.LinkDbFilter
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper
Mapper.Context
-
-
Field Summary
Fields Modifier and Type Field Description static String
URL_FILTERING
static String
URL_NORMALIZING
static String
URL_NORMALIZING_SCOPE
-
Constructor Summary
Constructors Constructor Description LinkDbFilter()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description void
map(Text key, Inlinks value, Mapper.Context context)
void
setup(Mapper.Context context)
-
-
-
Field Detail
-
URL_FILTERING
public static final String URL_FILTERING
- See Also:
- Constant Field Values
-
URL_NORMALIZING
public static final String URL_NORMALIZING
- See Also:
- Constant Field Values
-
URL_NORMALIZING_SCOPE
public static final String URL_NORMALIZING_SCOPE
- See Also:
- Constant Field Values
-
-
Method Detail
-
setup
public void setup(Mapper.Context context)
-
map
public void map(Text key, Inlinks value, Mapper.Context context) throws IOException, InterruptedException
- Overrides:
map
in classMapper<Text,Inlinks,Text,Inlinks>
- Throws:
IOException
InterruptedException
-
-