Uses of Interface
org.apache.nutch.indexer.IndexingFilter
-
Packages that use IndexingFilter Package Description org.apache.nutch.analysis.lang Text document language identifier.org.apache.nutch.indexer.anchor An indexing plugin for inbound anchor text.org.apache.nutch.indexer.arbitrary Indexing filter to add document arbitrary data to the index from the output of a user-specified class.org.apache.nutch.indexer.basic A basic indexing plugin, adds basic fields: url, host, title, content, etc.org.apache.nutch.indexer.feed Indexing filter to index meta data from RSS feeds.org.apache.nutch.indexer.filter org.apache.nutch.indexer.geoip This plugin implements an indexing filter which takes advantage of the GeoIP2-java API.org.apache.nutch.indexer.jexl This plugin implements a dynamic indexing filter which uses JEXL expressions to allow filtering based on the page's metadataorg.apache.nutch.indexer.links org.apache.nutch.indexer.metadata Indexing filter to add document metadata to the index.org.apache.nutch.indexer.more A more indexing plugin, adds "more" index fields:last modified date, MIME type, content length.org.apache.nutch.indexer.replace Indexing filter to allow pattern replacements on metadata.org.apache.nutch.indexer.staticfield A simple plugin called at indexing that adds fields with static data.org.apache.nutch.indexer.subcollection Indexing filter to assign documents to subcollections.org.apache.nutch.indexer.tld Top Level Domain Indexing plugin.org.apache.nutch.indexer.urlmeta URL Meta Tag Indexing Pluginorg.apache.nutch.microformats.reltag A microformats Rel-Tag Parser/Indexer/Querier plugin.org.creativecommons.nutch Sample plugins that parse and index Creative Commons metadata. -
-
Uses of IndexingFilter in org.apache.nutch.analysis.lang
Classes in org.apache.nutch.analysis.lang that implement IndexingFilter Modifier and Type Class Description class
LanguageIndexingFilter
AnIndexingFilter
that add alang
(language) field to the document. -
Uses of IndexingFilter in org.apache.nutch.indexer.anchor
Classes in org.apache.nutch.indexer.anchor that implement IndexingFilter Modifier and Type Class Description class
AnchorIndexingFilter
Indexing filter that offers an option to either index all inbound anchor text for a document or deduplicate anchors. -
Uses of IndexingFilter in org.apache.nutch.indexer.arbitrary
Classes in org.apache.nutch.indexer.arbitrary that implement IndexingFilter Modifier and Type Class Description class
ArbitraryIndexingFilter
Adds arbitrary searchable fields to a document from the class and method the user identifies in the config. -
Uses of IndexingFilter in org.apache.nutch.indexer.basic
Classes in org.apache.nutch.indexer.basic that implement IndexingFilter Modifier and Type Class Description class
BasicIndexingFilter
Adds basic searchable fields to a document. -
Uses of IndexingFilter in org.apache.nutch.indexer.feed
Classes in org.apache.nutch.indexer.feed that implement IndexingFilter Modifier and Type Class Description class
FeedIndexingFilter
-
Uses of IndexingFilter in org.apache.nutch.indexer.filter
Classes in org.apache.nutch.indexer.filter that implement IndexingFilter Modifier and Type Class Description class
MimeTypeIndexingFilter
AnIndexingFilter
that allows filtering of documents based on the MIME Type detected by Tika -
Uses of IndexingFilter in org.apache.nutch.indexer.geoip
Classes in org.apache.nutch.indexer.geoip that implement IndexingFilter Modifier and Type Class Description class
GeoIPIndexingFilter
This plugin implements an indexing filter which takes advantage of the GeoIP2-java API. -
Uses of IndexingFilter in org.apache.nutch.indexer.jexl
Classes in org.apache.nutch.indexer.jexl that implement IndexingFilter Modifier and Type Class Description class
JexlIndexingFilter
AnIndexingFilter
that allows filtering of documents based on a JEXL expression. -
Uses of IndexingFilter in org.apache.nutch.indexer.links
Classes in org.apache.nutch.indexer.links that implement IndexingFilter Modifier and Type Class Description class
LinksIndexingFilter
-
Uses of IndexingFilter in org.apache.nutch.indexer.metadata
Classes in org.apache.nutch.indexer.metadata that implement IndexingFilter Modifier and Type Class Description class
MetadataIndexer
Indexer which can be configured to extract metadata from the crawldb, parse metadata or content metadata. -
Uses of IndexingFilter in org.apache.nutch.indexer.more
Classes in org.apache.nutch.indexer.more that implement IndexingFilter Modifier and Type Class Description class
MoreIndexingFilter
Add (or reset) a few metaData properties as respective fields (if they are available), so that they can be accurately used within the search index. -
Uses of IndexingFilter in org.apache.nutch.indexer.replace
Classes in org.apache.nutch.indexer.replace that implement IndexingFilter Modifier and Type Class Description class
ReplaceIndexer
Do pattern replacements on selected field contents prior to indexing. -
Uses of IndexingFilter in org.apache.nutch.indexer.staticfield
Classes in org.apache.nutch.indexer.staticfield that implement IndexingFilter Modifier and Type Class Description class
StaticFieldIndexer
A simple plugin called at indexing that adds fields with static data. -
Uses of IndexingFilter in org.apache.nutch.indexer.subcollection
Classes in org.apache.nutch.indexer.subcollection that implement IndexingFilter Modifier and Type Class Description class
SubcollectionIndexingFilter
-
Uses of IndexingFilter in org.apache.nutch.indexer.tld
Classes in org.apache.nutch.indexer.tld that implement IndexingFilter Modifier and Type Class Description class
TLDIndexingFilter
Adds the public suffix (aka. -
Uses of IndexingFilter in org.apache.nutch.indexer.urlmeta
Classes in org.apache.nutch.indexer.urlmeta that implement IndexingFilter Modifier and Type Class Description class
URLMetaIndexingFilter
This is part of the URL Meta plugin. -
Uses of IndexingFilter in org.apache.nutch.microformats.reltag
Classes in org.apache.nutch.microformats.reltag that implement IndexingFilter Modifier and Type Class Description class
RelTagIndexingFilter
AnIndexingFilter
that addtag
field(s) to the document. -
Uses of IndexingFilter in org.creativecommons.nutch
Classes in org.creativecommons.nutch that implement IndexingFilter Modifier and Type Class Description class
CCIndexingFilter
Adds basic searchable fields to a document.
-