public class RelTagIndexingFilter extends java.lang.Object implements IndexingFilter
IndexingFilter that adds tag field(s) to the document.

| Constructor and Description |
|---|
| RelTagIndexingFilter() |

| Modifier and Type | Method and Description |
|---|---|
| NutchDocument | filter(NutchDocument doc, java.lang.String url, WebPage page): The RelTagIndexingFilter filter object. |
| Configuration | getConf(): Get the Configuration object. |
| java.util.Collection<WebPage.Field> | getFields(): Gets all the fields for a given WebPage. Many datastores need to set up the mapreduce job by specifying the fields needed. |
| void | setConf(Configuration conf): Set the Configuration object. |
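
The contract summarized above (declare the needed page fields, then add tag field(s) to the document) can be illustrated with a small self-contained sketch. It does not use the real Nutch classes: RelTagSketch, Field, Page, Doc and REL_TAG_KEY are hypothetical stand-ins, and the assumption that tags arrive as one tab-separated metadata value under the key "Rel-Tag" is made only for illustration. The actual RelTagIndexingFilter may read and store tags differently.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Collection;
import java.util.EnumSet;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Illustrative sketch only; none of these types are the real Nutch classes. */
class RelTagSketch {

    /** Hypothetical stand-in for WebPage.Field. */
    enum Field { BASE_URL, CONTENT, METADATA }

    /** Hypothetical stand-in for WebPage: just a metadata map. */
    static class Page {
        final Map<String, ByteBuffer> metadata = new HashMap<>();
    }

    /** Hypothetical stand-in for NutchDocument: multi-valued named fields. */
    static class Doc {
        final Map<String, List<String>> fields = new HashMap<>();

        void add(String name, String value) {
            fields.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
        }
    }

    /** Assumed metadata key under which a parser stored the rel-tags. */
    static final String REL_TAG_KEY = "Rel-Tag";

    /** Declares the page fields this filter reads, so a job can load only those. */
    static Collection<Field> getFields() {
        return EnumSet.of(Field.METADATA);
    }

    /** Adds one "tag" value per rel-tag found in the page metadata. */
    static Doc filter(Doc doc, String url, Page page) {
        ByteBuffer raw = page.metadata.get(REL_TAG_KEY);
        if (raw == null) {
            return doc; // no rel-tags recorded for this page
        }
        // duplicate() leaves the stored buffer's position untouched
        String joined = StandardCharsets.UTF_8.decode(raw.duplicate()).toString();
        for (String tag : joined.split("\t")) {
            if (!tag.isEmpty()) {
                doc.add("tag", tag);
            }
        }
        return doc;
    }

    public static void main(String[] args) {
        Page page = new Page();
        page.metadata.put(REL_TAG_KEY,
                ByteBuffer.wrap("nutch\tsearch".getBytes(StandardCharsets.UTF_8)));
        Doc doc = filter(new Doc(), "http://example.com/", page);
        System.out.println(doc.fields.get("tag")); // prints [nutch, search]
    }
}
```

Returning the document unchanged when no tags are present keeps the sketch safe to chain with other filters; whether the real filter behaves the same way is not stated on this page.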

public java.util.Collection<WebPage.Field> getFields()
Gets all the fields for a given WebPage. Many datastores need to set up the mapreduce job by specifying the fields needed. All extensions that work on WebPage are able to specify what fields they need.
Specified by: getFields in interface FieldPluggable

public void setConf(Configuration conf)
Set the Configuration object.
Specified by: setConf in interface Configurable

public Configuration getConf()
Get the Configuration object.
Specified by: getConf in interface Configurable

public NutchDocument filter(NutchDocument doc, java.lang.String url, WebPage page) throws IndexingException
The RelTagIndexingFilter filter object.
Specified by: filter in interface IndexingFilter
Parameters:
doc - The NutchDocument object
url - URL to be filtered for rel-tags
page - WebPage object relative to the URL
Throws:
IndexingException

Copyright © 2019 The Apache Software Foundation