public class CCIndexingFilter extends java.lang.Object implements IndexingFilter
| Modifier and Type | Field and Description |
|---|---|
static java.lang.String |
FIELD
The name of the document field we use.
|
X_POINT_ID| Constructor and Description |
|---|
CCIndexingFilter() |
| Modifier and Type | Method and Description |
|---|---|
void |
addUrlFeatures(NutchDocument doc,
java.lang.String urlString)
Add the features represented by a license URL.
|
NutchDocument |
filter(NutchDocument doc,
java.lang.String url,
WebPage page)
Adds fields or otherwise modifies the document that will be indexed for a
parse.
|
Configuration |
getConf() |
java.util.Collection<WebPage.Field> |
getFields() |
void |
setConf(Configuration conf) |
public void addUrlFeatures(NutchDocument doc, java.lang.String urlString)
public void setConf(Configuration conf)
setConf in interface Configurablepublic Configuration getConf()
getConf in interface Configurablepublic java.util.Collection<WebPage.Field> getFields()
getFields in interface FieldPluggablepublic NutchDocument filter(NutchDocument doc, java.lang.String url, WebPage page) throws IndexingException
IndexingFilterfilter in interface IndexingFilterdoc - document instance for collecting fieldsurl - page urlIndexingExceptionCopyright © 2019 The Apache Software Foundation