public class JsoupIndexingFilter extends java.lang.Object implements IndexingFilter
| Modifier and Type | Field and Description |
|---|---|
| static org.slf4j.Logger | LOG |
| | X_POINT_ID |

| Constructor and Description |
|---|
| JsoupIndexingFilter() |

| Modifier and Type | Method and Description |
|---|---|
| NutchDocument | filter(NutchDocument doc, java.lang.String url, WebPage page) Adds fields or otherwise modifies the document that will be indexed for a parse. |
| Configuration | getConf() |
| java.util.Collection<WebPage.Field> | getFields() |
| void | setConf(Configuration conf) |
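
The method summary above describes a filter that receives a document, a URL, and a page, and returns the document with fields added or modified. The following is a minimal sketch of that contract, not the implementation of JsoupIndexingFilter. So that it compiles without Nutch on the classpath, it uses hypothetical stand-in types (`SketchDocument`, `SketchPage`, `SketchIndexingFilter`, `TitleIndexingFilter`) instead of the real `NutchDocument`, `WebPage`, and `IndexingFilter`. The field names `title` and `url` are assumptions chosen for illustration, as is the reading of `getFields()` as a declaration of the page fields the filter uses.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for NutchDocument: a multi-valued field map.
class SketchDocument {
    private final Map<String, List<String>> fields = new LinkedHashMap<>();

    void add(String name, String value) {
        fields.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
    }

    List<String> get(String name) {
        return fields.getOrDefault(name, Collections.emptyList());
    }
}

// Hypothetical stand-in for WebPage: only carries a title here.
class SketchPage {
    final String title;

    SketchPage(String title) {
        this.title = title;
    }
}

// Hypothetical stand-in mirroring the shape of the methods listed above.
interface SketchIndexingFilter {
    SketchDocument filter(SketchDocument doc, String url, SketchPage page);

    // Assumption: names the page fields the filter reads.
    Collection<String> getFields();
}

// Example filter: copies the page title and URL into the document.
class TitleIndexingFilter implements SketchIndexingFilter {
    @Override
    public SketchDocument filter(SketchDocument doc, String url, SketchPage page) {
        if (page.title != null && !page.title.isEmpty()) {
            doc.add("title", page.title);
        }
        doc.add("url", url);
        return doc; // the same instance, now carrying the extra fields
    }

    @Override
    public Collection<String> getFields() {
        return Collections.singletonList("TITLE");
    }
}

class FilterSketch {
    public static void main(String[] args) {
        SketchDocument doc = new TitleIndexingFilter().filter(
                new SketchDocument(), "http://example.com/", new SketchPage("Example Domain"));
        System.out.println(doc.get("title")); // prints [Example Domain]
    }
}
```

Returning the document instance that was passed in keeps the sketch simple; whether a real filter may also return a different instance, or none at all, is not stated in this page.
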
public NutchDocument filter(NutchDocument doc, java.lang.String url, WebPage page) throws IndexingException

Description copied from interface: IndexingFilter
Adds fields or otherwise modifies the document that will be indexed for a parse.

Specified by: filter in interface IndexingFilter
Parameters:
doc - document instance for collecting fields
url - page url
Throws: IndexingException

public java.util.Collection<WebPage.Field> getFields()

Specified by: getFields in interface FieldPluggable

public Configuration getConf()

Specified by: getConf in interface Configurable

public void setConf(Configuration conf)

Specified by: setConf in interface Configurable

Copyright © 2019 The Apache Software Foundation