public class HTMLLanguageParser extends java.lang.Object implements ParseFilter
| Constructor and Description |
|---|
| HTMLLanguageParser() |
| Modifier and Type | Method and Description |
|---|---|
| Parse | filter(java.lang.String url, WebPage page, Parse parse, HTMLMetaTags metaTags, org.w3c.dom.DocumentFragment doc) Scan the HTML document looking at possible indications of content language: html lang attribute (http://www.w3.org/TR/REC-html40/struct/dirlang.html#h-8.1); meta dc.language (http://dublincore.org/documents/2000/07/16/usageguide/qualified-html.shtml#language); meta http-equiv (content-language) (http://www.w3.org/TR/REC-html40/struct/global.html#h-7.4.4.2) |
| Configuration | getConf() |
| java.util.Collection<WebPage.Field> | getFields() |
| void | setConf(Configuration conf) |
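The three language hints listed for filter can be illustrated with a small standalone sketch. It is not the Nutch implementation: the class `LanguageHintSketch` and its methods `detectLanguage` and `parse` are hypothetical names, it uses only the JDK's DOM parser on well-formed XHTML, and it assumes the order in which the hints are listed is also their priority.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical illustration only; not part of Nutch and not its actual algorithm.
public class LanguageHintSketch {

    /** Returns the first language hint found, or null if the document carries none. */
    public static String detectLanguage(Document doc) {
        // 1. lang attribute on the root <html> element
        Element root = doc.getDocumentElement();
        if (root != null) {
            String lang = root.getAttribute("lang").trim();
            if (!lang.isEmpty()) {
                return lang;
            }
        }
        NodeList metas = doc.getElementsByTagName("meta");
        // 2. <meta name="dc.language" content="...">
        String dublinCore = findMeta(metas, "name", "dc.language");
        if (dublinCore != null) {
            return dublinCore;
        }
        // 3. <meta http-equiv="content-language" content="...">
        return findMeta(metas, "http-equiv", "content-language");
    }

    // Returns the content of the first <meta> whose given attribute matches, ignoring case.
    private static String findMeta(NodeList metas, String attr, String expected) {
        for (int i = 0; i < metas.getLength(); i++) {
            Element meta = (Element) metas.item(i);
            if (expected.equalsIgnoreCase(meta.getAttribute(attr))) {
                String content = meta.getAttribute("content").trim();
                if (!content.isEmpty()) {
                    return content;
                }
            }
        }
        return null;
    }

    /** Parses well-formed XHTML; real-world HTML would need a lenient parser instead. */
    public static Document parse(String xhtml) {
        try {
            return DocumentBuilderFactory.newInstance().newDocumentBuilder()
                .parse(new ByteArrayInputStream(xhtml.getBytes(StandardCharsets.UTF_8)));
        } catch (Exception e) {
            throw new IllegalArgumentException("not well-formed XHTML", e);
        }
    }

    public static void main(String[] args) {
        String page = "<html><head>"
            + "<meta http-equiv=\"content-language\" content=\"fr\"/>"
            + "<meta name=\"dc.language\" content=\"de\"/>"
            + "</head><body/></html>";
        // No lang attribute, so the dc.language hint wins over http-equiv.
        System.out.println(detectLanguage(parse(page))); // prints de
    }
}
```

The sketch returns the raw attribute value; any normalization of values such as `en-US` is left out.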
public Parse filter(java.lang.String url, WebPage page, Parse parse, HTMLMetaTags metaTags, org.w3c.dom.DocumentFragment doc)
filter in interface ParseFilter

public void setConf(Configuration conf)
setConf in interface Configurable

public Configuration getConf()
getConf in interface Configurable

public java.util.Collection<WebPage.Field> getFields()
getFields in interface FieldPluggable

Copyright © 2019 The Apache Software Foundation