public class StaticFieldIndexer extends Object implements IndexingFilter
| Constructor and Description |
|---|
| StaticFieldIndexer() |
| Modifier and Type | Method and Description |
|---|---|
| NutchDocument | filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks): The StaticFieldIndexer filter object which adds fields as per configuration setting. |
| Configuration | getConf(): Get the Configuration object |
| protected String | regexEscape(String in): Escapes any character that needs escaping so it can be used in a regexp. |
| void | setConf(Configuration conf): Set the Configuration object |
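The summary of regexEscape(String in) says only that it escapes characters so a string can be used in a regexp. The following is a minimal, self-contained sketch of that general technique and not the Nutch implementation: the class name RegexEscapeSketch, the META character set, and the sample inputs are all assumptions made for illustration.

```java
import java.util.regex.Pattern;

// Hypothetical stand-in illustrating regex escaping; not the Nutch source.
class RegexEscapeSketch {
    // Assumed set of characters that carry special meaning in java.util.regex.
    private static final String META = "\\.[]{}()<>*+-=!?^$|";

    // Prefixes every metacharacter in the input with a backslash.
    static String regexEscape(String in) {
        StringBuilder out = new StringBuilder(in.length() * 2);
        for (int i = 0; i < in.length(); i++) {
            char c = in.charAt(i);
            if (META.indexOf(c) >= 0) {
                out.append('\\');
            }
            out.append(c);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // An unescaped "|" would be read as regex alternation, not as a literal.
        String escaped = regexEscape("|");
        String[] parts = "a|b|c".split(escaped);
        System.out.println(parts.length);

        // The escaped form matches the original text literally.
        System.out.println(Pattern.matches(regexEscape("1+1=2?"), "1+1=2?"));
    }
}
```

When a whole string should be treated literally, the JDK's Pattern.quote(String) is a standard alternative to character-by-character escaping.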
public NutchDocument filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks) throws IndexingException
The StaticFieldIndexer filter object which adds fields as per configuration setting. See index.static in nutch-default.xml.
Specified by: filter in interface IndexingFilter
Parameters:
doc - The NutchDocument object
parse - The relevant Parse object passing through the filter
url - URL to be filtered for anchor text
datum - The CrawlDatum entry
inlinks - The Inlinks containing anchor text
Throws: IndexingException

public void setConf(Configuration conf)
Set the Configuration object
Specified by: setConf in interface Configurable

public Configuration getConf()
Get the Configuration object
Specified by: getConf in interface Configurable

Copyright © 2021 The Apache Software Foundation