public class OrphanScoringFilter extends AbstractScoringFilter
| Modifier and Type | Field and Description |
|---|---|
| static Text | ORPHAN_KEY_WRITABLE |

Inherited fields: X_POINT_ID

| Constructor and Description |
|---|
| OrphanScoringFilter() |

| Modifier and Type | Method and Description |
|---|---|
| void | orphanedScore(Text url, CrawlDatum datum): This method may change the score or status of CrawlDatum during CrawlDb update, when the URL is neither fetched nor has any inlinks. |
| void | setConf(Configuration conf) |
| void | updateDbScore(Text url, CrawlDatum old, CrawlDatum datum, List<CrawlDatum> inlinks): Used for orphan control. |

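The two methods above split the orphan-control work: `updateDbScore` sees the inlinks during a CrawlDb update, while `orphanedScore` is called when a URL is neither fetched nor linked to. The following is a minimal, self-contained sketch of one plausible policy built on that split. It does not use the Nutch classes: `Record`, `Status`, the two thresholds, and the method bodies are hypothetical stand-ins, and the actual filter may behave differently.

```java
import java.util.List;

/** Illustrative stand-in only; none of these types are Nutch classes. */
final class OrphanSketch {

    /** Hypothetical, reduced set of record states. */
    enum Status { FETCHED, GONE, ORPHAN }

    /** Hypothetical, simplified record; the real CrawlDatum API differs. */
    static final class Record {
        Status status = Status.FETCHED;
        long lastInlinkSeconds; // time at which an inlink was last seen

        Record(long lastInlinkSeconds) {
            this.lastInlinkSeconds = lastInlinkSeconds;
        }
    }

    // Assumed thresholds in seconds, chosen for illustration only.
    static final long GONE_AFTER = 30L * 24 * 3600;
    static final long ORPHAN_AFTER = 40L * 24 * 3600;

    /** Plays the role of updateDbScore: remember when inlinks were last seen. */
    static void updateDbScore(Record datum, List<String> inlinks, long nowSeconds) {
        if (!inlinks.isEmpty()) {
            datum.lastInlinkSeconds = nowSeconds;
        }
    }

    /** Plays the role of orphanedScore: may change the status of a record without inlinks. */
    static void orphanedScore(Record datum, long nowSeconds) {
        long age = nowSeconds - datum.lastInlinkSeconds;
        if (age > ORPHAN_AFTER) {
            datum.status = Status.ORPHAN;
        } else if (age > GONE_AFTER) {
            datum.status = Status.GONE;
        }
    }

    public static void main(String[] args) {
        Record page = new Record(0L);

        // An inlink is seen on day 1, so the timestamp is refreshed.
        updateDbScore(page, List.of("https://example.org/"), 24L * 3600);

        // No inlinks afterwards: check the record on day 35 and day 45.
        orphanedScore(page, 35L * 24 * 3600);
        System.out.println("day 35: " + page.status);
        orphanedScore(page, 45L * 24 * 3600);
        System.out.println("day 45: " + page.status);
    }
}
```

Keeping the timestamp update and the status change in separate methods mirrors the method summary: only the first one receives the inlink list, so the second has to rely on state recorded earlier.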
Methods inherited from class AbstractScoringFilter: distributeScoreToOutlinks, generatorSortValue, getConf, indexerScore, initialScore, injectedScore, passScoreAfterParsing, passScoreBeforeParsing

public static Text ORPHAN_KEY_WRITABLE

public void setConf(Configuration conf)

Specified by: setConf in interface Configurable
Overrides: setConf in class AbstractScoringFilter

public void updateDbScore(Text url, CrawlDatum old, CrawlDatum datum, List<CrawlDatum> inlinks) throws ScoringFilterException

Specified by: updateDbScore in interface ScoringFilter
Overrides: updateDbScore in class AbstractScoringFilter

Parameters:
url - URL of the record
old - old CrawlDatum
datum - new CrawlDatum
inlinks - list of inlinked CrawlDatums

Throws: ScoringFilterException

public void orphanedScore(Text url, CrawlDatum datum)

Description copied from interface: ScoringFilter

Parameters:
url - URL of the page
datum - CrawlDatum for page

Copyright © 2021 The Apache Software Foundation