public class DeduplicationJob extends NutchTool implements Tool
| Modifier and Type | Class and Description |
|---|---|
static class |
DeduplicationJob.DBFilter |
static class |
DeduplicationJob.DedupReducer |
static class |
DeduplicationJob.StatusUpdateReducer
Combine multiple new entries for a url.
|
currentJob, currentJobNum, numJobs, results, status| Constructor and Description |
|---|
DeduplicationJob() |
| Modifier and Type | Method and Description |
|---|---|
static void |
main(String[] args) |
Map<String,Object> |
run(Map<String,Object> args,
String crawlId)
Runs the tool, using a map of arguments.
|
int |
run(String[] args) |
getProgress, getStatus, killJob, stopJobgetConf, setConfclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitgetConf, setConfpublic int run(String[] args) throws IOException
run in interface ToolIOExceptionCopyright © 2021 The Apache Software Foundation