public class NodeDumper extends Configured implements Tool
| Modifier and Type | Class and Description |
|---|---|
static class |
NodeDumper.Dumper
Outputs the hosts or domains with an associated value.
|
static class |
NodeDumper.Sorter
Outputs the top urls sorted in descending order.
|
| Constructor and Description |
|---|
NodeDumper() |
| Modifier and Type | Method and Description |
|---|---|
void |
dumpNodes(Path webGraphDb,
org.apache.nutch.scoring.webgraph.NodeDumper.DumpType type,
long topN,
Path output,
boolean asEff,
org.apache.nutch.scoring.webgraph.NodeDumper.NameType nameType,
org.apache.nutch.scoring.webgraph.NodeDumper.AggrType aggrType,
boolean asSequenceFile)
Runs the process to dump the top urls out to a text file.
|
static void |
main(String[] args) |
int |
run(String[] args)
Runs the node dumper tool.
|
getConf, setConfclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitgetConf, setConfpublic void dumpNodes(Path webGraphDb, org.apache.nutch.scoring.webgraph.NodeDumper.DumpType type, long topN, Path output, boolean asEff, org.apache.nutch.scoring.webgraph.NodeDumper.NameType nameType, org.apache.nutch.scoring.webgraph.NodeDumper.AggrType aggrType, boolean asSequenceFile) throws Exception
webGraphDb - The WebGraph from which to pull values.topN - output - IOException - If an error occurs while dumping the top values.ExceptionCopyright © 2021 The Apache Software Foundation