| Package | Description |
|---|---|
| org.apache.nutch.crawl | Crawl control code and tools to run the crawler. |
| org.apache.nutch.fetcher | The Nutch robot. |
| org.apache.nutch.indexer | Index content, configure and run indexing and cleaning jobs to add, update, and delete documents from an index. |
| org.apache.nutch.metadata | A Multi-valued Metadata container, and set of constant fields for Nutch Metadata. |
| org.apache.nutch.net | Web-related interfaces: URL filters and normalizers. |
| org.apache.nutch.parse | The Parse interface and related classes. |
| org.apache.nutch.protocol | Classes related to the Protocol interface, see also org.apache.nutch.net.protocols. |
| org.apache.nutch.service.impl | |
| org.apache.nutch.tools | Miscellaneous tools. |
| org.apache.nutch.util | Miscellaneous utility classes. |

| Class and Description |
|---|
| AbstractChecker: Scaffolding class for the various Checker implementations. |
| GenericWritableConfigurable: A generic Writable wrapper that can inject Configuration to Configurables |
| NutchTool |

| Class and Description |
|---|
| NutchTool |

| Class and Description |
|---|
| AbstractChecker: Scaffolding class for the various Checker implementations. |
| NutchTool |

| Class and Description |
|---|
| GenericWritableConfigurable: A generic Writable wrapper that can inject Configuration to Configurables |

| Class and Description |
|---|
| AbstractChecker: Scaffolding class for the various Checker implementations. |

| Class and Description |
|---|
| AbstractChecker: Scaffolding class for the various Checker implementations. |
| NutchTool |

| Class and Description |
|---|
| MimeUtil: This is a facade class to insulate Nutch from its underlying Mime Type substrate library, Apache Tika. |

| Class and Description |
|---|
| NutchTool |

| Class and Description |
|---|
| NutchTool |

| Class and Description |
|---|
| ObjectCache |
| TrieStringMatcher: TrieStringMatcher is a base class for simple tree-based string matching. |
| TrieStringMatcher.TrieNode: Node class for the character tree. |

Copyright © 2021 The Apache Software Foundation