web pages,
host metadata) of data in abstracted storage.
| Class | Description |
|---|---|
| Host | Host represents a store of webpages or other data which resides on a server or other computer so that it can be accessed over the Internet. |
| Host.Builder | RecordBuilder for Host instances. |
| Host.Tombstone | |
| ParseStatus | A nested container representing parse status data captured from the invocation of parsers on fetch of a WebPage. |
| ParseStatus.Builder | RecordBuilder for ParseStatus instances. |
| ParseStatus.Tombstone | |
| ProtocolStatus | A nested container representing data captured from web server responses. |
| ProtocolStatus.Builder | RecordBuilder for ProtocolStatus instances. |
| ProtocolStatus.Tombstone | |
| StorageUtils | Entry point to Gora store/mapreduce functionality. |
| WebPage | WebPage is the primary data structure in Nutch, representing crawl data for a given WebPage at some point in time. |
| WebPage.Builder | RecordBuilder for WebPage instances. |
| WebPage.Tombstone | |
| WebTableCreator | |

| Enum | Description |
|---|---|
| Host.Field | Enum containing all of the data bean's fields. |
| Mark | |
| ParseStatus.Field | Enum containing all of the data bean's fields. |
| ProtocolStatus.Field | Enum containing all of the data bean's fields. |
| WebPage.Field | Enum containing all of the data bean's fields. |
Copyright © 2019 The Apache Software Foundation