Features

Some of the features of the HBase connector include:

Content Retrieved

The HBase connector retrieves the content as stored in the objectData field of the table in the HBase server.

Due to API limitations, the HBase connector is limited to the following fields:

id: MD5 id of the document.

humanName: The document id in a human-readable form.

createdTimestamp: The timestamp of when the document was created.

updatedTimestamp: The timestamp of when the document was last updated.

crawlTimestamp: The timestamp of when the document was crawled.

objectData: The AspireObject object in JSON format that has the content of the document.

binaryFilepath: The path of the document binary file.
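To make the field list concrete, here is a minimal Python sketch of how a consumer might model one retrieved row. It is illustrative only and is not part of the connector: the `HBaseDocument` class and `parse_row` function are hypothetical names, the row is assumed to arrive as a plain mapping of field name to string, and timestamps are kept as strings because their format is not specified here.

```python
import json
from dataclasses import dataclass
from typing import Any, Dict


@dataclass
class HBaseDocument:
    """Hypothetical container for the fields listed above (not an Aspire class)."""
    id: str                      # MD5 id of the document
    human_name: str              # document id in human-readable form
    created_timestamp: str       # format not specified, kept as a string
    updated_timestamp: str
    crawl_timestamp: str
    object_data: Dict[str, Any]  # decoded from the JSON in objectData
    binary_filepath: str


def parse_row(row: Dict[str, str]) -> HBaseDocument:
    """Build an HBaseDocument from a field-name -> string-value mapping.

    Assumes every field is present; a missing field raises KeyError.
    """
    return HBaseDocument(
        id=row["id"],
        human_name=row["humanName"],
        created_timestamp=row["createdTimestamp"],
        updated_timestamp=row["updatedTimestamp"],
        crawl_timestamp=row["crawlTimestamp"],
        object_data=json.loads(row["objectData"]),
        binary_filepath=row["binaryFilepath"],
    )
```

Decoding objectData with `json.loads` gives access to the document content as ordinary dictionaries and lists; the other fields are passed through unchanged.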