Manages the items that needs to be processed by the workflow, these items may or may not be sent to scanned.
Fields:
Field Name | Example | Description |
---|---|---|
_id | C:\test-folder\folderA\testDocument.txt | The unique id of the document |
metadata | [depends on each connector] | The necessary metadata fields the connector needs to fetch or populate this document |
type | [depends on each connector] | The serialized version of the ItemType of the document |
status | C, P or A | The document processing status: C: Completed, means it have been already processed P: in Progress, means it is currently been processed A: Available, means it is available for been processed |
action | add, update, delete | The action to be performed to the search engine for the document |
timestamp | 1465334398471 | The time-stamp when this document was added to the queue |
signature | CBEC1210FE2D51A8166C3E70D38F8A07 | An MD5 signature, when a document changes this signature should also change |
parentId | C:\test-folder\folderA | The id of the parent document, in other words the document that scanned the current document |
processor | File_System-192.168.1.15:50505 | The identifier of the Aspire server that processed or is processing the current document |
shouldScan | false | Determines whether or not this document should be considered for scanning |
shouldProcess | true | Determines whether or not this document should be considered for being processed by the workflow |
retries | 0 | The number of times this document has been retried |
name | testDocument.txt | The name of this document |
isCrawlRootItem | false | Indicates if this is one of the root crawl items (for internal control) |
hierarchyId | C:\test-folder\folderA\testDocument.txt | Unique Id for using to generate the hierarchy for this document, it may be different from the _id field |