REST API Seeds configuration guidelines can be found here.

For Amazon S3 Connector seeds, "type" must always be set to "s3".

The "seed" value is the path to be crawled. It can be a bucket, a folder, or a file.

Properties specific to the Amazon S3 Connector Seeds are described below.


Amazon S3 Seed Properties


| Field | Required | Default | Multiple | Notes | Example |
|---|---|---|---|---|---|
| properties | No | - | No | Configuration object | |
| seed | Yes | - | No | The path to be crawled. It can be a bucket, folder or file. | "/myBucket/myFolder" |
| processSplitFiles | No | false | No | If set to true, files that are split are treated as a single document instead of multiple documents. | true / false |
| splitPatterns | No | [] | Yes | List of regular expressions to match folders that contain split files. | [".*splitted.*",".*split.*"] |
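
To illustrate how the example values for splitPatterns behave as regular expressions, the following Python sketch checks a few folder paths against them. It assumes that a pattern must match the entire folder path; the connector's actual matching semantics are not stated here, and the helper name `is_split_folder` and the sample paths are hypothetical.

```python
import re

# Example patterns copied from the splitPatterns row of the properties table.
SPLIT_PATTERNS = [".*splitted.*", ".*split.*"]


def is_split_folder(folder_path, patterns=SPLIT_PATTERNS):
    """Return True if any pattern matches the whole folder path.

    Full-path matching is an assumption made for this sketch, not a
    documented behavior of the connector.
    """
    return any(re.fullmatch(p, folder_path) is not None for p in patterns)


# Hypothetical folder paths used only for illustration.
print(is_split_folder("/myBucket/myFolder/report_split"))
print(is_split_folder("/myBucket/myFolder/reports"))
```

Note that ".*split.*" already matches every path that ".*splitted.*" matches, so under this assumption the second pattern alone would be sufficient.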

Usage example


POST /aspire/_api/seeds
{
	"seed": "/myBucket/myFolder",
	"type": "s3",
	"description": "Amazon S3 Seed Test",
	"connector": "187ae173-32c2-48fe-b4fe-1b6eea02f166",
	"connection": "8f3233a9-4c8d-4520-b335-ea1d9433720e",
	"workflows": [],
	"tags": [],
	"properties": {
		"seed": "/myBucket/myFolder",
		"processSplitFiles": true,
		"splitPatterns": []
	}
}
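
The same request can be assembled programmatically. The sketch below builds the JSON body shown in the usage example and sends it with Python's standard library. The function names `build_s3_seed` and `post_seed`, the base URL, and the absence of authentication are assumptions made for illustration; adjust them to your Aspire deployment.

```python
import json
import urllib.request


def build_s3_seed(path, connector_id, connection_id, description="",
                  process_split_files=False, split_patterns=None):
    """Build a seed body with the same fields as the usage example."""
    return {
        "seed": path,
        "type": "s3",  # must always be "s3" for this connector
        "description": description,
        "connector": connector_id,
        "connection": connection_id,
        "workflows": [],
        "tags": [],
        "properties": {
            "seed": path,
            "processSplitFiles": process_split_files,
            "splitPatterns": list(split_patterns or []),
        },
    }


def post_seed(base_url, payload):
    """POST the seed to /aspire/_api/seeds and return the raw response text.

    base_url is a placeholder for your server; any authentication your
    server requires is not handled in this sketch.
    """
    request = urllib.request.Request(
        url=base_url.rstrip("/") + "/aspire/_api/seeds",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

A call might look like `post_seed("http://localhost:50505", build_s3_seed("/myBucket/myFolder", "187ae173-32c2-48fe-b4fe-1b6eea02f166", "8f3233a9-4c8d-4520-b335-ea1d9433720e", description="Amazon S3 Seed Test", process_split_files=True))`, where the host and port are placeholders.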


