Step 1. In the Data Source section, click on the Import Data tile to start setting up the connector.
Step 2. The Import Data Source window will pop up.
Enter the Connection Name and select a data category from the dropdown list or create a new category. Data Category is very useful for organizing your data.
Step 3. Click on Data Connectors and select S3.
The next screen will show the existing connectors and an option to create a new one. Click on Add New Account to proceed.
Step 4. Select Access Key from the Access Type drop-down list. Fill in the following pop-up with the appropriate information. The integration name is for the connection in the Accern Platform, the bucket is where you would like to pull or push data, the region is where your account is hosted, the Secret Access Key and Access Key are credentials you should have for your S3 account. Click Next to proceed.
Step 5. On this window, fill out the Data Store Tile name which will appear on the UI, define the folder path in the S3 bucket, select the file format (must match the file format that resides in the selected S3 bucket).
If you choose CSV or JSONL format, make sure to map the fields on the file to the required fields on the platform (Document ID, Document Title, Document Content, Date of Publication, Date of Collection). Click Add to complete the data connector creation process. A tile will be generated in the Delivery section. This can be toggled on or off for your use cases.
*** For the "Source Folder Path" field, please exclude the bucket name and only enter the path to the desired destination folder. For example, if this is the S3 URL of the folder: s3://client-historical/accern-text-data/bucket/, you'll only need to use accern-text-data/bucket