Step 1. In the Data Source section, click on the Import Data tile to start setting up the connector.
Step 2. The Import Data Source window will pop up. Enter the Connection Name and select a data category from the dropdown list or create a new category. Data Category is very useful for organizing your data.
Step 3. Click on Data Connectors and select S3.
The next screen will show the existing connectors and an option to create a new one. Click on Add New Account to proceed.
Step 4. Fill out the Integration and Bucket name where you want to import the data from. Click on the "link" in "Please use this link to create cross-account role".
Step 5. After clicking the link, you will be able to access the Quick Create Stack template on the S3 UI that has been pre-built by us to help you create the ARN role faster. The ExternalID and OtherAccountNumber are automatically generated, you only need to define the S3 bucket name, which you want to grant the Accern Platform access to. Click Create Stack to proceed.
Accern External-ID: 076e3e0e-de98-4881-8a76-6a4e695bb504
(Please be advised that when you use this pre-built template to create the role, you're granting the Accern platform the ListObjectsInBucket and AllObjectsAction permissions. Let us know if you wish to customize these permissions.)
Step 6. An ARN role for Accern to access the specified S3 bucket has now been generated. Copy the entire role information under the "Value" column and paste it in the "Role ARN" field on Accern Platform UI. Click Next to proceed.
Copy the entire role information under the "Value" column and paste it in the "Role ARN" field on Accern Platform UI. Click Next to proceed.
Step 7. On this window, fill out the Data Store Tile name which will appear on the UI, define the folder path in the S3 bucket, select the file format (must match the file format that resides in the selected S3 bucket). If you choose CSV or JSONL format, make sure to map the fields on the file to the required fields on the platform (Document ID, Document Title, Document Content, Date of Publication, Date of Collection). Click Add to complete the data connector creation process. A tile will be generated in the Delivery section. This can be toggled on or off for your use cases.
*** For the "Source Folder Path" field, please exclude the bucket name and only enter the path to the desired destination folder. For example, if this is the S3 URL of the folder: s3://client-historical/accern-text-data/bucket/, you'll only need to use accern-text-data/bucket