S3
This page explains how to configure and run our S3 destination.
Overview
To run our S3 connector, you will need to specify the following:
Parent bucket
Desired output format
(Optional) Prefix
Bucket Structure
Artie Transfer will save table data in this particular format:
On each flush, a new file is created within this folder. The filename is: {{unix_timestamp}}_{{randomString(4)}}.parquet.gz
The Unix timestamp is the latest timestamp among the rows processed in that flush
The random string keeps filenames distinct, which allows files to be written in parallel
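The naming scheme above can be sketched in a few lines of Python. This is an illustration only, not Artie Transfer's actual implementation: the function name `flush_filename`, the lowercase alphanumeric alphabet, and the unit of the timestamp passed in are all assumptions.

```python
import secrets
import string

# Assumed alphabet for the random suffix; the real character set is not documented here.
SUFFIX_ALPHABET = string.ascii_lowercase + string.digits


def flush_filename(latest_row_timestamp: int, suffix_length: int = 4) -> str:
    """Build a name shaped like {{unix_timestamp}}_{{randomString(4)}}.parquet.gz.

    latest_row_timestamp is the latest timestamp among the rows in the flush,
    passed in as an integer (hypothetical parameter; unit left to the caller).
    """
    suffix = "".join(secrets.choice(SUFFIX_ALPHABET) for _ in range(suffix_length))
    return f"{latest_row_timestamp}_{suffix}.parquet.gz"


if __name__ == "__main__":
    # Two flushes with the same latest timestamp will almost always get different names.
    print(flush_filename(1700000000))
    print(flush_filename(1700000000))
```

Because the suffix is random, two writers that flush rows with the same latest timestamp are unlikely to produce the same object key, which is what makes parallel writes practical.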
Creating a service account
Typing
Artie Type | Parquet Type |
---|---|
Float | Float |
Integer | Integer |
Numeric | DECIMAL(p,s) |
Boolean | Boolean |
String | String |
Struct | JSON string |
Array<any> | Array<string> |
Timestamp | Int64, Unix timestamp (in ms) |
Time | String |
Date | String |
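
To make the table concrete, here is a rough sketch of how Python values could be converted to the representations listed above before being written out. It uses only the standard library and is not Artie Transfer's code. The helper name `to_parquet_value`, the ISO 8601 string format for dates and times, the treatment of naive datetimes as UTC, and the way array elements are stringified are all assumptions made for illustration.

```python
import json
from datetime import date, datetime, time, timedelta, timezone
from decimal import Decimal

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def to_parquet_value(value):
    """Convert a Python value to the representation shown in the typing table."""
    # Float, Integer, Numeric, Boolean and String keep their native form.
    if isinstance(value, (bool, int, float, str, Decimal)):
        return value
    # Struct -> JSON string.
    if isinstance(value, dict):
        return json.dumps(value)
    # Array<any> -> Array<string>; strings are kept as-is, other elements are
    # JSON-encoded (assumed stringification rule).
    if isinstance(value, (list, tuple)):
        return [v if isinstance(v, str) else json.dumps(v, default=str) for v in value]
    # Timestamp -> Int64 Unix timestamp in milliseconds.
    # Checked before date because datetime is a subclass of date.
    if isinstance(value, datetime):
        if value.tzinfo is None:
            value = value.replace(tzinfo=timezone.utc)  # assumption: naive means UTC
        return (value - EPOCH) // timedelta(milliseconds=1)
    # Date and Time -> String (ISO 8601 format is an assumption).
    if isinstance(value, (date, time)):
        return value.isoformat()
    raise TypeError(f"unsupported type: {type(value).__name__}")


if __name__ == "__main__":
    row = {
        "profile": {"plan": "pro"},
        "tags": [1, "x"],
        "created_at": datetime(2024, 1, 1, tzinfo=timezone.utc),
        "birthday": date(2024, 1, 2),
    }
    print({column: to_parquet_value(v) for column, v in row.items()})
```

Integer arithmetic on `timedelta` is used for the millisecond conversion to avoid floating-point rounding on timestamps with sub-second precision.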