Artie Transfer
Search
K
Comment on page

S3

This page will go over how to configure and run our S3 destination.

Overview

To run our S3 connector, you will need to specify the following:
  1. 1.
    Parent bucket
  2. 3.
    Desired output format
  3. 4.
    (Optional) Prefix

Bucket Structure

Artie Transfer will save table data in this particular format:
/{{bucketName}}/{{optionalPrefix}}/{{fullyQualifiedTableName}}/{{YYYY-MM-DD}}
Example: /artie/foo/db.schema.tableName/2023-08-06
Upon each flush, there will be a new file created within this folder, the filename is: {{unix_timestamp}}_{{randomString(4)}}.parquet.gz
  • Unix Timestamp is the latest timestamp of row processed
  • Random string is created to allow parallelism

Typing

Artie Type
Parquet Type
Float
Float
Integer
Integer
Numeric
DECIMAL(p,s)
Boolean
Boolean
String
String
Struct
JSON string
Array<any>
Array<string>
Timestamp
Int64, Unix timestamp (in ms)
Time
String
Date
String
Last modified 3mo ago