Use Data Loader in Astra Portal
Use the Data Loader in the Astra Portal to load data into your database. Bring your own data or use a sample dataset.
-
In the Astra Portal, go to Databases, and click your database’s name.
-
Click Load Data to start the Data Loader.
-
Select an option to load your data:
-
Upload your own dataset: Select a CSV file to load.
CSV files must be less than 40MB, and the column names must not contain spaces. Underscores (
_
) are allowed. -
Load an example dataset: Select one of the two examples given to use as a sample dataset.
-
Load data from DynamoDB: Export your DynamoDB data to an Amazon S3 bucket, and then, in your AWS console, grant read access to the following ARN:
arn:aws:iam::445559476293:role/astra-loader
Your bucket policy should use the following:
{ "Statement": [ { "Action": [ "s3:ListBucket", "s3:GetBucketLocation" ], "Principal": { "AWS": "arn:aws:iam::445559476293:role/astra-loader" }, "Effect": "Allow", "Resource": "arn:aws:s3:::BUCKET_NAME" }, { "Action": [ "s3:GetObject" ], "Principal": { "AWS": "arn:aws:iam::445559476293:role/astra-loader" }, "Effect": "Allow", "Resource": "arn:aws:s3:::BUCKET_NAME/*" } ] }
This bucket policy allows Astra DB automation to pull data from your identified shared S3 bucket, and load the data into Astra DB. You can remove the permission after loading the data.
In the Bucket and Key fields, enter your S3 bucket name and key value. To find the key in the AWS console, go to the S3 subdirectory that contains your exported DynamoDB data, and locate the key on the Properties tab.
-
-
Click Next.
-
Enter a name for the table.
-
In the Data Preview and Types section, select the data type for each column.
The Astra DB Data Loader automatically selects data types for your dataset. If needed, change these selections.
-
In the Keys and Clustering section, select the partition key and clustering column for your data.
-
Click Next.
-
For Target Database and Target Keyspace, select the database and keyspace where you want to load the data.
-
Click Next to initiate data loading. The upload process can take a few minutes.
Next, interact with your data by connecting to your database.