In this video I am going to show you how to upload your data to the Databricks platform in a few different ways.
00:00 - Intro
00:43 - Upload local files to Databricks
03:06 - Upload a dataset from Kaggle to Databricks
11:37 - Upload from S3 bucket
Databricks is a unified platform for data, analytics and AI. There are lots of things you can use it for: you can build your data warehouse, run ETL jobs, train AI models, visualise your data and build APIs on top of it, all scalable out of the box.
Databricks Documentation: https://docs.databricks.com
Databricks Free Edition: https://www.databricks.com/learn/free-edition
S3 bucket policy:
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "AllowPublicReadAccess",
      "Effect": "Allow",
      "Principal": "*",
      "Action": "s3:GetObject",
      "Resource": "arn:aws:s3:::your-bucket-name/*"
    }
  ]
}
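If you would rather generate the policy above than edit it by hand, here is a minimal Python sketch. The helper names public_read_policy and public_object_url, the bucket name, the region and the object key are all made up for illustration and are not from the video. Keep in mind that this policy makes every object in the bucket readable by anyone.

```python
import json
from urllib.parse import quote


def public_read_policy(bucket_name: str) -> dict:
    """Build the public-read bucket policy shown above for a given bucket (hypothetical helper)."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "AllowPublicReadAccess",
                "Effect": "Allow",
                "Principal": "*",
                "Action": "s3:GetObject",
                # Applies to every object in the bucket, not the bucket itself.
                "Resource": f"arn:aws:s3:::{bucket_name}/*",
            }
        ],
    }


def public_object_url(bucket_name: str, region: str, key: str) -> str:
    """Return the virtual-hosted-style HTTPS URL of an object (hypothetical helper)."""
    # quote() percent-encodes spaces and other unsafe characters but keeps "/".
    return f"https://{bucket_name}.s3.{region}.amazonaws.com/{quote(key)}"


if __name__ == "__main__":
    # Placeholder bucket, region and key; replace them with your own values.
    print(json.dumps(public_read_policy("my-demo-bucket"), indent=2))
    print(public_object_url("my-demo-bucket", "eu-west-1", "data/sales 2024.csv"))
```

The printed JSON can be pasted into the bucket policy editor in the S3 console. Once the policy is active, a URL like the one printed should be readable from a Databricks notebook, for example by passing it to pandas.read_csv, assuming the file is a CSV.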