Databricks Lakeflow Declarative Data Pipeline Course
Data pipeline with Databricks using LakeFlow Declarative Pipelines with a realistic sales data use case. This step-by-step project demonstrates how to design, implement, and automate a modern Lakehouse architecture that supports batch and streaming data, ML/AI integration, and production dashboards. What you will learn in this tutorial: Unity Catalog setup for governance (metastore, catalog, schema, permissions) Creating synthetic sales datasets and inserting them into Delta tables (no external CSV imports) Designing a Bronze → Silver → Gold Medallion Architecture with LakeFlow Declarative Pipelines Applying data transformations, joins, and unions across multiple Delta tables Using AutoLoader for incremental ingestion and simulating real-time data appends Implementing Change Data Capture (CDC) and Materialized Views Building Streaming Tables for continuous pipeline updates Training an AutoML sales prediction model with AutoGluon Managing MLflow model deployment for batch and real-time inference Automating workflows with LakeFlow Jobs orchestration Creating Databricks SQL dashboards for reporting and analytics Why this project matters: This project is designed as a professional Databricks Lakehouse template for data engineers, data scientists, and AI engineers. By following along, you’ll learn how to combine: Data engineering (pipelines, transformations, CDC, orchestration) Data science & ML (AutoML with AutoGluon, MLflow deployment) MLOps best practices (model scoring, monitoring, real-time inference) Business intelligence (SQL dashboards for sales insights) This portfolio-ready project demonstrates your ability to deliver enterprise-grade data solutions using Databricks LakeFlow, Delta Lake, Unity Catalog, and MLflow — all in one workflow. This is not just theory — it’s a real JUMIA sales pipeline case study, giving you skills you can apply immediately in data engineering, analytics, and MLOps. My Portfolio: https://benjaminuka.streamlit.app All codes and queries GitHub: https://github.com/uka-ben/sales-pipeline--databricks-declarative-pipeline/tree/main LinkedIn: https://www.linkedin.com/in/benjamin-uka-imo Don’t forget to LIKE 👍, SUBSCRIBE and SHARE this video to help others learn modern pipelines with Databricks LakeFlow. #databricks #sql #python #data #ai
Download
0 formatsNo download links available.