Back to Browse

Elevating Data Quality Standards With Databricks DQX

9.5K views
Jul 7, 2025
38:49

Join us for an introductory session on Databricks DQX, a Python-based framework designed to validate the quality of PySpark DataFrames. Discover how DQX can empower you to proactively tackle data quality challenges, enhance pipeline reliability and make more informed business decisions with confidence. Traditional data quality tools often fall short by providing limited, actionable insights, relying heavily on post-factum monitoring, and being restricted to batch processing. DQX overcomes these limitations by enabling real-time quality checks at the point of data entry, supporting both batch and streaming data validation and delivering granular insights at the row and column level. If you’re seeking a simple yet powerful data quality framework that integrates seamlessly with Databricks, this session is for you. Talk By: Marcin Wojtyczka, Sr. Resident Solutions Architect, Databricks ; Neha Milak, RSA, Databricks Here’s more to explore: Unified and open governance for data and AI: https://www.databricks.com/product/unity-catalog See all the product announcements from Data + AI Summit: https://www.databricks.com/events/dataaisummit-2025-announcements Connect with us: Website: https://databricks.com Twitter: https://twitter.com/databricks LinkedIn: https://www.linkedin.com/company/databricks Instagram: https://www.instagram.com/databricksinc Facebook: https://www.facebook.com/databricksinc

Download

1 formats

Video Formats

360pmp461.8 MB

Right-click 'Download' and select 'Save Link As' if the file opens in a new tab.

Elevating Data Quality Standards With Databricks DQX | NatokHD