Pandera is a popular, lightweight data validation library for Python dataframes. While it started as a pandas dataframe validation library, over the years it added support for Modin, PySpark, Polars, and other backends. That said, if your data lived in a database—as data often does—you needed to extract it from the source system using one of these libraries (if possible) and only then execute your checks.
Learn -from the contributor who made this possible- how the new Ibis backend for Pandera works, enabling you to validate your data efficiently and at scale without ever extracting it from the source system. Since Ibis provides a unified Python dataframe API to 20+ databases and other engines, this one integration extends pandera to work with the execution backend you need.
Download
0 formats
No download links available.
Validate your data anywhere: introducing the Ibis backend for Pandera | NatokHD