Welcome to our Dask Tutorial with @the-data-queerie from Coiled. Learn more at coiled.io
In this module, Richard will cover topics that include:
- What is parallel computing?
- Why do we need parallel computing?
- Do I need to use Dask?
- When should I use Dask?
- Which types of datasets work best with Dask?
- Why are data size limitations important?
- How can I avoid memory errors when crunching data?
- Why is pandas limited?
_______
What is Dask?
Dask is a free and open-source library for parallel computing in Python. Dask is a community project maintained by developers and organizations.
What is Coiled?
Coiled is a company built around Dask. With Coiled, you can scale Dask in your own cloud.
coiled.io