This video discusses how to apply preprocessing in practice. It covers pitfalls such as data leakage and data distortion, and how we can avoid them by using the fit-predict paradigm. Next, we look at how machine learning pipelines are implemented in scikit-learn, and how to use them for training and model selection. Or should we say pipeline selection?