In this video, we are going to see how can we preprocess out dataset for Machine Learning Tasks using the Titanic dataset
We are going to deal will null or missing values in the dataset. We are also going to deal with categorical features and apply one hot encoding. We are also going to see how to select columns for our training and testing. We are going to drop unnecessary columns and rows. We are also going to learn how to normalize or standardize our columns and why to normalize in the first place.
Link for the dataset:
https://raw.githubusercontent.com/agconti/kaggle-titanic/master/data/train.csv
Code link:
https://github.com/DatumLearning/Machine-Learning-Playlist/blob/main/1_titanic.ipynb
Download
0 formats
No download links available.
P1 - Data Processing in Machine Learning | null (missing) values, one hot encoding. Titanic Dataset. | NatokHD