Complement Naive Bayes is a supervised machine learning algorithm, used in this example for a classification task. It is a modification of Multinomial Naive Bayes that works well on imbalanced datasets.
I used the ham_spam.csv dataset for this example; it is available in the repository. It contains two types of emails: ham and spam.
GitHub address: https://github.com/randomaccess2023/MG2023/tree/main/Video%2067
Important timestamps:
01:01 - Import required libraries
02:39 - Load ham_spam dataset
03:53 - Drop unnecessary columns
06:53 - Apply preprocessing
07:47 - Separate features and labels
08:09 - Split the dataset
09:42 - Apply Vectorization and Complement Naive Bayes
13:03 - Plot confusion_matrix
17:42 - Print classification_report
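
For anyone who wants to try the steps above before opening the notebook, here is a rough sketch of the same flow in scikit-learn. It is not the code from the video: the tiny inline dataset is made up so the snippet runs without ham_spam.csv, CountVectorizer is my assumption for the vectorization step, and the confusion matrix is printed instead of plotted.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.metrics import classification_report, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import ComplementNB

# Hypothetical, deliberately imbalanced toy data (12 ham, 4 spam);
# it only stands in for the real ham_spam.csv file.
ham = [
    "are we still meeting for lunch today",
    "please review the attached report",
    "see you at the gym tonight",
    "can you send me the notes from class",
    "happy birthday hope you have a great day",
    "the meeting is moved to friday",
    "thanks for your help yesterday",
    "call me when you get home",
    "dinner at our place on sunday",
    "here is the agenda for tomorrow",
    "did you finish the homework",
    "let me know when the train arrives",
]
spam = [
    "win a free prize now click here",
    "free cash offer claim your prize today",
    "urgent you won a free lottery click now",
    "cheap loans click here for a free offer",
]
texts = ham + spam
labels = ["ham"] * len(ham) + ["spam"] * len(spam)

# Split the dataset; stratify keeps the ham/spam ratio in both parts.
X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.25, stratify=labels, random_state=42
)

# Vectorization: fit the vocabulary on the training text only.
vectorizer = CountVectorizer()
X_train_vec = vectorizer.fit_transform(X_train)
X_test_vec = vectorizer.transform(X_test)

# Train Complement Naive Bayes and predict on the held-out emails.
model = ComplementNB()
model.fit(X_train_vec, y_train)
y_pred = model.predict(X_test_vec)

# Rows are true labels, columns are predicted labels, in the order given.
cm = confusion_matrix(y_test, y_pred, labels=["ham", "spam"])
print(cm)
print(classification_report(y_test, y_pred, labels=["ham", "spam"], zero_division=0))
```

To run it on the real data, replace the toy lists with the text and label columns loaded from ham_spam.csv (the column names depend on the file, so check the notebook in the repository).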
#datascience #pythonprogramming #python #complementnaivebayes #naivebayes #jupyternotebook #jupyter #machinelearning #hamspamdataset