Spam or Ham Classifier

Created: Nov 18, 2022 10:55 AM
Associated with: University of California, Berkeley
Class: DATA 100: Principles and Techniques in Data Science.

  • Built an email classifier that labels messages as spam or ham (not spam). The baseline model reached an accuracy of 0.85. After using cross-validation for feature and model selection, and taking steps to prevent overfitting, the final model reached an accuracy of 0.92.

    Learnings:

    • Exploratory data analysis (EDA) techniques
    • Feature engineering techniques
    • Modeling
    • Evaluating a logistic regression model
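
The workflow described above (engineer features, fit a logistic regression, estimate accuracy with cross-validation) can be sketched in plain Python. This is only an illustrative sketch, not the project's actual code: the toy emails, the `KEYWORDS` list, the hyperparameters, and all function names are hypothetical, and the accuracies it prints are unrelated to the 0.85 and 0.92 figures reported for the real model.

```python
import math
import random

# Hypothetical toy data (not the course dataset): 1 = spam, 0 = ham.
EMAILS = [
    ("win free money now", 1), ("free prize claim now", 1),
    ("cheap offer click here", 1), ("click to win a free offer", 1),
    ("claim your money prize", 1), ("urgent offer free money", 1),
    ("meeting notes for monday", 0), ("lunch at noon tomorrow", 0),
    ("project report attached", 0), ("see you at the meeting", 0),
    ("notes from the project call", 0), ("report due tomorrow at noon", 0),
]

# Hypothetical hand-picked keyword features (assumed for illustration).
KEYWORDS = ["free", "money", "offer", "click", "prize", "meeting", "report"]


def featurize(text):
    """Bias term followed by one 0/1 indicator per keyword."""
    words = set(text.lower().split())
    return [1.0] + [1.0 if k in words else 0.0 for k in KEYWORDS]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train(X, y, lr=0.5, epochs=500, l2=0.01):
    """Batch gradient descent on L2-regularized log loss.

    For simplicity the L2 penalty is applied to the bias weight as well.
    """
    w = [0.0] * len(X[0])
    n = len(X)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi))) - yi
            for j, xj in enumerate(xi):
                grad[j] += err * xj
        w = [wj - lr * (gj / n + l2 * wj) for wj, gj in zip(w, grad)]
    return w


def predict(w, text):
    """Return 1 (spam) if the predicted probability is at least 0.5."""
    z = sum(wj * xj for wj, xj in zip(w, featurize(text)))
    return 1 if sigmoid(z) >= 0.5 else 0


def accuracy(w, data):
    return sum(predict(w, t) == label for t, label in data) / len(data)


def cross_val_accuracy(data, k=4, seed=0):
    """Mean held-out accuracy over k shuffled folds."""
    idx = list(range(len(data)))
    random.Random(seed).shuffle(idx)
    scores = []
    for fold in (idx[i::k] for i in range(k)):
        held_out = set(fold)
        train_set = [data[i] for i in idx if i not in held_out]
        w = train([featurize(t) for t, _ in train_set],
                  [label for _, label in train_set])
        scores.append(accuracy(w, [data[i] for i in fold]))
    return sum(scores) / k


weights = train([featurize(t) for t, _ in EMAILS],
                [label for _, label in EMAILS])
cv_score = cross_val_accuracy(EMAILS)
print("training accuracy:", accuracy(weights, EMAILS))
print("cross-validated accuracy:", cv_score)
```

Comparing training accuracy with the cross-validated score is one simple way to spot overfitting: a large gap suggests the features or model fit the training emails too closely. In practice a library implementation such as scikit-learn's `LogisticRegression` would likely replace the hand-written `train` function.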