Recently Published
Milestone #4
Fall 2025 PH251 R for Public Health group project, scenario 1, milestone #4
Plot
A
Spam vs Ham Classification
This project uses the SpamAssassin corpus to build a Naive Bayes classifier that predicts whether emails are spam or ham. After preprocessing and vectorizing the text, the model is trained and evaluated on a test set. The results highlight the challenges of imbalanced text data while demonstrating the full workflow for building a spam detection model in R.