Generated a machine learning model to detect whether email is spam or not.
Note for setting up
- open SpamEmailML.ipynb on jupyter/machine learning studios
- make sure spambase.csv is in same folder (data file)
- cell - run all to create and run model
Project inspiration/goal
- develop a machine learning model that can detect spam emails
- spam emails are quite prevalent, and having this technology would save a persons time
Classification model used
- gaussian naive bayes since it is generally used for this problem, and deals with continuous features well
Other
- this model is specific to the context of the data set provided (business context in usa), and is not in any way a general spam email filter that can be commercially used.
- data at - https://archive.ics.uci.edu/ml/datasets/Spambase
- it provides a method that can be used to create a more general spam filter, but requires much more example emails/data