ml-svm-spamassassin-spam-classifier

This jupyter notebook application fits a Support Vector Machine model to classify emails as spam or not using the SpamAssassin Public Corpus data, and also provides the ability to predict new emails.

Setup

Clone this repo to your desktop.
Extract the data.rar file to the root directory of this project.
Create a new anaconda environment with all the requirements using the following command:
```
conda env create -f environment.yml
```
Activate the environment using (windows)
```
activate spam-classifier-env
```
or if you are on a linux machine
```
source activate spam-classifier-env
```
Run jupyter notebook from the root directory to open up the notebook in your browser.

Usage

To predict new emails:

Run all cells in the jupyter notebook to train the model.
Add the txt file(s) of the new raw sample(s) you want to predict to the "data\samples" directory. (Note: Make sure the data.rar file has already been extracted.)
Run predictSamples(filenames) where "filenames" is a python list of the names of all the samples you want to predict, such as:
```
filenames = ['emailSample1.txt', 'emailSample2.txt']
```

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
README.md		README.md
data.rar		data.rar
environment.yml		environment.yml
model.ipynb		model.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ml-svm-spamassassin-spam-classifier

Setup

Usage

About

Releases

Packages

Languages

anees-asghar/ml-svm-spamassassin-spam-classifier

Folders and files

Latest commit

History

Repository files navigation

ml-svm-spamassassin-spam-classifier

Setup

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages