Emails-Spam-Classifier

This is a Binary Classification Problem Statement in which we have to classify the Ham and the Spam emails.

About the Spam Dataset

Here is the introduction of the dataset i will be using for this project.

The dataset used in this project is from Apache SpamAssassin.

Apache SpamAssassin is the #1 Open Source anti-spam platform giving system administrators a filter to classify email and block spam (unsolicited bulk email).

It uses a robust scoring framework and plug-ins to integrate a wide range of advanced heuristic and statistical analysis tests on email headers and body text including text analysis, Bayesian filtering, DNS blocklists, and collaborative filtering databases.

Apache SpamAssassin is a project of the Apache Software Foundation (ASF). You can find more about them from the below link:

https://spamassassin.apache.org/

The dataset we will be using is hosted at the below link:

http://spamassassin.apache.org/old/publiccorpus/

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
datasets/spam		datasets/spam
Flow Chart.png		Flow Chart.png
README.md		README.md
spam_classifier.ipynb		spam_classifier.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Emails-Spam-Classifier

About the Spam Dataset

About

Releases

Packages

Languages

anubhav6864/Spam-Classifier-Emails-

Folders and files

Latest commit

History

Repository files navigation

Emails-Spam-Classifier

About the Spam Dataset

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages