Skip to content

Latest commit

 

History

History

Spam_Detection

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

SMS Spam Detection

Project Overview

This project focuses on detecting spam messages using natural language processing (NLP) techniques and machine learning models.

Dataset

The dataset includes SMS messages labeled as spam or ham (not spam).

Project Steps

  1. Data Preprocessing:

    • Text cleaning and normalization.
    • Tokenization and stemming.
    • Converting text data into numerical representations using techniques like TF-IDF.
  2. Exploratory Data Analysis (EDA):

    • Visualizing the distribution of spam and ham messages.
    • Analyzing common words and phrases in spam messages.
  3. Model Building:

    • Training various machine learning models like Naive Bayes, SVM, and Random Forest.
    • Evaluating model performance using metrics such as accuracy, precision, recall, and F1-score.
  4. Model Evaluation:

    • Comparing different models.
    • Selecting the best model based on evaluation metrics.

Usage

To run this project, ensure you have the required packages installed and execute the notebook.

Requirements

Refer to the requirements.txt file for a list of dependencies.