emmaqingjie/final_project_data_science

Final Project: Computational Tools for Data Science - Group 34

This repository contains the final project for the Computational Tools for Data Science course. The goal was to analyze gym user behavior using a dataset from Kaggle (link) to provide actionable insights for gym owners.

Key Analyses

  • Frequent Itemset Mining: Uncovered peak attendance times and common check-in behaviors.
  • Clustering: Identified user profiles using DBSCAN and evaluated cluster quality with the silhouette and Davies-Bouldin scores.
  • Classification: Predicted workout preferences based on demographic features like age and gender.
  • Anomaly Detection: Flagged irregular attendance patterns for retention strategy insights.
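
To make the frequent itemset mining step concrete, here is a minimal sketch in plain Python. It is not the notebooks' implementation: the check-in records, the attribute names (`weekday=`, `slot=`, `workout=`), the helper `frequent_itemsets`, and the 0.5 support threshold are all hypothetical and chosen only for illustration. The search runs level by level and uses a simple Apriori-style pruning, so larger itemsets are built only from items that were still frequent at the previous level.

```python
from collections import Counter
from itertools import combinations

# Hypothetical check-in records (invented values, not taken from the dataset).
# Each set holds the attributes of one gym visit.
checkins = [
    {"weekday=Mon", "slot=evening", "workout=weights"},
    {"weekday=Mon", "slot=evening", "workout=cardio"},
    {"weekday=Tue", "slot=morning", "workout=yoga"},
    {"weekday=Mon", "slot=evening", "workout=weights"},
    {"weekday=Wed", "slot=evening", "workout=weights"},
    {"weekday=Tue", "slot=evening", "workout=cardio"},
]


def frequent_itemsets(transactions, min_support, max_size=3):
    """Return {frozenset: support} for itemsets with support >= min_support.

    Support is the fraction of transactions that contain the itemset.
    """
    n = len(transactions)
    result = {}
    # Level 1 considers every distinct item.
    allowed = set().union(*transactions)
    for size in range(1, max_size + 1):
        counts = Counter()
        for transaction in transactions:
            # Only combine items that survived the previous level (pruning).
            kept = sorted(transaction & allowed)
            for combo in combinations(kept, size):
                counts[frozenset(combo)] += 1
        level = {
            itemset: count / n
            for itemset, count in counts.items()
            if count / n >= min_support
        }
        if not level:
            break  # no frequent itemset of this size, so none larger either
        result.update(level)
        allowed = set().union(*level)
    return result


if __name__ == "__main__":
    found = frequent_itemsets(checkins, min_support=0.5)
    # Print larger itemsets first, then by descending support.
    for itemset, support in sorted(
        found.items(), key=lambda kv: (-len(kv[0]), -kv[1], sorted(kv[0]))
    ):
        print(f"{sorted(itemset)}  support={support:.2f}")
```

On real check-in data, each transaction would be built from the columns of interest (for example a binned check-in hour and the weekday), and the support threshold would need tuning to the dataset size.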

Repository Structure

  • data/: Processed datasets used in the analysis.
    • data/_raw: Raw datasets.
    • data/plots: Visualizations and figures generated during the analysis.
  • scripts/: Jupyter notebooks.

Dataset

The dataset includes information on gym check-ins, user metadata, and workout sessions.
