Handwritten Digit Classifier Project This project utilizes a machine learning pipeline to classify handwritten digits. The project trains, tests, and cross-validates k-nearest neighbors and neural network models to find the best predictive accuracy with the least amount of overfitting.
Titanic Survivor Prediction Project Machine learning project using functions for efficient manipulation of feature engineering, feature selection, model selection, and hyperparameter tuning. Compares logistic regression, k-nearest neighbors, and random forest models to predict Titanic survivors.
D.C. Bike Rental Prediction Project This machine learning project compares linear regression, decision trees, and random forest models to predict bike rentals in Washington D.C.
Linear Regression Home Sale Price Prediction Project This project uses a machine learning workflow including feature engineering, feature selection, a linear regression train/test model, and k-fold validation to predict home sale prices. Functions are used to make the workflow efficient and iterable.
K-nearest Neighbors Car Price Prediction Project This project uses the k-nearest neighbors technique and machine learning algorithms and workflow to predict used car prices based on feature selection.
2020 ATP Tour Tennis Project This project utilizes a datset from Jeff Sackmann for the 2020 men's professional tennis tour to explore players' seasons and stats. The project involves some data preparation, visualization, and descripive analysis.
Hacker News Project This project is an example of using Python to sort, reformat, clean, and analyze data. It investigates posts to Hacker News.
German eBay Used Car Project This project uses the pandas and NumPy libraries in Python to clean and analyze data from a German eBay used car listing file.
Employee Exit Surveys Project This project involves cleaning and combining two datasets to analyze employee exit surveys in Australia.
Star Wars Survey Project This is a fun little project exploring a FiveThirtyEight survey of rankings of the Star Wars films.
College Degree Gender Gap Project This project is an example of creating visualizations in Python using the matplot library. It explores the gender gap in college degrees.
SQLite Project This project uses SQLite and a Python workflow to explore, analyze, and visualize data from the CIA World Factbook.