Project Overview

In this tutorial, we'll cover the full process of building a beginner machine learning project. This includes creating a hypothesis, setting up the model, and measuring error. By the end, you'll understand how to build an end-to-end machine learning project using Python and Jupyter.

To keep things interesting, we'll use data from historical Olympic Games and try to predict how many medals a country will win based on historical and current data.

Machine learning project steps

Most machine learning projects follow a similar outline, and we'll follow it here. The same outline will help you tackle most machine learning problems.

Project Steps

1. Form a hypothesis.
2. Find and explore the data.
3. (If necessary) Reshape the data to predict your target.
4. Clean the data for ML.
5. Pick an error metric.
6. Split your data.
7. Train a model.
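
The steps above can be sketched end to end in a few lines. This is a minimal illustration, not the notebook's actual code: it runs on synthetic data, the column names (`year`, `athletes`, `medals`) are hypothetical, and the choices of linear regression, mean absolute error, and a split by year are assumptions made for the example.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error

# Synthetic stand-in for team-level data; column names are hypothetical.
rng = np.random.default_rng(0)
n_rows = 200
teams = pd.DataFrame({
    "year": rng.choice([2004, 2008, 2012, 2016], size=n_rows),
    "athletes": rng.integers(1, 500, size=n_rows),
})

# Hypothesis: teams that send more athletes win more medals.
noise = rng.normal(0, 2, size=n_rows)
teams["medals"] = (0.1 * teams["athletes"] + noise).clip(lower=0).round()

# Clean the data: drop rows with missing values.
teams = teams.dropna()

# Split the data: train on earlier games, test on later ones (an assumption).
train = teams[teams["year"] < 2012]
test = teams[teams["year"] >= 2012]

predictors = ["athletes"]
target = "medals"

# Train a model, then measure error with the chosen metric.
model = LinearRegression()
model.fit(train[predictors], train[target])
predictions = model.predict(test[predictors])

mae = mean_absolute_error(test[target], predictions)
print(f"Mean absolute error: {mae:.2f}")
```

Splitting by year rather than at random keeps later games out of the training set, which mirrors how such a model would be used to predict a future event.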

Code

You can find the code for this project here.

File overview:

machine_learning.ipynb - the main project code
data_prep.ipynb - the code to generate the team-level dataset from an athlete-level dataset
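
As a rough idea of what turning athlete-level rows into a team-level table can look like, here is a small pandas sketch. It is not the code from data_prep.ipynb: the tiny DataFrame and its column names (`team`, `year`, `name`, `medal`) are made up for illustration.

```python
import pandas as pd

# Hypothetical athlete-level rows: one row per athlete per event.
athletes = pd.DataFrame({
    "team": ["A", "A", "A", "B", "B"],
    "year": [2016, 2016, 2016, 2016, 2016],
    "name": ["x", "y", "y", "z", "w"],
    "medal": ["Gold", None, "Silver", None, None],
})

# Aggregate to one row per team and year.
teams = (
    athletes.groupby(["team", "year"])
    .agg(
        athletes=("name", "nunique"),  # distinct athletes sent
        medals=("medal", "count"),     # non-missing medal entries
    )
    .reset_index()
)

print(teams)
```

Counting medal rows this way counts every member of a medal-winning team event separately, so a real preparation step would need to decide how to handle team events.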

Local Setup

Installation

To follow this project, please install the following locally:

Python 3.8+
Python packages
- pandas
- numpy
- scikit-learn
- seaborn
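
One way to confirm the setup is to check the Python version and whether each package can be found. This is a small sketch using only the standard library; the `missing_packages` helper is a hypothetical name. Note that scikit-learn is imported as `sklearn`.

```python
import importlib.util
import sys

# Maps each install name to the module name used in import statements.
REQUIRED = {
    "pandas": "pandas",
    "numpy": "numpy",
    "scikit-learn": "sklearn",
    "seaborn": "seaborn",
}


def missing_packages(required=REQUIRED):
    """Return the install names of packages that cannot be found."""
    return [
        install_name
        for install_name, module_name in required.items()
        if importlib.util.find_spec(module_name) is None
    ]


if sys.version_info < (3, 8):
    print("Python 3.8 or newer is required.")

missing = missing_packages()
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are installed.")
```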

Data

We'll use Olympic data that was originally published on Kaggle.

You can download the files we'll use in this project here:

teams.csv - the team-level data we use in this project
athlete_events.csv - the original athlete-level data
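
Once the files are downloaded, a first look at the data might go as follows. This sketch assumes teams.csv sits in the current working directory; the `load_and_summarize` helper is a hypothetical name, and no particular columns are assumed.

```python
from pathlib import Path

import pandas as pd


def load_and_summarize(csv_path):
    """Read a CSV file and print its shape and missing-value counts."""
    path = Path(csv_path)
    if not path.exists():
        raise FileNotFoundError(f"{path} not found; download it first.")
    df = pd.read_csv(path)
    print("Rows, columns:", df.shape)
    print("Missing values per column:")
    print(df.isna().sum())
    return df


# Only runs if the file has been downloaded next to this script or notebook.
if Path("teams.csv").exists():
    teams = load_and_summarize("teams.csv")
    print(teams.head())
```

Checking missing values early shows how much cleaning the data will need before it can be used for modeling.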
