Skip to content

bllamasy/capstone

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 

Repository files navigation

ML in Production - Capstone

This repo corresponds to the course of Machine Learning in Production. This code is done for educational purposes. As such, it is neither a real production code, nor a toy example easy to understand but useless. We tried to make it as similar as possible to real production systems, highlighting some parts and missing others to make it more readable.

2020's Edition

In 2020's edition we will train a model to tag Stackoverflow's questions. Data is publicly available here. Basically

  • We build a pipeline in Airflow to preprocess data in Google's BigQuery.
  • We create Python packages, with their corresponding tests, to preprocess text, train a model and predict it.
  • We create Dockerfiles that runs a Flask app that serves the model.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 73.4%
  • Jupyter Notebook 16.8%
  • Makefile 8.1%
  • Dockerfile 1.7%