These are materials I use for a class on machine learning with scikit-learn, keras and tensorflow. Each file is a self contained unit that demonstrates a specific thing. Downloading or cloning this repository before class is a great way to follow along.
Getting your machine setup is essential to following along with a class. I try really hard to make this class as easy to setup as possible, but setting up a machine for development can be challenging if you haven't done it before. When in doubt, copy the error messages into Google! All of the programs you need to install for this are extremely standard and for any issue you run into, hundreds of other people have encountered the same issue.
If you are uncomfortable opening up a terminal, I strongly recommend doing a quick tutorial before you take this class. Setting up your machine can be painful but once you're setup you can get a ton out of the class. I recommend getting started ahead of time.
If you're on Windows I recommend checking out http://thepythonguru.com/.
If you're on a Mac check out http://www.macworld.co.uk/how-to/mac/coding-with-python-on-mac-3635912/
If you're on linux, you're probably already reasonably well setup :).
If you run into trouble, the book Learn Python the Hard Way has installation steps in great detail: https://learnpythonthehardway.org/book/ex0.html. It also has a refresher on using a terminal in the appendix.
If you are comfortable opening up a terminal but want a python intro/refresher check out https://www.learnpython.org/ for a really nice introduction to Python.
A lot of people like to follow along with ipython or jupyter notebooks and I think that's great! It makes data exploration easier. I also really appreciate pull requests to make the code clearer.
If you've never used pandas or numpy - they are great tools and I use them heavily in my work and for this class. I assume no knlowedge of pandas and numpy but you may want to do some learning on your own. You can get a quick overview of pandas at http://pandas.pydata.org/pandas-docs/stable/10min.html. There is a great overview of numpy at https://docs.scipy.org/doc/numpy/user/quickstart.html.
Before the class, please run the following commands to install the prerequisite code.
You can download python from https://www.python.org/downloads/. There are more detailed instructions for windows installation at https://www.howtogeek.com/197947/how-to-install-python-on-windows/.
The material should work with python 2 or 3. If you don't know the difference, installing python 2 seems to lead to slightly less problems with the installation.
git clone https://github.com/lukas/ml-class.git
cd ml-class
If you get an error message here, most likely you don't have git installed. Go to https://www.atlassian.com/git/tutorials/install-git for intructions on installing git.
pip install pandas
pip install scikit-learn
pip install tensorflow
pip install keras
pip install h5py
pip install flask
pip install scikit-image
pip install scipy
pip install pillow
The most common problem is an old version of python. Its easy to have multiple versions of python installed at once and Macs in particular come with a default version of python that is too old to install tensorflow.
Try running:
python --version
If your version is less than 2.7.12, you have a version issue. Try reinstalling python 2.
To make sure your installation works go to the directory where this file is and run
python test-scikit.py
You should see the output "Scikit is installed!"
python test-keras.py
You should see the output "Using TensorFlow backend. Keras is installed!"
Please download the large files before the class
python keras-download.py
Also download VGG weights file from https://drive.google.com/file/d/0Bz7KyqmuGsilT0J5dmRCM0ROVHc
Order of presentation of files, if you want to follow along
- test-scikit.py
- load-data.py
- pandas-explore.py
- feature-extraction-1.py
- feature-extraction-2.py
- feature-extraction-3.py
- classifier.py
- test-algorithm-1.py
- test-algorithm-2.py
- test-algorithm-dummy.py
- test-algorithm-cross-validation.py
- test-algorithm-cross-validation-dummy.py
- custom-tokenizer.py
- pipeline.py
- pipeline-bigrams.py
- pipeline-bigrams-cross-validation.py
- feature-selection.py
- grid-search.py
- pipeline-save.py
- pipeline-server.py
- perceptron.py
- keras-digits.py
- keras-scikit-learn.py
- keras-one-hot.py
- log-loss.py
- keras-perceptron-1.py
- keras-perceptron-2.py
- keras-perceptron-3.py
- keras-perceptron-4.py
- keras-perceptron-save.py
- keras-perceptron-load.py
- keras-mlp.py
- keras-dropout.py
- tensorflow-mult.py
- tensorflow-perceptron.py
- convolution-demo.py
- maxpool-demo.py
- keras-cnn-1.py
- keras-cnn-2.py
- keras-cnn-inspect.py
- keras-deep-dream.py
- keras-vgg-inspect.py
- keras-resnet50-inspect.py
- keras-inception-inspect.py
- smile.py
- smile-generator.py
- smile-server-1.py
- smile-server-2.py
- smile-server-3.py
- dogcat-1.py
- dogcat-generator.py
- dogcat-bottleneck.py
- dogcat-transfer.py
- dogcat-finetune.py
- dogcat-transfer-and-finetune.py
- lstm-preprocess-text.py
- lstm-generate-text-rnn.py
- lstm-train-rnn.py
- lstm-train-lstm.py
- lstm-generate-text-lstm.py
- lstm-generate-text-rnn.py
- lstm-imdb-sentiment-cnn.py