These are materials I use for a class on machine learning with scikit-learn, keras and tensorflow. Each file is a self contained unit that demonstrates a specific thing. Downloading or cloning this repository before class is a great way to follow along.
Getting your machine setup is essential to following along with a class. I try really hard to make this class as easy to setup as possible, but setting up a machine for development can be challenging if you haven't done it before. When in doubt, copy the error messages into Google! All of the programs you need to install for this are extremely standard and for any issue you run into, hundreds of other people have encountered the same issue.
If you are uncomfortable opening up a terminal, I strongly recommend doing a quick tutorial before you take this class. Setting up your machine can be painful but once you're setup you can get a ton out of the class. I recommend getting started ahead of time.
If you're on Windows I recommend checking out http://thepythonguru.com/.
If you're on a Mac check out http://www.macworld.co.uk/how-to/mac/coding-with-python-on-mac-3635912/
If you're on linux, you're probably already reasonably well setup :).
If you run into trouble, the book Learn Python the Hard Way has installation steps in great detail: https://learnpythonthehardway.org/book/ex0.html. It also has a refresher on using a terminal in the appendix.
If you are comfortable opening up a terminal but want a python intro/refresher check out https://www.learnpython.org/ for a really nice introduction to Python.
A lot of people like to follow along with ipython or jupyter notebooks and I think that's great! It makes data exploration easier. I also really appreciate pull requests to make the code clearer.
If you've never used pandas or numpy - they are great tools and I use them heavily in my work and for this class. I assume no knlowedge of pandas and numpy but you may want to do some learning on your own. You can get a quick overview of pandas at http://pandas.pydata.org/pandas-docs/stable/10min.html. There is a great overview of numpy at https://docs.scipy.org/doc/numpy/user/quickstart.html.
Before the class, please run the following commands to install the prerequisite code.
Install git: https://git-scm.com/download/win
Install anaconda
Try running the following from the command prompt:
python --version
You should see something like
Python 3.6.1 :: Anaconda 4.4.0 (64-bit)
If don't see "Anaconda" in the output, search for "anaconda prompt" from the start menu and enter your command prompt this way. It's also best to use a virtual environment to keep your packages silo'ed. Do so with:
conda create -n ml-class python=3.6
activate ml-class
Whenever you start a new terminal, you will need to call activate ml-class
.
The most common problem is an old version of python. Its easy to have multiple versions of python installed at once and Macs in particular come with a default version of python that is too old to install tensorflow.
Try running:
python --version
If your version is less than 2.7.12, you have a version issue. Try reinstalling python 2.
git clone https://github.com/lukas/ml-class.git
cd ml-class
pip install wandb
conda install -c conda-forge scikit-learn
conda install -c conda-forge tensorflow
conda install -c conda-forge keras
You can download python from https://www.python.org/downloads/. There are more detailed instructions for windows installation at https://www.howtogeek.com/197947/how-to-install-python-on-windows/.
The material should work with python 2 or 3. On Windows, you need to install thre 64 bit version of python 3.5 or 3.6 in order to install tensorflow.
git clone https://github.com/lukas/ml-class.git
cd ml-class
If you get an error message here, most likely you don't have git installed. Go to https://www.atlassian.com/git/tutorials/install-git for intructions on installing git.
pip install wandb
pip install pandas
pip install scikit-learn
pip install tensorflow
pip install keras
pip install h5py
pip install flask
pip install scikit-image
pip install scipy
pip install pillow
To make sure your installation works go to the directory where this file is and run
python test-scikit.py
You should see the output "Scikit is installed!"
python test-keras.py
You should see the output "Using TensorFlow backend. Keras is installed!"
Please download the large files before the class
python keras-download.py
Also download VGG weights file from https://drive.google.com/file/d/0Bz7KyqmuGsilT0J5dmRCM0ROVHc
If you want to download your own images for classification, one fun/easy way to do it is install the Chrome extension bulk image downloader at http://www.talkapps.org/bulk-image-downloader
Order of presentation of files, if you want to follow along
- load-data.py
- pandas-explore.py
- feature-extraction-1.py
- feature-extraction-2.py
- feature-extraction-3.py
- classifier.py
- test-algorithm-1.py
- test-algorithm-2.py
- test-algorithm-dummy.py
- test-algorithm-cross-validation.py
- test-algorithm-cross-validation-dummy.py
- custom-tokenizer.py
- pipeline.py
- pipeline-bigrams.py
- pipeline-bigrams-cross-validation.py
- feature-selection.py
- grid-search.py
- pipeline-custom-features.py
- pipeline-save.py
- pipeline-server.py
- perceptron.py
- keras-digits.py
- keras-scikit-learn.py
- keras-one-hot.py
- log-loss.py
- keras-perceptron/perceptron-1.py
- keras-perceptron/perceptron-2.py
- keras-perceptron/perceptron-3.py
- keras-perceptron/perceptron-4.py
- keras-perceptron/perceptron-checkpoint.py
- keras-perceptron/perceptron-save.py
- keras-perceptron/perceptron-load.py
- keras-perceptron/perceptron-regression.py
- keras-mlp/mlp.py
- keras-mlp/dropout.py
- tensorflow-mult.py
- tensorflow-perceptron.py
- keras-weights.py
- keras-weights-answer.py
- convolution-demo.py
- maxpool-demo.py
- keras-cnn/cnn-1.py
- keras-cnn/cnn-2.py
- keras-cnn/cnn-inspect.py
- keras-deep-dream.py
- keras-vgg-inspect.py
- keras-resnet50-inspect.py
- keras-inception-inspect.py
- smile.py
- smile-generator.py
- smile-server-1.py
- smile-server-2.py
- smile-server-3.py
- dogcat-1.py
- dogcat-generator.py
- dogcat-bottleneck.py
- dogcat-transfer.py
- dogcat-finetune.py
- dogcat-transfer-and-finetune.py
- lstm-time-series-output.py
- lstm-time-series-stateful.py
- lstm-time-series-timesteps.py
- lstm-time-series-window.py
- lstm-preprocess-text.py
- lstm-generate-text-rnn.py
- lstm-train-rnn.py
- lstm-train-lstm.py
- lstm-generate-text-lstm.py
- lstm-generate-text-rnn.py
- lstm-imdb-sentiment-cnn.py
- inspect-net.py