Today we discuss some powerful techniques for improving training and avoiding overfitting (minimal code sketches of each follow the list below):
- Dropout: remove activations at random during training in order to regularize the model
- Data augmentation: modify model inputs during training in order to effectively increase the size of the training set
- Batch normalization: adjust the parameterization of a model in order to make the loss surface smoother
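As a rough illustration of the first technique, here is a minimal plain-PyTorch sketch (not the fastai code from the lesson): dropout zeroes activations at random while training and switches itself off at inference.

```python
import torch
import torch.nn as nn

# A tiny classifier with a dropout layer between its two linear layers.
net = nn.Sequential(
    nn.Linear(784, 256),
    nn.ReLU(),
    nn.Dropout(p=0.5),   # zero each activation with probability 0.5 during training
    nn.Linear(256, 10),
)

x = torch.randn(2, 784)
net.train()
print(net(x))            # differs run to run: a different random set of units is dropped
net.eval()
print(net(x))            # deterministic: dropout is a no-op at inference time
```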
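Data augmentation can be sketched with torchvision transforms (the lesson itself uses fastai's transform API, so treat this as an illustrative stand-in): each epoch sees a slightly different version of every image.

```python
from torchvision import transforms

# Hypothetical augmentation pipeline applied to each image as it is loaded,
# so the model rarely sees exactly the same input twice.
train_tfms = transforms.Compose([
    transforms.RandomResizedCrop(224),                     # random crop and rescale
    transforms.RandomHorizontalFlip(),                     # mirror half the time
    transforms.ColorJitter(brightness=0.2, contrast=0.2),  # small lighting changes
    transforms.ToTensor(),
])
# augmented = train_tfms(pil_image)   # pil_image would be a PIL.Image from the dataset
```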
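And a minimal batch normalization sketch: the layer normalizes each feature using batch statistics and then rescales with learnable parameters, which is what smooths the loss surface the optimizer has to navigate.

```python
import torch
import torch.nn as nn

bn = nn.BatchNorm1d(256)                   # one mean/variance pair per feature
x = torch.randn(32, 256) * 5 + 3           # activations with large mean and variance
y = bn(x)                                  # training mode: normalizes with batch statistics
print(x.mean().item(), x.std().item())     # roughly 3 and 5
print(y.mean().item(), y.std().item())     # roughly 0 and 1 after normalization
```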
Next up, we'll learn all about convolutions, which can be thought of as a variant of matrix multiplication with tied weights, and are the operation at the heart of modern computer vision models (and, increasingly, other types of models too).
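To make the "matrix multiplication with tied weights" view concrete, here is a small sketch (my own illustration, not code from the lesson) showing that a 1-D convolution gives the same result as multiplying by a matrix whose rows all reuse the same kernel values:

```python
import torch
import torch.nn.functional as F

x = torch.arange(6.0)                          # input signal: [0, 1, 2, 3, 4, 5]
kernel = torch.tensor([1.0, -2.0, 1.0])

# Convolution (stride 1, no padding) via PyTorch.
conv_out = F.conv1d(x.view(1, 1, -1), kernel.view(1, 1, -1)).squeeze()

# The equivalent weight matrix: every row contains the same three kernel
# values ("tied weights"), just shifted one position to the right.
W = torch.zeros(4, 6)
for i in range(4):
    W[i, i:i + 3] = kernel

matmul_out = W @ x
print(torch.allclose(conv_out, matmul_out))    # True
```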
We'll use this knowledge to create a class activation map (CAM), which is a heatmap that shows which parts of an image were most important in making a prediction.
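A minimal class activation map sketch, assuming a ResNet-style network whose head is global average pooling followed by a single linear layer; the layer and attribute names below are torchvision's resnet18 (recent torchvision), used here as a stand-in for the model built in the lesson.

```python
import torch
import torch.nn.functional as F
from torchvision.models import resnet18

model = resnet18(weights=None).eval()          # in practice, load trained weights and a real image

acts = {}
def save_features(module, inp, out):
    acts["features"] = out.detach()            # feature maps from the last conv block

model.layer4.register_forward_hook(save_features)

x = torch.randn(1, 3, 224, 224)                # stand-in for a preprocessed image tensor
with torch.no_grad():
    logits = model(x)
cls = logits.argmax(dim=1).item()              # predicted class

# Weight each feature map by the final linear layer's weight for that class,
# then sum across channels to get one coarse spatial heatmap.
fmap = acts["features"][0]                     # (channels, h, w)
w = model.fc.weight[cls]                       # (channels,)
cam = F.relu((w[:, None, None] * fmap).sum(0))
cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)   # scale to [0, 1]
print(cam.shape)                               # e.g. 7x7; upsample to overlay on the image
```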
Finally, we'll cover a topic that many students have told us is the most interesting and surprising part of the course: data ethics. We'll learn about some of the ways in which models can go wrong, with a particular focus on feedback loops, why they cause problems, and how to avoid them. We'll also look at ways in which bias in data can lead to biased algorithms, and discuss questions that data scientists can and should be asking to help ensure that their work doesn't lead to unexpected negative outcomes.
- Detailed lesson notes - thanks to @hiromi
- Notebooks:
- Lesson 6 in-class discussion thread
- Lesson 6 advanced discussion
- platform.ai discussion
- 50 Years of Test (Un)fairness: Lessons for Machine Learning
- Convolutions:
- Convolution Arithmetic:
- Normalization:
- Cross entropy loss:
- How CNNs work:
- Image processing and computer vision:
- "Yes you should understand backprop":
- BERT state-of-the-art language model for NLP:
- Hubel and Wiesel:
- Perception: