My very first neural network built from scratch (it still looks ugly at the moment). The only library used here is NumPy. The most difficult part is calculating the partial derivatives in backpropagation. I found three different ways to calculate the gradient.
Method 1: Calculating the partial derivative with respect to each weight and bias individually. It is inefficient, but it is a good beginning for understanding how a small change in a parameter changes the output; a sketch follows below. Tutorial
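A minimal sketch of this per-parameter idea (illustrative only, not this repo's code): approximate each partial derivative numerically by nudging one weight or bias at a time and watching the loss change. The `loss_fn` closure and the toy quadratic loss here are assumptions made for the example.

```python
import numpy as np

def numerical_gradient(loss_fn, params, eps=1e-5):
    """Central finite differences: perturb each parameter entry one at a time."""
    grads = []
    for p in params:
        grad = np.zeros_like(p)
        it = np.nditer(p, flags=["multi_index"])
        while not it.finished:
            idx = it.multi_index
            old = p[idx]
            p[idx] = old + eps            # nudge the single parameter up
            loss_plus = loss_fn()
            p[idx] = old - eps            # nudge it down
            loss_minus = loss_fn()
            p[idx] = old                  # restore the original value
            grad[idx] = (loss_plus - loss_minus) / (2 * eps)
            it.iternext()
        grads.append(grad)
    return grads

# Toy usage: a quadratic "loss" over one linear layer.
W = np.random.randn(3, 2)
b = np.random.randn(2)
x = np.random.randn(3)
loss_fn = lambda: float(np.sum((x @ W + b) ** 2))
dW, db = numerical_gradient(loss_fn, [W, b])
```

Touching every parameter one by one is exactly why this method is slow, but it also makes it a handy gradient check against the analytic methods below.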
Method 2: Using the Jacobian matrix to find the gradient by computing the first-order partial derivatives between vectors. Jacobian
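Here is what the Jacobian chain rule looks like for a single dense layer, as an illustrative sketch; the sigmoid activation and the toy loss are assumptions, not necessarily what the network uses:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# One dense layer: a = sigmoid(W @ x + b)
W = np.random.randn(4, 3)
b = np.random.randn(4)
x = np.random.randn(3)

z = W @ x + b
a = sigmoid(z)

# First-order partial derivatives between vectors, as explicit Jacobians:
J_z_x = W                        # dz/dx, shape (4, 3)
J_a_z = np.diag(a * (1 - a))     # da/dz for the elementwise sigmoid, shape (4, 4)
J_a_x = J_a_z @ J_z_x            # chain rule: da/dx, shape (4, 3)

# For a scalar loss L(a), the gradient w.r.t. x is a vector-Jacobian product:
dL_da = 2 * a                    # e.g. L = sum(a ** 2)
dL_dx = J_a_x.T @ dL_da          # shape (3,)
```

Note that materializing full Jacobians costs memory quadratic in the layer width, which is why practical backpropagation keeps only the vector-Jacobian products.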
Method 3: Using a scalar function to find the gradient. The method is explained in Matrix Calculus.
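My reading of this matrix-calculus approach, as a sketch: treat the loss as a scalar function of whole matrices and differentiate directly, so every gradient comes out with the same shape as its parameter and no Jacobian is ever formed. The sigmoid activation, the mean-squared-error loss, and the batch shapes below are assumptions for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Batch of N inputs through one dense layer: A = sigmoid(X @ W + b)
N, d_in, d_out = 10, 784, 32        # match the first layer of the 784-32-10 network below
X = np.random.randn(N, d_in)
W = np.random.randn(d_in, d_out)
b = np.random.randn(d_out)
Y = np.random.randn(N, d_out)       # dummy targets

Z = X @ W + b
A = sigmoid(Z)
L = 0.5 * np.mean(np.sum((A - Y) ** 2, axis=1))   # scalar loss

# Matrix-calculus results: every gradient has the same shape as its parameter.
dL_dA = (A - Y) / N
dL_dZ = dL_dA * A * (1 - A)         # elementwise; no Jacobian is materialized
dL_dW = X.T @ dL_dZ                 # shape (d_in, d_out), same as W
dL_db = dL_dZ.sum(axis=0)           # shape (d_out,), same as b
```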
Best Training Scores:
Parameter | Value |
---|---|
Network layers | 784-32-10 |
Batch size | 10 |
Training set | 40,000 images |
Validation set | 400 images (4-fold cross-validation, train:validation = 0.99:0.01) |
Weight and bias initialization | standard normal distribution |
Learning rate | 0.1 |
Epochs | 15 |
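For concreteness, the table above translates into roughly the following setup. This is a sketch with illustrative variable names, not the repo's actual code:

```python
import numpy as np

rng = np.random.default_rng(0)

sizes = [784, 32, 10]                        # network layers
# Standard normal initialization for weights and biases:
weights = [rng.standard_normal((m, n)) for m, n in zip(sizes[:-1], sizes[1:])]
biases = [rng.standard_normal(n) for n in sizes[1:]]

batch_size = 10
learning_rate = 0.1
epochs = 15
```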
Best Test Scores:
TODO
TODO:
- Fix training
- Add NumPy type hints (see the sketch below)
- Add Method 3
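For the type-hints item, one possible shape uses NumPy's own `numpy.typing.NDArray` alias; the `sigmoid` function here is just an example signature, not necessarily one from this repo:

```python
import numpy as np
from numpy.typing import NDArray

def sigmoid(z: NDArray[np.float64]) -> NDArray[np.float64]:
    """Elementwise sigmoid, annotated with NumPy's NDArray alias."""
    return 1.0 / (1.0 + np.exp(-z))
```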
References:
- Calculus for backpropagation (calculating the gradient)
- Matrix Calculus (an alternative way to calculate the gradient)
- Attention Is All You Need (referenced paper)