makemore

makemore takes one text file as input, where each line is assumed to be one training thing, and generates more things like it. Under the hood, it is an autoregressive character-level language model, with a wide choice of models from bigrams all the way to a Transformer (exactly as seen in GPT). For example, we can feed it a database of names, and makemore will generate cool baby name ideas that all sound name-like, but are not already existing names. Or if we feed it a database of company names then we can generate new ideas for a name of a company. Or we can just feed it valid scrabble words and generate english-like babble.

This is not meant to be too heavyweight library with a billion switches and knobs. It is one hackable file, and is mostly intended for educational purposes. PyTorch is the only requirement.

KV Cache

An inference-time technique that makes attention O(n) by storing past keys and values. Trade memory for time.

Speed Improvement:

147.1 seconds without KV cache --> 20.1 seconds with KV cache

Generated 4000 (shakespeare) lines of upto upto 77 characters

Observation

KV cache won't work during training since weights are changing
Works with all types of embeddings
Won't work when the context window is shifted since the KV cache in memory would be invalid since they use the old postional embeddings
Basically works within one context window

Up Next

RoPE
Speculative Decoding

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
out		out
LICENSE		LICENSE
README.md		README.md
kv_cache.ipynb		kv_cache.ipynb
makemore.py		makemore.py
names.txt		names.txt
shakespeare_cleaned.txt		shakespeare_cleaned.txt
without_kvcache.ipynb		without_kvcache.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

makemore

KV Cache

Speed Improvement:

Observation

Up Next

About

Releases

Packages

Languages

License

Patchwork53/makemore

Folders and files

Latest commit

History

Repository files navigation

makemore

KV Cache

Speed Improvement:

Observation

Up Next

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages