Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)

This repository contains the code for project 2 on training language models in the Winter 2022 iteration of CS324: Understanding and Developing Large Language Models at Stanford. This code was developed by Sang Michael Xie and Percy Liang.

  • In the first part of the project, we use fine-tuning/continued-pretraining to instill length controllability into GPT-2 small: the word length of the model's generated response can be controlled by prepending metadata about the desired length (a minimal sketch of this idea appears after this list). The data, preprocessing code, and training code are provided, but some hyperparameters need to be tuned.
  • In the second part of the project, we aim to instill a new capability of your choice into a language model. The project code is built on top of HuggingFace, and the data for the first part is based on OpenWebText. By default, we provide scripts for running the project on CodaLab, a platform for reproducible research, but the scripts and code can be adapted to run locally or on other platforms as well.
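
The sketch below illustrates the length-conditioning idea from the first part, assuming a `<len N>` word-count tag and the HuggingFace `Trainer` API; the tag format, the `with_length_tag` helper, and the placeholder corpus are illustrative assumptions, and the repository's own preprocessing and training scripts may differ in the details.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast, Trainer, TrainingArguments

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token
model = GPT2LMHeadModel.from_pretrained("gpt2")

def with_length_tag(text: str) -> str:
    """Prepend a word-count tag so the model can condition on target length (illustrative format)."""
    return f"<len {len(text.split())}> {text}"

# Placeholder corpus; the project uses preprocessed OpenWebText instead.
texts = ["an example passage to continue pretraining on", "another passage"]
enc = tokenizer([with_length_tag(t) for t in texts],
                truncation=True, max_length=512, padding="max_length",
                return_tensors="pt")

class LMDataset(torch.utils.data.Dataset):
    def __init__(self, enc):
        self.enc = enc
    def __len__(self):
        return self.enc["input_ids"].size(0)
    def __getitem__(self, i):
        ids = self.enc["input_ids"][i]
        labels = ids.clone()
        labels[self.enc["attention_mask"][i] == 0] = -100  # ignore padding in the loss
        return {"input_ids": ids,
                "attention_mask": self.enc["attention_mask"][i],
                "labels": labels}

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=4,
    num_train_epochs=1,
    learning_rate=5e-5,  # one of the hyperparameters the project asks you to tune
    logging_steps=50,
)
Trainer(model=model, args=args, train_dataset=LMDataset(enc)).train()
```

At generation time, the same kind of tag is prepended to the prompt (e.g. `<len 50>` to request a roughly 50-word response), and the fine-tuned model learns to respect it. Again, the `<len N>` format here is only an assumption for illustration; the repository's preprocessing code defines the actual metadata format.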

Useful links:

Setup and quickstart for CodaLab:
