Exploring Supervised Finetuning Methods for Large Language Models

2024 Final project for Deep Learning

This is the fork of LLamafactory used to perform all finetuning experiments.

Our custom YAML files used for fine-tuning can be found in project/<DATASET>/train. Our custom YAML files used for computing ROUGE scores are found in project/<DATASET>/eval. Our collected results and graphs are found in project/<DATASET>/results.

To run LLamafactory with our custom YAML files first clone this repository and run:

cd LLaMA-Factory
pip install -e .[torch,metrics]

Then run:

nvidia-smi --query-gpu=index,timestamp,utilization.gpu,memory.total,memory.free,memory.used --format=csv, --loop=10 > MEMORY_LOG_FILE_NAME.csv 2>&1 & CUDA_VISIBLE_DEVICES=0 llamafactory-cli train project/<DATASET>/train/llama3_<DATASET>_<SFT_METHOD_NAME>.yaml

To evaluate, run:

CUDA_VISIBLE_DEVICES=0 llamafactory-cli train project/<DATASET>/eval/llama3_<DATASET>_<SFT_METHOD_NAME>.yaml

The model and the results should be saved in the saves folder not tracked by this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 1,380 Commits
.github		.github
assets		assets
data		data
evaluation		evaluation
examples		examples
project		project
scripts		scripts
src		src
tests		tests
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
CITATION.cff		CITATION.cff
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
README_zh.md		README_zh.md
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Exploring Supervised Finetuning Methods for Large Language Models

About

Releases

Packages

Languages

License

Jiminator/LLaMA-Factory

Folders and files

Latest commit

History

Repository files navigation

Exploring Supervised Finetuning Methods for Large Language Models

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages