Fine-Tuning Llama2 and Google Gemma Using LoRA and PEFT

This repository contains two notebooks that demonstrate how to fine-tune the Llama2 and Google Gemma models on custom datasets using Low-Rank Adaptation (LoRA) with Parameter-Efficient Fine-Tuning (PEFT) and Transformer Reinforcement Learning (TRL).

More to come....

Check out the official paper, LoRA: Low-Rank Adaptation of Large Language Models.

See also the 🤗 PEFT repository on GitHub: State-of-the-art Parameter-Efficient Fine-Tuning (PEFT) methods.

Getting Started

Prerequisites

Google Colab or a similar Jupyter notebook environment.
Basic knowledge of Python, machine learning, and natural language processing.

Dependencies

For Llama2_Fine_Tuning_HF.ipynb:

Python 3.10
torch
transformers
peft
bitsandbytes
trl (for transformer reinforcement learning)
huggingface_hub

The Finetuning_Google_Gemma.ipynb notebook may require similar dependencies, which are typically installed within the notebook.
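
Before running either notebook, it can help to confirm which of the packages listed above are importable in the current environment. The following is a minimal sketch using only the standard library; the helper name `missing_packages` is hypothetical and is not part of this repository.

```python
from importlib.util import find_spec

# Import names of the packages listed under Dependencies.
REQUIRED = ["torch", "transformers", "peft", "bitsandbytes", "trl", "huggingface_hub"]


def missing_packages(names=REQUIRED):
    """Return the names that cannot be found on the current import path."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        # These would need to be installed, e.g. with the install cells in the notebooks.
        print("Missing:", ", ".join(missing))
    else:
        print("All listed dependencies are importable.")
```

The check only tests whether each package can be located; it does not verify versions or GPU support.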

Installation

Clone this repository and open the notebooks in your preferred Jupyter environment. I prefer running them in Google Colab on the free-tier T4 GPU. Install the required dependencies by running the installation commands provided in the notebooks.

Usage

Llama2 Fine-Tuning

Load the llama-2-7b model.
Train the model on the custom dataset, referred to as "mental_health_data" in the notebook, or on any Hugging Face dataset of your choice.
The notebook guides you through fine-tuning the model, visualizing training plots in TensorBoard, performing inference, and saving the fine-tuned model.
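
The loading and adapter setup described in the steps above might look roughly like the sketch below. It is not copied from the notebook: the checkpoint name `meta-llama/Llama-2-7b-hf`, the LoRA hyperparameters, the target modules, and the function name `load_model_with_lora` are all assumptions chosen for illustration.

```python
# Assumed checkpoint name; the notebook may load a different Llama 2 7B variant.
MODEL_ID = "meta-llama/Llama-2-7b-hf"

# Illustrative LoRA settings, not the notebook's actual values.
LORA_KWARGS = dict(
    r=16,                                 # rank of the low-rank update matrices
    lora_alpha=32,                        # scaling factor applied to the update
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
)


def load_model_with_lora(model_id=MODEL_ID, lora_kwargs=LORA_KWARGS):
    """Load a causal LM in 4-bit precision and wrap it with LoRA adapters."""
    # Heavy imports are deferred so this cell can be defined without a GPU.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
    from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

    bnb_config = BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_compute_dtype=torch.float16,
    )
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, quantization_config=bnb_config, device_map="auto"
    )
    model = prepare_model_for_kbit_training(model)
    model = get_peft_model(model, LoraConfig(**lora_kwargs))
    model.print_trainable_parameters()  # reports trainable vs. total parameters
    return model, tokenizer
```

Calling `load_model_with_lora()` downloads the weights and requires a GPU along with access to the checkpoint on the Hugging Face Hub. Training would then typically proceed with a trainer from `trl`, whose arguments vary between versions, so follow the notebook for that step.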

Google Gemma Fine-Tuning

Load the gemma-7b model as described in the notebook.
Follow the instructions to fine-tune the model on your dataset (either a custom one or a Hugging Face dataset).
Test the outputs of the fine-tuned model.
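
For a custom dataset stored as plain text, one possible preparation step is to split the file into one record per example. The sketch below assumes examples are separated by blank lines, which may not match the layout of `mental_health_data.txt`; the function name `text_to_records` is hypothetical.

```python
def text_to_records(raw_text):
    """Split raw text into non-empty, whitespace-trimmed paragraphs."""
    blocks = (block.strip() for block in raw_text.split("\n\n"))
    return [{"text": block} for block in blocks if block]


sample = "First example.\n\n\nSecond example.\n"
records = text_to_records(sample)
print(records)

# With the `datasets` library installed, the records can be wrapped as a dataset:
#   from datasets import Dataset
#   dataset = Dataset.from_list(records)
```

Each record carries a single `text` field, a common input shape for supervised fine-tuning, though the notebooks may use a different field name or prompt template.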

Additional Notes

Fine-tuning large language models requires careful consideration of resource constraints, especially in environments like Google Colab.
The notebooks provide insights into parameter-efficient fine-tuning, data preparation, and monitoring training through TensorBoard.
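
To make the resource constraint concrete, the back-of-the-envelope sketch below estimates the memory needed for the weights of a nominal 7-billion-parameter model and the number of extra parameters LoRA adds to a single weight matrix. The rank of 16 is an illustrative assumption; 4096 is the hidden size of Llama 2 7B. Activations, optimizer state, and framework overhead are not counted.

```python
def weight_memory_gib(n_params, bits_per_param):
    """Memory needed to hold the weights alone, in GiB."""
    return n_params * bits_per_param / 8 / 2**30


def lora_params(d_out, d_in, rank):
    """Extra trainable parameters LoRA adds to one d_out x d_in weight matrix."""
    # LoRA learns B (d_out x rank) and A (rank x d_in) instead of updating W.
    return rank * (d_out + d_in)


n = 7_000_000_000  # nominal parameter count
print(f"fp16 : {weight_memory_gib(n, 16):.1f} GiB")
print(f"4-bit: {weight_memory_gib(n, 4):.1f} GiB")

full = 4096 * 4096
extra = lora_params(4096, 4096, rank=16)
print(f"LoRA adds {extra:,} params vs {full:,} in the full matrix ({extra / full:.2%})")
```

Under these assumptions, half-precision weights alone take roughly 13 GiB, which leaves little headroom on a 16 GB T4, while 4-bit weights take about 3.3 GiB. This is why quantized loading combined with small LoRA adapters is a practical choice in Colab.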

Contributing

Contributions that improve the notebooks or the fine-tuning process are welcome. Please follow the standard pull request process.

TVR28/LLama2_Finetuning_PEFT_LoRA
