GitHub - felixstander/xtuner at 8ff60e982a4342cdab2694cfb44b4c80ec3eedcf

👋 join us on Twitter, Discord and WeChat

🎉 News

[2023.08.xx] XTuner is released, with multiple fine-tuned adapters on HuggingFace.

📖 Introduction

XTuner is a toolkit for efficiently fine-tuning large language models (LLMs), developed by the MMRazor and MMDeploy teams.

Efficiency: Supports LLM fine-tuning on consumer-grade GPUs. Fine-tuning a 7B LLM requires as little as 8GB of GPU memory, so users can fine-tune custom LLMs on nearly any GPU (even free resources, e.g., Colab).
Versatility: Supports various LLMs (InternLM, Llama2, ChatGLM2, Qwen, Baichuan, ...), datasets (MOSS_003_SFT, Colorist, Code Alpaca, Arxiv GenTitle, Chinese Law, OpenOrca, Open-Platypus, ...) and algorithms (QLoRA, LoRA), allowing users to choose the solution that best fits their requirements.
Compatibility: Compatible with the DeepSpeed 🚀 and HuggingFace 🤗 training pipelines, enabling easy integration.

🌟 Demos

QLoRA Fine-tune
Plugin-based Chat
Ready-to-use models and datasets from XTuner API

🔥 Supports

Models: InternLM, InternLM-Chat, Llama, Llama2, Llama2-Chat, ChatGLM2, Qwen, Qwen-Chat, Baichuan-7B, Baichuan-13B-Base, Baichuan-13B-Chat, ...
SFT Datasets: MOSS-003-SFT 🔧, Colorist 🎨, Code Alpaca, Arxiv GenTitle, Chinese Law, OpenOrca, Alpaca en / zh, oasst1, Medical Dialogue, Open-Platypus, ...
Data Pipelines: Incremental Pre-training, Single-turn Conversation SFT, Multi-turn Conversation SFT
Algorithms: QLoRA, LoRA, Full parameter fine-tune

🛠️ Quick Start

Installation

Install XTuner with pip

pip install xtuner

or from source

git clone https://github.com/InternLM/xtuner.git
cd xtuner
pip install -e .
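
After either install route, it can help to confirm that the `xtuner` command is actually on the `PATH` before moving on. The following is a minimal sketch using only standard shell built-ins; `check_cli` is a hypothetical helper name, not something XTuner ships.

```shell
# check_cli is a hypothetical helper, not part of XTuner.
# It reports whether a command can be resolved on the current PATH.
check_cli() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "found: $1"
    else
        echo "missing: $1"
        return 1
    fi
}

# Suggest a reinstall if the entry point cannot be resolved.
check_cli xtuner || echo "xtuner not on PATH; try re-running the install step"
```

If the command is reported missing inside a virtual environment, the environment is probably not activated in the current shell.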

Chat

Examples of Plugins-based Chat 🔥🔥🔥

XTuner provides tools for chatting with pretrained or fine-tuned LLMs.

For example, to start a chat with Llama2-7B-Plugins, run

xtuner chat hf meta-llama/Llama-2-7b-hf --adapter xtuner/Llama-2-7b-qlora-moss-003-sft --bot-name Llama2 --prompt-template moss_sft --with-plugins calculate solve search --command-stop-word "<eoc>" --answer-stop-word "<eom>" --no-streamer

For more examples, please see chat.md.
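
The one-line command above is easier to review and edit when each option sits on its own line. The sketch below wraps the same arguments in a function; `chat_llama2_plugins` and the `RUN` switch are hypothetical names, and by default the command is printed rather than executed, since a real run needs XTuner installed and the model weights available.

```shell
# RUN is a hypothetical switch: when unset, the command is only printed.
# Set RUN="" to start the chat for real.
chat_llama2_plugins() {
    ${RUN-echo} xtuner chat hf meta-llama/Llama-2-7b-hf \
        --adapter xtuner/Llama-2-7b-qlora-moss-003-sft \
        --bot-name Llama2 \
        --prompt-template moss_sft \
        --with-plugins calculate solve search \
        --command-stop-word "<eoc>" \
        --answer-stop-word "<eom>" \
        --no-streamer
}

chat_llama2_plugins
```

In print mode the shell strips the quotes around the stop words, so the echoed line is for inspection only and is not meant to be pasted back verbatim.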

Fine-tune

XTuner supports efficient fine-tuning (e.g., QLoRA) of LLMs.

Step 0, prepare the config. XTuner provides many ready-to-use configs, all of which can be listed with
```
xtuner list-cfg
```
Or, if the provided configs do not meet your requirements, copy one to a directory of your choice and modify the copy:
```
xtuner copy-cfg ${CONFIG_NAME} ${SAVE_DIR}
```
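
As a worked example of Step 0, the sketch below narrows the config list to one algorithm and then copies a config. It assumes `xtuner list-cfg` writes config names to standard output, which is not confirmed here; `filter_cfgs` and `./my_configs` are hypothetical names chosen for illustration, and the config name is the one used in the training example in this README.

```shell
# filter_cfgs is a hypothetical helper: keep only the lines that contain a
# keyword (case-insensitive). Assumes config names arrive on stdin.
filter_cfgs() {
    grep -i -- "$1"
}

CONFIG_NAME=internlm_7b_qlora_oasst1_e3   # config name used elsewhere in this README
SAVE_DIR=./my_configs                     # hypothetical target directory

# Only call XTuner when it is installed, so the sketch is safe to paste anywhere.
if command -v xtuner >/dev/null 2>&1; then
    xtuner list-cfg | filter_cfgs qlora
    mkdir -p "$SAVE_DIR"
    xtuner copy-cfg "$CONFIG_NAME" "$SAVE_DIR"
fi
```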

Step 1, start fine-tuning. For example, to start QLoRA fine-tuning of InternLM-7B on the oasst1 dataset, run

# On a single GPU
xtuner train internlm_7b_qlora_oasst1_e3
# On multiple GPUs (DIST)
NPROC_PER_NODE=${GPU_NUM} xtuner train internlm_7b_qlora_oasst1_e3
# On multiple GPUs (SLURM)
srun ${SRUN_ARGS} xtuner train internlm_7b_qlora_oasst1_e3 --launcher slurm
For more examples, please see finetune.md.
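
The three launch variants above differ only in their prefix and flags, so a small wrapper can choose between them. The following sketch prints the command rather than running it; `build_train_cmd` is a hypothetical helper, and it assumes `SRUN_ARGS` is either unset or holds your cluster's `srun` options.

```shell
# build_train_cmd <config> [gpu_num] [launcher]
# Hypothetical helper: prints the launch command for the given setup.
build_train_cmd() {
    config="$1"
    gpus="${2:-1}"
    launcher="${3:-}"
    if [ "$launcher" = "slurm" ]; then
        # SRUN_ARGS is inserted only when it is set and non-empty.
        echo "srun ${SRUN_ARGS:+$SRUN_ARGS }xtuner train $config --launcher slurm"
    elif [ "$gpus" -gt 1 ]; then
        echo "NPROC_PER_NODE=$gpus xtuner train $config"
    else
        echo "xtuner train $config"
    fi
}

build_train_cmd internlm_7b_qlora_oasst1_e3 1
build_train_cmd internlm_7b_qlora_oasst1_e3 4
```

To execute instead of print, the output can be passed to `eval`, e.g. `eval "$(build_train_cmd internlm_7b_qlora_oasst1_e3 4)"`. Printing first makes it easy to check the command before committing GPU time.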

Deployment

Step 0, convert the pth adapter to a HuggingFace adapter:

xtuner convert adapter_pth2hf \
    ${CONFIG} \
    ${PATH_TO_PTH_ADAPTER} \
    ${SAVE_PATH_TO_HF_ADAPTER}

or merge the pth adapter directly into the pretrained LLM:

xtuner convert merge_adapter \
    ${CONFIG} \
    ${PATH_TO_PTH_ADAPTER} \
    ${SAVE_PATH_TO_MERGED_LLM} \
    --max-shard-size 2GB
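
Because both conversion commands take the config and the pth adapter as their first two arguments, a thin wrapper can check that the checkpoint exists before invoking XTuner. This is a sketch under stated assumptions: `convert_adapter` and the `RUN` switch are hypothetical, and the wrapper defaults to printing the command (`RUN` unset) instead of executing it.

```shell
# convert_adapter <config> <pth_adapter> <hf_save_path>
# Hypothetical wrapper around `xtuner convert adapter_pth2hf`.
# RUN is a hypothetical switch: when unset, the command is only printed;
# set RUN="" to execute it for real.
convert_adapter() {
    config="$1"
    pth="$2"
    out="$3"
    # Fail early with a clear message if the checkpoint path does not exist.
    if [ ! -e "$pth" ]; then
        echo "adapter checkpoint not found: $pth" >&2
        return 1
    fi
    ${RUN-echo} xtuner convert adapter_pth2hf "$config" "$pth" "$out"
}
```

A real conversion would then look like `RUN= convert_adapter "${CONFIG}" "${PATH_TO_PTH_ADAPTER}" "${SAVE_PATH_TO_HF_ADAPTER}"`, using the same placeholders as the commands above.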

Step 1, deploy the fine-tuned LLM with any other framework, such as LMDeploy 🚀.
```
pip install lmdeploy
python -m lmdeploy.pytorch.chat ${NAME_OR_PATH_TO_LLM} \
    --max_new_tokens 256 \
    --temperature 0.8 \
    --top_p 0.95 \
    --seed 0
```
🎯 We are working closely with LMDeploy to implement the deployment of plugins-based chat!

Evaluation

We recommend using OpenCompass, a comprehensive and systematic LLM evaluation library, which currently supports 50+ datasets with about 300,000 questions.

🤝 Contributing

We appreciate all contributions to XTuner. Please refer to CONTRIBUTING.md for the contributing guidelines.

🎖️ Acknowledgement

License

This project is released under the Apache License 2.0. Please also adhere to the licenses of the models and datasets being used.
