forked from Vision-CAIR/MiniGPT-4
1 parent 8718057, commit 307f0ee
Showing 2 changed files with 33 additions and 2 deletions.
@@ -0,0 +1,30 @@
## How to Prepare Vicuna Weights
Vicuna is an open-source LLaMA-based LLM with performance close to that of ChatGPT.
We currently use the v0 version of Vicuna-13B.

To prepare Vicuna’s weights, first download Vicuna’s **delta** weights from [https://huggingface.co/lmsys/vicuna-13b-delta-v0](https://huggingface.co/lmsys/vicuna-13b-delta-v0). If you have git-lfs installed (https://git-lfs.com), this can be done with:

```
git lfs install
git clone https://huggingface.co/lmsys/vicuna-13b-delta-v0
```
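If git-lfs was not active during the clone, the large weight files can end up as small text pointer stubs instead of the real data. The following is a minimal sketch, not part of the original instructions, for spotting that case. The function name `find_lfs_pointers` is hypothetical; the only fact it relies on is that a Git LFS pointer file begins with the line `version https://git-lfs.github.com/spec/v1`.

```python
from pathlib import Path

# First line of every Git LFS pointer file, per the Git LFS pointer format.
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def find_lfs_pointers(repo_dir):
    """Return files under repo_dir that are still Git LFS pointer stubs.

    Hypothetical helper for illustration; pass it the cloned
    vicuna-13b-delta-v0 directory.
    """
    root = Path(repo_dir)
    pointers = []
    for path in sorted(root.rglob("*")):
        # Skip directories and Git's own metadata folder.
        if not path.is_file() or ".git" in path.relative_to(root).parts:
            continue
        # Only the first few bytes are read, so multi-GB weight files are cheap to check.
        with path.open("rb") as f:
            head = f.read(len(LFS_POINTER_PREFIX))
        if head == LFS_POINTER_PREFIX:
            pointers.append(path)
    return pointers

# Example (path is a placeholder):
# leftover = find_lfs_pointers("/path/to/vicuna-13b-delta-v0/")
# An empty list suggests the real files were downloaded.
```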

Note that these are not directly usable working weights, but the difference between the working weights and the original LLaMA-13B weights. (Due to LLaMA’s license terms, we cannot distribute the LLaMA weights.)
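Since the delta is described as the difference between the working weights and the original weights, recovering the working weights amounts to adding the delta back onto the original, parameter by parameter. The toy sketch below illustrates only that arithmetic. It uses plain Python lists in place of real tensors, the parameter names are made up, and `merge_delta` is a hypothetical helper rather than the tool used by the Vicuna team.

```python
def merge_delta(base, delta):
    """Return working weights as the element-wise sum base + delta.

    Both arguments map parameter names to flat lists of floats
    (an assumption made to keep the illustration dependency-free).
    """
    if base.keys() != delta.keys():
        raise ValueError("base and delta must contain the same parameter names")
    working = {}
    for name, base_values in base.items():
        delta_values = delta[name]
        if len(base_values) != len(delta_values):
            raise ValueError(f"size mismatch for parameter {name!r}")
        # working = original + (working - original)
        working[name] = [b + d for b, d in zip(base_values, delta_values)]
    return working


# Toy values with hypothetical parameter names.
base = {"layer.weight": [1.0, 2.0, 3.0], "layer.bias": [0.5]}
delta = {"layer.weight": [0.5, -1.0, 0.0], "layer.bias": [0.25]}
working = merge_delta(base, delta)
print(working)  # {'layer.weight': [1.5, 1.0, 3.0], 'layer.bias': [0.75]}
```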

Then, obtain the original LLaMA-13B weights in the HuggingFace format, either by following the instructions provided by HuggingFace [here](https://huggingface.co/docs/transformers/main/model_doc/llama) or from the Internet.

When both sets of weights are ready, we can use tools from the Vicuna team to create the real working weights.
First, install the version of their library that is compatible with v0 Vicuna:

```
pip install git+https://github.com/huggingface/[email protected]
```

Then, run the following command to create the final working weights:

```
python -m fastchat.model.apply_delta --base /path/to/llama-13b-hf/ --target /path/to/save/working/vicuna/weight/ --delta /path/to/vicuna-13b-delta-v0/
```
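A mistyped path is easy to miss in a long command like this one, so it can help to check the input directories before launching the merge. The sketch below is one possible wrapper, not part of the original instructions. The helper names `build_apply_delta_command` and `run_apply_delta` are hypothetical; the module name and the `--base`, `--target`, and `--delta` flags are taken from the command above.

```python
import subprocess
import sys
from pathlib import Path


def build_apply_delta_command(base_dir, target_dir, delta_dir):
    """Build the apply_delta command after checking that the inputs exist."""
    for label, directory in (("base", base_dir), ("delta", delta_dir)):
        if not Path(directory).is_dir():
            raise FileNotFoundError(f"{label} directory not found: {directory}")
    # sys.executable keeps the call in the current Python environment,
    # where the library is assumed to be installed.
    return [
        sys.executable, "-m", "fastchat.model.apply_delta",
        "--base", str(base_dir),
        "--target", str(target_dir),
        "--delta", str(delta_dir),
    ]


def run_apply_delta(base_dir, target_dir, delta_dir):
    """Run the merge and raise CalledProcessError if it exits non-zero."""
    cmd = build_apply_delta_command(base_dir, target_dir, delta_dir)
    subprocess.run(cmd, check=True)

# Example (paths are the placeholders from the command above):
# run_apply_delta("/path/to/llama-13b-hf/",
#                 "/path/to/save/working/vicuna/weight/",
#                 "/path/to/vicuna-13b-delta-v0/")
```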

Now you are good to go!