Personal copilot blog #1413
Conversation
personal_copilot.md (outdated excerpt):
> ---
> title: "HugCoder 🤗: Train Your Own Coding Assistant 🚀"
> thumbnail: /blog/assets/159_safecoder/thumbnail.jpg
To be updated. An entry also needs to be added to `_blog.yml`.
Left a couple more comments on top of the already great comments here.
Super interesting! I only did a first pass. I agree with @sayakpaul and @BenjaminBossan that the memory computations and training approach (QLoRA vs full fine-tuning) might require a bit more hand-holding.
We potentially don't need to show results from all the experiments you did. For example, we can recommend QLoRA as the cheapest and fastest method, and direct interested readers to the traditional fine-tuning scripts.
personal_copilot.md (outdated excerpt):
> Voila! ⭐️
>
> The demo at the start is this 1B model that is running locally on my Mac laptop.
How many tokens per second are you getting? I think it'd be interesting for the community, as it's a common comparison metric.
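(For anyone who wants to report this, throughput can be measured with a quick script along these lines. This is only a minimal sketch: the checkpoint name and prompt are placeholders, not the exact ones from the post.)

```python
import time
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "bigcode/starcoderbase-1b"  # placeholder 1B checkpoint for illustration
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

prompt = "def fibonacci(n):"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

start = time.perf_counter()
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=False)
elapsed = time.perf_counter() - start

# Count only the newly generated tokens, not the prompt tokens.
new_tokens = outputs.shape[1] - inputs["input_ids"].shape[1]
print(f"{new_tokens / elapsed:.1f} tokens/sec")
```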
Really great and comprehensive blog post! Previous comments have already covered most points.
personal_copilot.md (outdated excerpt):
> To keep the serialization of this content relatively memory-friendly, we used chunking and the feather format. Refer to [this script](https://github.com/sayakpaul/hf-codegen/blob/main/data/prepare_dataset.py) for the full implementation.
>
> Our dataset prepared this way is available [here](https://huggingface.co/datasets/sayakpaul/hf-codegen-v2) and it looks like so:
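(To illustrate the chunked Feather serialization idea referenced above, here is a minimal sketch. It is not the linked prepare_dataset.py; the output directory, record schema, and chunk size are made up for illustration.)

```python
import pandas as pd
from pathlib import Path

CHUNK_SIZE = 10_000  # rows per Feather file; illustrative value


def serialize_in_chunks(rows, out_dir="feather_chunks"):
    """Write an iterable of dicts (e.g. {"repo": ..., "path": ..., "content": ...})
    to a series of Feather files so the full corpus never has to sit in memory."""
    Path(out_dir).mkdir(exist_ok=True)
    buffer, chunk_id = [], 0
    for row in rows:
        buffer.append(row)
        if len(buffer) == CHUNK_SIZE:
            pd.DataFrame(buffer).to_feather(f"{out_dir}/chunk_{chunk_id}.feather")
            buffer, chunk_id = [], chunk_id + 1
    if buffer:  # flush the last partial chunk
        pd.DataFrame(buffer).to_feather(f"{out_dir}/chunk_{chunk_id}.feather")
```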
It would be cool to have the cards of the datasets mentioned in the blog filled in.
Great content! Mostly some high-level feedback to make the post a bit easier to follow and read:
- I would avoid adding code just for completeness; you can always link to a repo. Only show code if it is very interesting or useful and you explain it in detail. Inference code appears a few times and I don't think it's necessary for the blog.
- The "Dance of LoRA" section shows an interesting approach, but it is very long. I'd consider shortening it a bit and showing only the most interesting findings and combinations. There are a lot of examples, and after 2-3 it becomes harder to stay focused on them. Also consider showing them as code blocks rather than screenshots; it looks a bit nicer in the post.
- It's fine to show a few examples of where it works and where it doesn't, but I would not take the conclusions too far based on examples alone. We do have benchmarks to check how well models work for chat or code completion, and ultimately one should rely on those to guide decisions. Maybe this is a bit out of scope for this project, but a note would be great.

Hope this helps!
Co-authored-by: Pedro Cuenca <[email protected]> Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: Benjamin Bossan <[email protected]> Co-authored-by: Loubna Ben Allal <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
Hello, I've addressed all the comments. I'm planning to release the blog tomorrow (Monday).
personal_copilot.md (outdated excerpt):
> ## Full Finetuning
>
> We will look at how to do full fine-tuning of starcoder-15B on 8 A100 80GB GPUs using the PyTorch Fully Sharded Data Parallel (FSDP) technique. For more information on FSDP, please refer to [Fine-tuning Llama 2 70B using PyTorch FSDP](https://huggingface.co/blog/ram-efficient-pytorch-fsdp) and [Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel](https://huggingface.co/blog/pytorch-fsdp).
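(For context on what the FSDP wiring can look like inside a training script, here is a minimal sketch using 🤗 Accelerate's FSDP plugin. It is not the blog's actual training script; the state-dict settings shown are illustrative choices.)

```python
from accelerate import Accelerator, FullyShardedDataParallelPlugin
from torch.distributed.fsdp.fully_sharded_data_parallel import FullStateDictConfig

# Gather full (unsharded) state dicts on rank 0 when saving checkpoints.
fsdp_plugin = FullyShardedDataParallelPlugin(
    state_dict_config=FullStateDictConfig(offload_to_cpu=True, rank0_only=True),
)
accelerator = Accelerator(fsdp_plugin=fsdp_plugin)

# The model, optimizer, and dataloaders are then wrapped as usual, e.g.:
# model, optimizer, train_dl = accelerator.prepare(model, optimizer, train_dl)
# and the script is launched across the 8 GPUs with `accelerate launch`.
```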
Let's maintain consistency when referring to model checkpoints. Let's maybe follow `bigcode/starcoder15B`.
personal_copilot.md (outdated excerpt):
> | Model | Pass@1 |
> |---|---|
What does `Pass@1` denote?
personal_copilot.md (outdated excerpt):
> 4. Dataset: [smangrul/hf-stack-v1](https://huggingface.co/datasets/smangrul/hf-stack-v1)
> 5. Trained Model: [smangrul/peft-lora-starcoder15B-v2-personal-copilot-A100-40GB-colab](https://huggingface.co/smangrul/peft-lora-starcoder15B-v2-personal-copilot-A100-40GB-colab)
>
> The command to launch training is given at [run_peft.sh](https://github.com/pacman100/DHS-LLM-Workshop/blob/main/personal_copilot/training/run_peft.sh). The total training time was **12.5 Hours**. Taking the cost of **$1.10 / hr** based on [lambdalabs](https://lambdalabs.com/service/gpu-cloud/pricing), the total cost would be **$13.75**. That's pretty good 🚀! In terms of cost, it's **7.8X** lower than the cost for full fine-tuning.
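(For readers curious what sits behind a QLoRA run like the one linked above, the core setup looks roughly like this. It is only a sketch with `transformers` + `peft`: the checkpoint name and LoRA hyperparameters are illustrative, not the exact values from run_peft.sh.)

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# Load the frozen base model in 4-bit NF4 so it fits on a single 40GB A100.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "bigcode/starcoder",  # illustrative checkpoint
    quantization_config=bnb_config,
)

# Attach small trainable LoRA adapters on top of the quantized weights.
lora_config = LoraConfig(
    r=8,
    lora_alpha=32,
    lora_dropout=0.1,
    target_modules=["c_proj", "c_attn", "q_attn"],  # illustrative target modules
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only a small fraction of parameters is trainable
```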
We have already talked about the memory requirements with and without QLoRA, so I guess it's okay to skip that part here? You have already done it, but just confirming that we don't want to add a sentence about the memory part.
Here, we are comparing the cost of training. I think this is an important metric from the end user's point of view.
Left some comments.
I still think the blog reads a bit heavy. I wouldn't mind splitting it up into multiple blogs for easier readability with specific focus areas:
- Creating a personal code assistant
- Deployment and a VS Code extension
- Mixing of LoRAs for Code LLMs
WDYT?
Hello Sayak, I think there is no need to split this into multiple blog posts, as each sub-post on its own would not carry much signal. I like the current end-to-end structure of the blog. Readers can easily skip sections and come back to the same blog to pick it up when interested.
Co-authored-by: Sayak Paul <[email protected]>
Very nice, left some nits.
Co-authored-by: Pedro Cuenca <[email protected]>
What does this PR do?