Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
Muennighoff authored Aug 15, 2023
1 parent 0712e44 commit 3334119
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@ This repository provides an overview of all components from the paper [OctoPack:
- [Data](#data)
- [CommitPack](#commitpack)
- [CommitPackFT](#commitpackft)
- [Other](#other)
- [Other](#other)
- [Evaluation](#evaluation)
- [Run](#run)
- [Creation](#creation)
Expand Down Expand Up @@ -75,7 +75,7 @@ CommitPackFT is uploaded [here](https://huggingface.co/datasets/bigcode/commitpa
1. **Prepare:** Download [CommitPack](https://huggingface.co/datasets/bigcode/commitpack) via e.g. `git clone bigcode/commitpack` or follow all the steps above to recreate it.
2. **Filter:** Run `python dataset/commitpackft/commitpackft_filters1.py` followed by `python dataset/commitpackft/commitpackft_filters2.py`. You may want to modify some of the global variables defined in the scripts.

#### Other
### Other

- StarCoder Self-Instruct: Uploaded [here](https://huggingface.co/datasets/codeparrot/self-instruct-starcoder), to recreate see [this repository](https://github.com/ArmelRandy/Self-instruct).
- xP3x: Uploaded [here](https://huggingface.co/datasets/bigcode/xp3x-octopack), to recreate see the script at `dataset/xp3x/filter_xp3x.py`.
Expand Down

0 comments on commit 3334119

Please sign in to comment.