Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
Muennighoff authored Aug 18, 2023
1 parent 29be744 commit f8c8a60
Showing 1 changed file with 11 additions and 0 deletions.
11 changes: 11 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,7 @@ This repository provides an overview of all components from the paper [OctoPack:
- [SantaCoder Pretraining (SantaCoderPack)](#santacoder-pretraining-santacoderpack)
- [Other](#other-1)
- [Visuals](#visuals)
- [Licenses](#licenses)
- [Citation](#citation)

<!-- /TOC -->
Expand Down Expand Up @@ -258,6 +259,16 @@ Tables:
- Table 4: Create via `visual/distribution_languages.py`
- Other Tables: Manual

## Licenses

Everything is licensed as permissively as possible to us.

CommitPack, CommitPackFT, HumanEvalPack, and all code are licensed under the MIT License of this repository. Note that each sample within CommitPack and CommitPackFT has its own license corresponding to the repository it stems from as indicated by the `license` field. All samples stem from permissively licensed repositories. You can check the [paper appendix](https://arxiv.org/abs/2308.07124) for the licenses we filtered for.

OctoCoder is licensed under the [same license as StarCoder](https://huggingface.co/spaces/bigcode/bigcode-model-license-agreement) (Commercial except for use cases deemed harmful).

OctoGeeX is licensed under the [same license as CodeGeeX2](https://huggingface.co/bigcode/octogeex#license) (Commercial but a form needs to be submitted).

## Citation

```bibtex
Expand Down

0 comments on commit f8c8a60

Please sign in to comment.