Skip to content

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

License

Notifications You must be signed in to change notification settings

fromSun2Moon/kogpt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

KoGPT

KakaoBrain License: AGPL v3 License: Commercial License: CC BY-NC-ND 4.0

KakaoBrain Korean Generative Pre-Training Models

Usage

Experiments

In-context Few-Shots

Models #params NSMC (Acc.) YNAT (F1) KLUE-STS (F1)
HyperCLOVA[1] 1.3B 83.9 58.7 60.9
HyperCLOVA[1] 6.9B 83.8 67.5 59.3
HyperCLOVA[1] 13.0B 87.9 67.9 60.0
HyperCLOVA[1] 39.0B 88.0 71.4 61.6
HyperCLOVA[1] 82.0B 88.2 72.7 65.1
Ours 6.0B 87.8 78.0 64.3

Finetuning

Models #params method NSMC (Acc.)
SKT-AI/KoGPT2[2] 125M finetuning 93.3
HyperCLOVA[1] 1.3B p-tuning 91.7
HyperCLOVA[1] 39.0B p-tuning 93.0
Ours 6.0B finetuning 95.7

Citation

If you apply this library or model to any project and research, please cite our code:

@article{kakaobrain2021kogpt,
  title         = {KakaoBrain Korean Generative Pre-Training}
  author        = {Ildoo Kim and Gunsoo Han and Jiyeon Ham and Woonhyuk Baek},
  year          = {2021},
  howpublished  = {\url{https://github.com/kakaobrain/kogpt}},
}

References

[1] HyperCLOVA: Kim, Boseop, et al. "What changes can large-scale language models bring? intensive study on hyperclova: Billions-scale korean generative pretrained transformers." arXiv preprint arXiv:2109.04650 (2021).
[2] SKT-AI/KoGPT2: "SKT-AI/KoGPT2: Korean GPT-2 pretrained cased (KoGPT2)." https://github.com/SKT-AI/KoGPT2 (2021).

License

The source code of KakaoBrain KoGPT are licensed under AGPL 3.0 and Commercial License.
The pretrained wieghts of KakaoBrain KoGPT are licensed under CC-BY-NC-ND 4.0 라이선스 License.

카카오브레인 KoGPT소스코드(source code)AGPL 3.0Commercial 라이선스 하에 공개되어 있습니다.
카카오브레인 KoGPT사전학습된 가중치(pretrained weights)CC-BY-NC-ND 4.0 라이선스 라이선스 하에 공개되어 있습니다.
모델 및 코드, 사전학습된 가중치를 사용할 경우 라이선스 내용을 준수해 주십시오. 라이선스 전문은 LICENSE.agpl-3.0, LICENSE.commercial, LICENSE.cc-by-nc-nd-4.0 파일에서 확인하실 수 있습니다.

About

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 100.0%