Highlights
- Pro
Stars
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
๐ค Pretrained BERT model & WordPiece tokenizer trained on Korean Comments ํ๊ตญ์ด ๋๊ธ๋ก ํ๋ฆฌํธ๋ ์ด๋ํ BERT ๋ชจ๋ธ๊ณผ ๋ฐ์ดํฐ์
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
ใํ ๊ถ์ผ๋ก ๋๋ด๋ ์ค์ LLM ํ์ธํ๋ใ ์์ ์ฝ๋
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
Unsupervised text tokenizer for Neural Network-based text generation.
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery ๐งโ๐ฌ
Inspect: A framework for large language model evaluations
๊ฐ์ง์ฐ๊ตฌ์ ์ธ๊ณผ์ถ๋ก ํ ํน๊ฐ ๋ฐ ๋ฐํ์๋ฃ ๋ชจ์์ ๋๋ค.
Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
Korean SAT leader board
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Convert PDF to markdown + JSON quickly with high accuracy
๋ชจ๋์ ํ๊ตญ์ด ํ ์คํธ ๋ถ์ with ํ์ด์ฌ
Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list โฆ
huggingface์ ์๋ ํ๊ตญ์ด ๋ฐ์ดํฐ ์ธํธ
Huly โ All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)
All Algorithms implemented in Python