- Paris, France
- https://huggingface.co/eliebak
- @eliebakouch
- in/eliebak
Highlights
- Pro
-
-
awesome-open-source-lms Public
Forked from allenai/awesome-open-source-lmsFriends of OLMo and their links.
Creative Commons Attribution 4.0 International UpdatedDec 10, 2024 -
nanotron Public
Forked from huggingface/nanotronMinimalistic large language model 3D-parallelism training
Python Apache License 2.0 UpdatedDec 2, 2024 -
-
lingua Public
Forked from facebookresearch/linguaMeta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Python BSD 3-Clause "New" or "Revised" License UpdatedNov 2, 2024 -
entropix Public
Forked from xjdr-alt/entropixEntropy Based Sampling and Parallel CoT Decoding
TypeScript Apache License 2.0 UpdatedOct 9, 2024 -
lighteval Public
Forked from huggingface/lightevalLightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
Python MIT License UpdatedSep 19, 2024 -
mergekit Public
Forked from arcee-ai/mergekitTools for merging pretrained large language models.
Python GNU Lesser General Public License v3.0 UpdatedSep 16, 2024 -
DistillKit Public
Forked from arcee-ai/DistillKitAn Open Source Toolkit For LLM Distillation
Python GNU Affero General Public License v3.0 UpdatedAug 17, 2024 -
EasyContext Public
Forked from jzhang38/EasyContextMemory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Python Apache License 2.0 UpdatedJul 26, 2024 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedJul 23, 2024 -
llama.cpp Public
Forked from ggml-org/llama.cppLLM inference in C/C++
C++ MIT License UpdatedJul 15, 2024 -
bigcode-evaluation-harness Public
Forked from bigcode-project/bigcode-evaluation-harnessA framework for the evaluation of autoregressive code generation language models.
Python Apache License 2.0 UpdatedJun 29, 2024 -
build-nanogpt Public
Forked from karpathy/build-nanogptVideo+code lecture on building nanoGPT from scratch
Python UpdatedJun 16, 2024 -
llm.c Public
Forked from karpathy/llm.cLLM training in simple, raw C/CUDA
Cuda MIT License UpdatedJun 14, 2024 -
nanoGPT Public
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License UpdatedJun 8, 2024 -
modded-nanogpt Public
Forked from KellerJordan/modded-nanogptGPT-2 (124M) quality in 5B tokens
Python UpdatedJun 7, 2024