-
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python Apache License 2.0 UpdatedJan 28, 2025 -
simpleRL-reason Public
Forked from hkust-nlp/simpleRL-reasonThis is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Python MIT License UpdatedJan 28, 2025 -
TinyZero Public
Forked from Jiayi-Pan/TinyZeroClean, accessible reproduction of DeepSeek R1-Zero
Python Apache License 2.0 UpdatedJan 26, 2025 -
inspect_ai Public
Forked from UKGovernmentBEIS/inspect_aiInspect: A framework for large language model evaluations
Python MIT License UpdatedJan 8, 2025 -
sae-auto-interp Public
Forked from EleutherAI/sae-auto-interpJupyter Notebook Apache License 2.0 UpdatedDec 18, 2024 -
SAELens Public
Forked from jbloomAus/SAELensTraining Sparse Autoencoders on Language Models
Jupyter Notebook MIT License UpdatedDec 8, 2024 -
entropix Public
Forked from xjdr-alt/entropixEntropy Based Sampling and Parallel CoT Decoding
TypeScript Apache License 2.0 UpdatedOct 27, 2024 -
lighteval Public
Forked from huggingface/lightevalLighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Python MIT License UpdatedOct 25, 2024 -
entropix-smollm Public
Forked from SinatrasC/entropix-smollmsmolLM with Entropix sampler on pytorch
Jupyter Notebook Apache License 2.0 UpdatedOct 23, 2024 -
SAE-based-representation-engineering Public
Forked from yuzhaouoe/SAE-based-representation-engineeringSteering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Python MIT License UpdatedOct 22, 2024 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models.
Python MIT License UpdatedOct 20, 2024 -
orbit Public
Forked from andymatuschak/orbitExperimental spaced repetition platform for exploring ideas in memory augmentation and programmable attention
TypeScript Other UpdatedOct 14, 2024 -
smol-podcaster Public
Forked from FanaHOVA/smol-podcastersmol-podcaster is your autonomous podcast production intern 🐣
Python MIT License UpdatedOct 12, 2024 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedOct 5, 2024 -
optillm Public
Forked from codelion/optillmOptimizing inference proxy for LLMs
Python Apache License 2.0 UpdatedSep 15, 2024 -
-
-
-
nanoGPT Public
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License UpdatedAug 19, 2024 -
build-nanogpt Public
Forked from karpathy/build-nanogptVideo+code lecture on building nanoGPT from scratch
Python UpdatedAug 13, 2024 -
buildware-ai Public
Forked from mckaywrigley/buildware-aiTypeScript MIT License UpdatedAug 5, 2024 -
GodMode Public
Forked from smol-ai/GodModeAI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.
TypeScript MIT License UpdatedJul 29, 2024 -
OpenDevin Public
Forked from All-Hands-AI/OpenHands🐚 OpenDevin: Code Less, Make More
Python MIT License UpdatedJun 13, 2024 -
distilabel Public
Forked from argilla-io/distilabelDistilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Python Apache License 2.0 UpdatedMay 23, 2024 -
-
FastChat Public
Forked from lm-sys/FastChatAn open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Python Apache License 2.0 UpdatedApr 19, 2024 -
llama_index Public
Forked from run-llama/llama_indexLlamaIndex is a data framework for your LLM applications
Python MIT License UpdatedMar 5, 2024 -
germanrag Public
GermanRAG - a German dataset for finetuning Retrieval Augmented Generation
-
lit-gpt Public
Forked from Lightning-AI/litgptHackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-l…
Python Apache License 2.0 UpdatedJan 26, 2024 -
direct-preference-optimization Public
Forked from eric-mitchell/direct-preference-optimizationReference implementation for DPO (Direct Preference Optimization)
Python Apache License 2.0 UpdatedJan 26, 2024