rasdani

rasdani

14 followers · 18 following

Achievements

x3 x2

Achievements

x3 x2

OpenRLHF Public
Forked from OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python Apache License 2.0 Updated Jan 28, 2025
simpleRL-reason Public
Forked from hkust-nlp/simpleRL-reason

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python MIT License Updated Jan 28, 2025
TinyZero Public
Forked from Jiayi-Pan/TinyZero

Clean, accessible reproduction of DeepSeek R1-Zero

Python Apache License 2.0 Updated Jan 26, 2025
inspect_ai Public
Forked from UKGovernmentBEIS/inspect_ai

Inspect: A framework for large language model evaluations

Python MIT License Updated Jan 8, 2025
sae-auto-interp Public
Forked from EleutherAI/sae-auto-interp

Jupyter Notebook Apache License 2.0 Updated Dec 18, 2024
SAELens Public
Forked from jbloomAus/SAELens

Training Sparse Autoencoders on Language Models

Jupyter Notebook MIT License Updated Dec 8, 2024
entropix Public
Forked from xjdr-alt/entropix

Entropy Based Sampling and Parallel CoT Decoding

TypeScript Apache License 2.0 Updated Oct 27, 2024
lighteval Public
Forked from huggingface/lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python MIT License Updated Oct 25, 2024
entropix-smollm Public
Forked from SinatrasC/entropix-smollm

smolLM with Entropix sampler on pytorch

Jupyter Notebook Apache License 2.0 Updated Oct 23, 2024
SAE-based-representation-engineering Public
Forked from yuzhaouoe/SAE-based-representation-engineering

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Python MIT License Updated Oct 22, 2024
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python MIT License Updated Oct 20, 2024
orbit Public
Forked from andymatuschak/orbit

Experimental spaced repetition platform for exploring ideas in memory augmentation and programmable attention

TypeScript Other Updated Oct 14, 2024
smol-podcaster Public
Forked from FanaHOVA/smol-podcaster

smol-podcaster is your autonomous podcast production intern 🐣

Python MIT License Updated Oct 12, 2024
sglang Public
Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python Apache License 2.0 Updated Oct 5, 2024
optillm Public
Forked from codelion/optillm

Optimizing inference proxy for LLMs

Python Apache License 2.0 Updated Sep 15, 2024
rStar Public
Forked from zhentingqi/rStar

Python MIT License Updated Sep 10, 2024
AnkiBrain Public
Forked from RosettaTechnologies/AnkiBrain

Python Updated Sep 6, 2024
inference-is-all-you-need Public

Python 2 1 MIT License Updated Sep 3, 2024
nanoGPT Public
Forked from karpathy/nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python MIT License Updated Aug 19, 2024
build-nanogpt Public
Forked from karpathy/build-nanogpt

Video+code lecture on building nanoGPT from scratch

Python Updated Aug 13, 2024
buildware-ai Public
Forked from mckaywrigley/buildware-ai

TypeScript MIT License Updated Aug 5, 2024
GodMode Public
Forked from smol-ai/GodMode

AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.

TypeScript MIT License Updated Jul 29, 2024
OpenDevin Public
Forked from All-Hands-AI/OpenHands

🐚 OpenDevin: Code Less, Make More

Python MIT License Updated Jun 13, 2024
distilabel Public
Forked from argilla-io/distilabel

Distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Python Apache License 2.0 Updated May 23, 2024
mmteb-wiki Public

Jupyter Notebook Updated May 14, 2024
FastChat Public
Forked from lm-sys/FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python Apache License 2.0 Updated Apr 19, 2024
llama_index Public
Forked from run-llama/llama_index

LlamaIndex is a data framework for your LLM applications

Python MIT License Updated Mar 5, 2024
germanrag Public

GermanRAG - a German dataset for finetuning Retrieval Augmented Generation

Python 6 Apache License 2.0 Updated Feb 4, 2024
lit-gpt Public
Forked from Lightning-AI/litgpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-l…

Python Apache License 2.0 Updated Jan 26, 2024
direct-preference-optimization Public
Forked from eric-mitchell/direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python Apache License 2.0 Updated Jan 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rasdani

Achievements

Achievements

Block or report rasdani

OpenRLHF Public

simpleRL-reason Public

TinyZero Public

inspect_ai Public

sae-auto-interp Public

SAELens Public

entropix Public

lighteval Public

entropix-smollm Public

SAE-based-representation-engineering Public

lm-evaluation-harness Public

orbit Public

smol-podcaster Public

sglang Public

optillm Public

rStar Public

AnkiBrain Public

inference-is-all-you-need Public

nanoGPT Public

build-nanogpt Public

buildware-ai Public

GodMode Public

OpenDevin Public

distilabel Public

mmteb-wiki Public

FastChat Public

llama_index Public

germanrag Public

lit-gpt Public

direct-preference-optimization Public