- Bern, Switzerland
- https://lewtun.github.io/blog/
- @_lewtun
Highlights
- Pro
-
datatrove Public
Forked from huggingface/datatroveFreeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Python Apache License 2.0 UpdatedNov 15, 2024 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
-
WebShop Public
Forked from princeton-nlp/WebShop[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Python MIT License UpdatedSep 3, 2024 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models.
Python MIT License UpdatedAug 20, 2024 -
MixEval Public
Forked from philschmid/MixEvalThe official evaluation suite and dynamic data release for MixEval.
Python UpdatedAug 12, 2024 -
FastChat Public
Forked from lm-sys/FastChatAn open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
-
-
SPIN Public
Forked from uclaml/SPINThe official implementation of Self-Play Fine-Tuning (SPIN)
Python Apache License 2.0 UpdatedFeb 23, 2024 -
distilabel Public
Forked from argilla-io/distilabel⚗️ AI Feedback framework for scalable LLM alignment
Python Apache License 2.0 UpdatedFeb 1, 2024 -
SubjQA Public
Forked from megagonlabs/SubjQAA question-answering dataset with a focus on subjective information
UpdatedJan 8, 2024 -
google-research Public
Forked from google-research/google-researchGoogle Research
Jupyter Notebook Apache License 2.0 UpdatedDec 22, 2023 -
alpaca_eval Public
Forked from tatsu-lab/alpaca_evalAn automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Jupyter Notebook Apache License 2.0 UpdatedNov 29, 2023 -
-
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedJun 19, 2023 -
pretraining-with-human-feedback Public
Forked from tomekkorbak/pretraining-with-human-feedbackCode accompanying the paper Pretraining Language Models with Human Preferences
-
deepcode Public
Machine learning on source code for the Rocket platform
-
hepml Public
Practical machine learning for physicists
-
dslectures Public
Course materials for introductory data science
-
stanford_alpaca Public
Forked from tatsu-lab/stanford_alpacaCode and documentation to train Stanford's Alpaca models, and generate the data.
Python Apache License 2.0 UpdatedMar 14, 2023 -
-
BIG-bench Public
Forked from google/BIG-benchBeyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Python Apache License 2.0 UpdatedFeb 22, 2023 -
chatgpt-failures Public
Forked from giuven95/chatgpt-failuresFailure archive for ChatGPT and similar models
Python UpdatedFeb 16, 2023 -
language-model-agents Public
Forked from Rallio67/language-model-agentsExperiments with generating opensource language model assistants
Jupyter Notebook Apache License 2.0 UpdatedFeb 10, 2023 -
-
chatty-lms Public
A Hugging Face Space to compare various dialogue-prompted language models
-
self-instruct Public
Forked from yizhongw/self-instructAligning pretrained language models with instruction data generated by themselves.
Python Apache License 2.0 UpdatedJan 10, 2023 -
Open-Assistant Public
Forked from LAION-AI/Open-AssistantPython Apache License 2.0 UpdatedJan 4, 2023 -
langchain Public
Forked from langchain-ai/langchain⚡ Building applications with LLMs through composability ⚡
-
awesome-rlhf Public
A curated list of resources dedicated to Reinforcement Learning from Human Feedback (RLHF).
-
following-instructions-human-feedback Public
Forked from openai/following-instructions-human-feedbackUpdatedDec 11, 2022