Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,290 165 Updated Feb 4, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 8,750 924 Updated Feb 6, 2025

isaac-sim / IsaacGymEnvs

Isaac Gym Reinforcement Learning Environments

Python 2,197 445 Updated Oct 26, 2024

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python 13,794 4,863 Updated Aug 9, 2024

princetonvisualai / RememberThePast-DatasetDistillation

Python 37 8 Updated Nov 19, 2022

RchalYang / offlinerl

Repo for offline reinforcement learning methods

Python 9 3 Updated Jul 21, 2020

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python 10,467 2,272 Updated Aug 5, 2024

hanjuku-kaso / awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

953 87 Updated May 23, 2024

CSCfi / singularity-recipes

Public recipe files for Apptainer containers used on CSC HPC environments

R 8 5 Updated Jan 14, 2025

rustdesk / rustdesk

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 81,141 11,397 Updated Feb 5, 2025

timoklein / redo

ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)

Python 25 4 Updated Oct 22, 2024

hakuhodo-technologies / scope-rl

SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection

Python 121 12 Updated Mar 18, 2024

nakamotoo / Cal-QL

official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning

Python 83 5 Updated Jul 31, 2024

mwxely / AIGS

AI-Generated Images as Data Source: The Dawn of Synthetic Era

TeX 149 11 Updated Dec 8, 2023

rinongal / textual_inversion

Jupyter Notebook 2,955 285 Updated Feb 27, 2023

BorgwardtLab / Neural-Persistence

Code for the paper 'Neural Persistence: A Complexity Measure for Deep Neural Networks Using Algebraic Topology'

Python 30 7 Updated Feb 25, 2019

Timothyxxx / Chain-of-ThoughtsPapers

A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

1,993 132 Updated Oct 5, 2023

bgavran / Category_Theory_Machine_Learning

List of papers studying machine learning through the lens of category theory

Python 1,320 79 Updated Feb 7, 2025

OPTML-Group / ILM-VP

[CVPR23] "Understanding and Improving Visual Prompting: A Label-Mapping Perspective" by Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zhang, and Sijia Liu

Python 52 13 Updated Sep 17, 2023

Shiye Lei LeavesLei

Stars

deepseek-ai / DeepSeek-V3

ycjing / Awesome-Model-Merging

kozakdenys / qr-code-styling

LeavesLei / OBD

kamenbliznashki / normalizing_flows

ikostrikov / pytorch-flows

BMPixel / moffee

LeavesLei / attentive_learning

opendilab / awesome-RLHF

huggingface / alignment-handbook

allenai / reward-bench

argilla-io / distilabel