View LeavesLei's full-sized avatar

👫 A curated list of Model Merging methods.

89 4 Updated Sep 16, 2024

Automatically generate your styled QR code in your web app.

TypeScript 1,674 514 Updated Jan 10, 2025

[NeurIPS 2024] Official repository for "Offline Behavior Distillation"

Python 3 Updated Oct 31, 2024

PyTorch implementations of density estimation algorithms: BNAF, Glow, MAF, RealNVP, planar flows

Python 611 103 Updated Jul 12, 2021

PyTorch implementations of algorithms for density estimation

Python 577 75 Updated May 13, 2021

moffee: Make Markdown Ready to Present

Python 1,021 47 Updated Nov 22, 2024

[TNNLS] Official repository for "Attentive Learning Facilitates Generalization of Neural Networks"

Python 4 Updated Jan 18, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,673 227 Updated Jan 27, 2025

Robust recipes to align language models with human and AI preferences

Python 4,964 427 Updated Nov 21, 2024

RewardBench: the first evaluation tool for reward models.

Python 499 57 Updated Jan 25, 2025

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,290 165 Updated Feb 4, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 8,750 924 Updated Feb 6, 2025

Isaac Gym Reinforcement Learning Environments

Python 2,197 445 Updated Oct 26, 2024

Python Implementation of Reinforcement Learning: An Introduction

Python 13,794 4,863 Updated Aug 9, 2024

Repo for offline reinforcement learning methods

Python 9 3 Updated Jul 21, 2020

An educational resource to help anyone learn deep reinforcement learning.

Python 10,467 2,272 Updated Aug 5, 2024

An index of algorithms for offline reinforcement learning (offline-rl)

953 87 Updated May 23, 2024

Public recipe files for Apptainer containers used on CSC HPC environments

R 8 5 Updated Jan 14, 2025

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 81,141 11,397 Updated Feb 5, 2025

ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (PyTorch)

Python 25 4 Updated Oct 22, 2024

SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection

Python 121 12 Updated Mar 18, 2024

Official implementation of the paper "Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning"

Python 83 5 Updated Jul 31, 2024

AI-Generated Images as Data Source: The Dawn of Synthetic Era

TeX 149 11 Updated Dec 8, 2023
Jupyter Notebook 2,955 285 Updated Feb 27, 2023

Code for the paper 'Neural Persistence: A Complexity Measure for Deep Neural Networks Using Algebraic Topology'

Python 30 7 Updated Feb 25, 2019

A trend that started with "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

1,993 132 Updated Oct 5, 2023

List of papers studying machine learning through the lens of category theory

Python 1,320 79 Updated Feb 7, 2025

[CVPR23] "Understanding and Improving Visual Prompting: A Label-Mapping Perspective" by Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zhang, and Sijia Liu

Python 52 13 Updated Sep 17, 2023