Stars
Automatically generate your styled QR code in your web app.
[NeurIPS 2024] Official repository for "Offline Behavior Distillation"
PyTorch implementations of density estimation algorithms: BNAF, Glow, MAF, RealNVP, planar flows
PyTorch implementations of algorithms for density estimation
[TNNLS] Official repository for "Attentive Learning Facilitates Generalization of Neural Networks"
A curated list of reinforcement learning with human feedback resources (continually updated)
Robust recipes to align language models with human and AI preferences
RewardBench: the first evaluation tool for reward models.
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Isaac Gym Reinforcement Learning Environments
Python Implementation of Reinforcement Learning: An Introduction
Repo for offline reinforcement learning methods
An educational resource to help anyone learn deep reinforcement learning.
An index of algorithms for offline reinforcement learning (offline-rl)
Public recipe files for Apptainer containers used on CSC HPC environments
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (PyTorch)
SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection
Official implementation of the paper "Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning"
AI-Generated Images as Data Source: The Dawn of Synthetic Era
Code for the paper 'Neural Persistence: A Complexity Measure for Deep Neural Networks Using Algebraic Topology'
A trend starting from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
List of papers studying machine learning through the lens of category theory
[CVPR23] "Understanding and Improving Visual Prompting: A Label-Mapping Perspective" by Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zhang, and Sijia Liu