[Preprint] Official repository for "Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence"

Python 1 Updated Feb 7, 2025

👫 A curated list of Model Merging methods.

89 4 Updated Sep 16, 2024

Automatically generate your styled QR code in your web app.

TypeScript 1,683 515 Updated Jan 10, 2025

[NeurIPS 2024] Official repository for "Offline Behavior Distillation"

Python 3 Updated Oct 31, 2024

PyTorch implementations of density estimation algorithms: BNAF, Glow, MAF, RealNVP, planar flows

Python 611 103 Updated Jul 12, 2021

PyTorch implementations of algorithms for density estimation

Python 577 75 Updated May 13, 2021

moffee: Make Markdown Ready to Present

Python 1,023 48 Updated Nov 22, 2024

[TNNLS] Official repository for "Attentive Learning Facilitates Generalization of Neural Networks"

Python 4 Updated Jan 18, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,692 228 Updated Jan 27, 2025

Robust recipes to align language models with human and AI preferences

Python 4,976 428 Updated Nov 21, 2024

RewardBench: the first evaluation tool for reward models.

Python 502 57 Updated Feb 8, 2025

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,385 171 Updated Feb 11, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 8,918 943 Updated Feb 11, 2025

Isaac Gym Reinforcement Learning Environments

Python 2,202 446 Updated Oct 26, 2024

Python Implementation of Reinforcement Learning: An Introduction

Python 13,803 4,862 Updated Aug 9, 2024

Repo for offline reinforcement learning methods

Python 9 3 Updated Jul 21, 2020

An educational resource to help anyone learn deep reinforcement learning.

Python 10,489 2,272 Updated Aug 5, 2024

An index of algorithms for offline reinforcement learning (offline-rl)

955 87 Updated May 23, 2024

Public recipe files for Apptainer containers used on CSC HPC environments

R 8 5 Updated Jan 14, 2025

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 81,989 11,474 Updated Feb 10, 2025

ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (PyTorch)

Python 25 4 Updated Oct 22, 2024

SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection

Python 121 12 Updated Mar 18, 2024

Official implementation of the paper "Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning"

Python 84 5 Updated Jul 31, 2024

AI-Generated Images as Data Source: The Dawn of Synthetic Era

TeX 149 11 Updated Dec 8, 2023

Jupyter Notebook 2,957 285 Updated Feb 27, 2023

Code for the paper 'Neural Persistence: A Complexity Measure for Deep Neural Networks Using Algebraic Topology'

Python 30 8 Updated Feb 25, 2019

A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

1,997 132 Updated Oct 5, 2023

List of papers studying machine learning through the lens of category theory

Python 1,324 79 Updated Feb 10, 2025