LeavesLei

Follow

Shiye Lei LeavesLei

Follow

AI/ML PhD Student @ USYD

10 followers · 9 following

USYD
Sydney, Australia
shiyelei.com

Stars

fshp971 / adv-ICL

[Preprint] Official repository for "Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence"

Python 1 Updated Feb 7, 2025

deepseek-ai / DeepSeek-V3

Python 83,131 13,262 Updated Feb 8, 2025

ycjing / Awesome-Model-Merging

👫 A curated list of Model Merging methods.

89 4 Updated Sep 16, 2024

kozakdenys / qr-code-styling

Automaticly generate your styled QR code in your web app.

TypeScript 1,683 515 Updated Jan 10, 2025

LeavesLei / OBD

[NeurIPS 2024] Official repository for "Offline Behavior Distillation"

Python 3 Updated Oct 31, 2024

kamenbliznashki / normalizing_flows

Pytorch implementations of density estimation algorithms: BNAF, Glow, MAF, RealNVP, planar flows

Python 611 103 Updated Jul 12, 2021

ikostrikov / pytorch-flows

PyTorch implementations of algorithms for density estimation

Python 577 75 Updated May 13, 2021

BMPixel / moffee

moffee: Make Markdown Ready to Present

Python 1,023 48 Updated Nov 22, 2024

LeavesLei / attentive_learning

[TNNLS] Official repository for "Attentive Learning Facilitates Generalization of Neural Networks"

Python 4 Updated Jan 18, 2024

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,692 228 Updated Jan 27, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 4,976 428 Updated Nov 21, 2024

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 502 57 Updated Feb 8, 2025

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,385 171 Updated Feb 11, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 8,918 943 Updated Feb 11, 2025

isaac-sim / IsaacGymEnvs

Isaac Gym Reinforcement Learning Environments

Python 2,202 446 Updated Oct 26, 2024

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python 13,803 4,862 Updated Aug 9, 2024

princetonvisualai / RememberThePast-DatasetDistillation

Python 37 8 Updated Nov 19, 2022

RchalYang / offlinerl

Repo for offline reinforcement learning methods

Python 9 3 Updated Jul 21, 2020

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python 10,489 2,272 Updated Aug 5, 2024

hanjuku-kaso / awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

955 87 Updated May 23, 2024

CSCfi / singularity-recipes

Public recipe files for Apptainer containers used on CSC HPC environments

R 8 5 Updated Jan 14, 2025

rustdesk / rustdesk

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 81,989 11,474 Updated Feb 10, 2025

timoklein / redo

ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)

Python 25 4 Updated Oct 22, 2024

hakuhodo-technologies / scope-rl

SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection

Python 121 12 Updated Mar 18, 2024

nakamotoo / Cal-QL

official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning

Python 84 5 Updated Jul 31, 2024

mwxely / AIGS

AI-Generated Images as Data Source: The Dawn of Synthetic Era

TeX 149 11 Updated Dec 8, 2023

rinongal / textual_inversion

Jupyter Notebook 2,957 285 Updated Feb 27, 2023

BorgwardtLab / Neural-Persistence

Code for the paper 'Neural Persistence: A Complexity Measure for Deep Neural Networks Using Algebraic Topology'

Python 30 8 Updated Feb 25, 2019

Timothyxxx / Chain-of-ThoughtsPapers

A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

1,997 132 Updated Oct 5, 2023

bgavran / Category_Theory_Machine_Learning

List of papers studying machine learning through the lens of category theory

Python 1,324 79 Updated Feb 10, 2025