- Hong Kong
- https://lzhengisme.github.io/
Stars
Collections of RLxLM experiments using minimal codes
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
Code for Paper: Teaching Language Models to Critique via Reinforcement Learning
Collecting research materials on neural samplers with diffusion/flow models
Code for MGDM algorithm (https://arxiv.org/abs/2502.03332)
Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More
Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.
[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
HKUNLP / GSM-Plus
Forked from qtli/GSM-PlusGSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
ICLR'25 Oral: Improving Probabilistic Diffusion Models With Optimal Covariance Matching
Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨
Official inference repo for FLUX.1 models
Official PyTorch Implementation of the Longhorn Deep State Space Model
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?