Skip to content
View LZhengisme's full-sized avatar

Block or report LZhengisme

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 68 1 Updated Feb 18, 2025

Collections of RLxLM experiments using minimal codes

Python 11 Updated Feb 17, 2025

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Python 362 22 Updated Feb 19, 2025

Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

Python 62 2 Updated Feb 17, 2025

Collecting research materials on neural samplers with diffusion/flow models

37 Updated Feb 18, 2025

Code for MGDM algorithm (https://arxiv.org/abs/2502.03332)

Jupyter Notebook 8 Updated Feb 6, 2025

Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More

9 Updated Feb 6, 2025

Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration

Python 4 Updated Jan 20, 2025
Python 6 Updated Dec 10, 2024

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Python 225 14 Updated Jan 14, 2025

Extending context length of visual language models

Python 6 Updated Dec 18, 2024

Red Teaming Visual language models

Jupyter Notebook 6 Updated Oct 27, 2023

This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.

225 6 Updated Feb 18, 2025

[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"

Python 69 3 Updated Nov 25, 2024

[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Python 89 4 Updated Nov 22, 2024

[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"

Python 32 2 Updated Feb 14, 2025
Python 9 1 Updated Feb 7, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,434 240 Updated Jan 27, 2025

GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.

Python 2 Updated Jul 8, 2024

ICLR'25 Oral: Improving Probabilistic Diffusion Models With Optimal Covariance Matching

Python 3 Updated Oct 19, 2024

Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"

Python 215 18 Updated Feb 16, 2025

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 192 16 Updated Feb 12, 2025
Python 23 2 Updated Aug 23, 2024

🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨

Python 89 3 Updated Aug 14, 2024

Official inference repo for FLUX.1 models

Python 20,263 1,423 Updated Feb 6, 2025

Official PyTorch Implementation of the Longhorn Deep State Space Model

Python 48 3 Updated Dec 4, 2024
Python 21 Updated Jun 24, 2024

[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Jupyter Notebook 116 8 Updated Aug 26, 2024
Next