haonan3

Starred repositories

154 results for source starred repositories

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 909 44 Updated Feb 28, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 567 35 Updated Feb 24, 2025

Data distillation benchmark

HTML 55 3 Updated Feb 17, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,028 124 Updated Mar 2, 2025

Data science interview questions with answers. Not ideal (yet)

1,604 355 Updated Jul 10, 2022

Compilation of resources for aspiring data scientists

Python 2,044 644 Updated Jun 14, 2024

Answers to 120 commonly asked data science interview questions.

3,760 1,313 Updated Jan 18, 2024

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 1,007 85 Updated Jan 22, 2025

An instruction data generation system for multimodal language models.

Jupyter Notebook 31 Updated Jan 31, 2025
Python 38 2 Updated Nov 5, 2024

AnchorAttention: Improved attention for LLMs long-context training

Python 205 6 Updated Jan 15, 2025
Python 19 Updated Nov 8, 2024

🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

Python 208 12 Updated Feb 24, 2025

[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)

Python 49 1 Updated Oct 17, 2024

Ring attention implementation with flash attention

Python 692 60 Updated Feb 24, 2025

`dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.

Python 65 12 Updated Feb 10, 2025
Python 2 Updated Aug 29, 2023
Python 6 Updated Feb 4, 2025

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters.

Python 848 47 Updated Jan 3, 2025

Lossless Training Speed Up by Unbiased Dynamic Data Pruning

Python 329 18 Updated Sep 24, 2024

This repository is the PyTorch implementation of dynamicAL (NeurIPS 2022)

Python 4 1 Updated Dec 15, 2022

[CVPR2024] Efficient Dataset Distillation via Minimax Diffusion

Python 91 9 Updated Mar 22, 2024

Welcome to the Awesome Feature Learning in Deep Learning Theory Reading Group! This repository serves as a collaborative platform for scholars, enthusiasts, and anyone interested in delving into th…

169 8 Updated Dec 27, 2024

EcoAssistant: using LLM assistant more affordably and accurately

Python 133 7 Updated Jun 30, 2024

[ICCV 2023] Subclass-balancing contrastive learning for long-tailed recognition

Python 17 2 Updated Oct 30, 2023

[ICCV 2023] MADAug: When to Learn What: Model-Adaptive Data Augmentation Curriculum

Python 19 2 Updated Nov 9, 2023

[ICCV2023] Dataset Quantization

Python 257 18 Updated Jan 6, 2024