Skip to content
View haonan3's full-sized avatar
🤠
🤠

Highlights

  • Pro

Block or report haonan3

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

🚀 Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,835 102 Updated Jan 31, 2025

Data science interview questions with answers. Not ideally (yet)

1,592 355 Updated Jul 10, 2022

Compilation of resources for aspiring data scientists

Python 2,023 646 Updated Jun 14, 2024

Answers to 120 commonly asked data science interview questions.

3,753 1,315 Updated Jan 18, 2024

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 979 83 Updated Jan 22, 2025

A instruction data generation system for multimodal language models.

Jupyter Notebook 29 Updated Jan 31, 2025
Python 36 2 Updated Nov 5, 2024

AnchorAttention: Improved attention for LLMs long-context training

Python 203 6 Updated Jan 15, 2025
Python 19 Updated Nov 8, 2024

🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

Python 103 6 Updated Jan 29, 2025

[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View

Python 47 1 Updated Oct 17, 2024

Ring attention implementation with flash attention

Python 661 57 Updated Dec 19, 2024

`dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.

Python 56 11 Updated Dec 30, 2024
Python 2 Updated Aug 29, 2023
Python 5 Updated Feb 22, 2024

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Python 834 45 Updated Jan 3, 2025

Lossless Training Speed Up by Unbiased Dynamic Data Pruning

Python 324 18 Updated Sep 24, 2024

This repository is the PyTorch implementation of dynamicAL (NeurIPS 2022)

Python 4 1 Updated Dec 15, 2022

[CVPR2024] Efficient Dataset Distillation via Minimax Diffusion

Python 90 9 Updated Mar 22, 2024

Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for scholars, enthusiasts, and anyone interested in delving into th…

163 8 Updated Dec 27, 2024

EcoAssistant: using LLM assistant more affordably and accurately

Python 132 8 Updated Jun 30, 2024

[ICCV 2023] Subclass-balancing contrastive learning for long-tailed recognition

Python 17 2 Updated Oct 30, 2023

[ICCV 2023] MADAug: When to Learn What: Model-Adaptive Data Augmentation Curriculum

Python 19 2 Updated Nov 9, 2023

[ICCV2023] Dataset Quantization

Python 256 18 Updated Jan 6, 2024

A JAX library for Density Functional Theory.

Python 47 5 Updated May 4, 2024

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook 441 14 Updated May 24, 2024

A curated list of the latest breakthroughs in AI (in 2022) by release date with a clear video explanation, link to a more in-depth article, and code.

3,199 205 Updated Oct 18, 2023
Next