🎯
Focusing
  • THU | IDEA
  • Beijing | Shenzhen

Organizations

@IDEA-opensource

Starred repositories

543 results for source starred repositories

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 2,575 195 Updated Feb 7, 2025

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 838 49 Updated Jan 22, 2025

s1: Simple test-time scaling

Python 5,122 571 Updated Feb 12, 2025

The MATH Dataset (NeurIPS 2021)

Python 1,012 91 Updated Aug 5, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 11,526 1,155 Updated Feb 3, 2025

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Python 256 20 Updated Apr 24, 2024

🚀 Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 1,889 113 Updated Feb 12, 2025

The model, data and code for the visual GUI Agent SeeClick

HTML 307 15 Updated Nov 22, 2024

The official implementation of Tensor ProducT ATTenTion Transformer (T6)

Python 286 26 Updated Feb 8, 2025

LoRA fine-tune directly on the quantized models.

Python 26 Updated Nov 25, 2024
Python 2,114 145 Updated Feb 10, 2025

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 123 9 Updated Dec 31, 2024

The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

Python 140 16 Updated Jan 18, 2025
Jupyter Notebook 60 7 Updated Feb 11, 2025

Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Cuda 57 5 Updated Jul 14, 2024

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]

Python 1,439 110 Updated Feb 12, 2025

[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training

Python 200 7 Updated Jan 13, 2025

Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

Python 2,561 372 Updated Mar 5, 2024

A year has gone by: where did all the money you spent in the Tsinghua ("Huazi") canteens go?

Python 458 81 Updated Dec 23, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,680 453 Updated Feb 12, 2025

A collection of vision foundation models unifying understanding and generation.

40 2 Updated Jan 2, 2025

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 851 36 Updated Jan 21, 2025

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,449 194 Updated Feb 12, 2025

GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.

60 Updated Nov 27, 2024

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curatio…

Python 1,995 175 Updated Nov 7, 2024

Next-Token Prediction is All You Need

Python 1,992 78 Updated Oct 24, 2024

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 422 26 Updated Feb 10, 2025