  • THU | IDEA
  • Beijing | Shenzhen

Organizations

@IDEA-opensource


Starred repositories

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 834 49 Updated Jan 22, 2025

s1: Simple test-time scaling

Python 3,805 426 Updated Feb 8, 2025

The MATH Dataset (NeurIPS 2021)

Python 1,006 91 Updated Aug 5, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 11,473 1,146 Updated Feb 3, 2025

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Python 253 20 Updated Apr 24, 2024

🚀 Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 1,874 111 Updated Feb 8, 2025

The model, data and code for the visual GUI Agent SeeClick

HTML 302 15 Updated Nov 22, 2024

The official implementation of Tensor ProducT ATTenTion Transformer (T6)

Python 281 26 Updated Feb 8, 2025

LoRA fine-tune directly on the quantized models.

Python 26 Updated Nov 25, 2024
Python 2,079 143 Updated Jan 16, 2025

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 122 9 Updated Dec 31, 2024

The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

Python 135 15 Updated Jan 18, 2025
Jupyter Notebook 55 5 Updated Feb 6, 2025

Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Cuda 56 5 Updated Jul 14, 2024

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥

Python 1,413 107 Updated Jan 20, 2025

[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training

Python 200 7 Updated Jan 13, 2025

Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

Python 2,557 372 Updated Mar 5, 2024

A year has gone by. Where did all the money you spent in the Tsinghua canteens go?

Python 458 81 Updated Dec 23, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,649 447 Updated Feb 7, 2025

A collection of vision foundation models unifying understanding and generation.

40 2 Updated Jan 2, 2025

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 844 36 Updated Jan 21, 2025

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,444 194 Updated Feb 7, 2025

GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.

60 Updated Nov 27, 2024

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curatio…

Python 1,989 173 Updated Nov 7, 2024

Next-Token Prediction is All You Need

Python 1,983 79 Updated Oct 24, 2024

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 422 26 Updated Jan 22, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,424 239 Updated Jan 27, 2025