SlongLiu

Follow

🎯

Focusing

Shilong Liu SlongLiu

🎯

Focusing

Follow

Ph.D. Student @ CST of Tsinghua University. Intern @IDEA-Research CVR group. homepage: lsl.zone

392 followers · 96 following

THU | IDEA
Beijing | Shenzhen
lsl.zone

Achievements

Achievements

Highlights

Pro

Organizations

Starred repositories

microsoft / Samba

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 834 49 Updated Jan 22, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 3,805 426 Updated Feb 8, 2025

hendrycks / math

The MATH Dataset (NeurIPS 2021)

Python 1,006 91 Updated Aug 5, 2024

Lightning-AI / litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 11,473 1,146 Updated Feb 3, 2025

bytedance / UI-TARS

2,176 129 Updated Feb 6, 2025

OpenNLPLab / lightning-attention

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Python 253 20 Updated Apr 24, 2024

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,874 111 Updated Feb 8, 2025

njucckevin / SeeClick

The model, data and code for the visual GUI Agent SeeClick

HTML 302 15 Updated Nov 22, 2024

tensorgi / T6

The official implementation of Tensor ProducT ATTenTion Transformer (T6)

Python 281 26 Updated Feb 8, 2025

csguoh / IntLoRA

LoRA fine-tune directly on the quantized models.

Python 26 Updated Nov 25, 2024

MiniMax-AI / MiniMax-01

Python 2,079 143 Updated Jan 16, 2025

NVlabs / GatedDeltaNet

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 122 9 Updated Dec 31, 2024

DAMO-NLP-SG / multimodal_textbook

The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

Python 135 15 Updated Jan 18, 2025

mdy666 / mdy_triton

Jupyter Notebook 55 5 Updated Feb 6, 2025

test-time-training / ttt-lm-kernels

Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Cuda 56 5 Updated Jul 14, 2024

Peterande / D-FINE

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥

Python 1,413 107 Updated Jan 20, 2025

x-cls / superclass

[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training

Python 200 7 Updated Jan 13, 2025

NVIDIA / MinkowskiEngine

Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

Python 2,557 372 Updated Mar 5, 2024

deepseek-ai / DeepSeek-V3

Python 80,060 12,677 Updated Feb 8, 2025

leverimmy / THU-Annual-Eat

一年过去了，你在华子食堂里花的钱都花在哪儿了？

Python 458 81 Updated Dec 23, 2024

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,649 447 Updated Feb 7, 2025

shxie2020 / Awesome-UGVFM

A collection of vision foundation models unifying understanding and generation.

40 2 Updated Jan 2, 2025

IDEA-Research / DINO-X-API

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 844 36 Updated Jan 21, 2025

webdataset / webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,444 194 Updated Feb 7, 2025

uni-medical / GMAI-VL

GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.

60 Updated Nov 27, 2024

NVlabs / Hydra-MDP

336 17 Updated Aug 30, 2024

BAAI-Agents / Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…

Python 1,989 173 Updated Nov 7, 2024

baaivision / Emu3

Next-Token Prediction is All You Need

Python 1,983 79 Updated Oct 24, 2024

mit-han-lab / duo-attention

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 422 26 Updated Jan 22, 2025

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,424 239 Updated Jan 27, 2025

Starred topics

Unity