Stars
DeepEP: an efficient expert-parallel communication library
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
MoBA: Mixture of Block Attention for Long-Context LLMs
A PyTorch native library for large model training
This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding code links.
HunyuanVideo: A Systematic Framework For Large Video Generation Models
A method for calculating scaling laws for LLMs from publicly available models
Modeling, training, eval, and inference code for OLMo
Awesome-LLM-KV-Cache: A curated list of 📙 Awesome LLM KV Cache papers with code.
Official inference repo for FLUX.1 models
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
A description of the recent long-context large language model Jamba.
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
Some preliminary explorations of Mamba's context scaling.
Doing simple retrieval from LLMs at various context lengths to measure accuracy
fanshiqing / grouped_gemm
Forked from tgale96/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
Example UI implementing the RTVI web client
recursal / GoldFinch-paper
Forked from SmerkyG/GoldFinch-paper
GoldFinch and other hybrid transformer components
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
A simple and easily understandable version of RWKV
BlinkDL / nanoRWKV
Forked from karpathy/nanoGPT
RWKV in nanoGPT style
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )