Sireer

Sireer

15 followers · 115 following

Highlights

Stars

ant-research / HeadArtist

The official code of HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation

Python 67 4 Updated Aug 16, 2024

Open-LLM-VTuber / Open-LLM-VTuber

Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms

Python 2,475 250 Updated Mar 2, 2025

facebookresearch / fast3r

Code base for Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Python 170 9 Updated Feb 28, 2025

facebookresearch / dualformer

implementation of dualformer

Jupyter Notebook 7 1 Updated Mar 1, 2025

JT-Ushio / MHA2MLA

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Python 95 8 Updated Feb 27, 2025

open-mmlab / mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…

Jupyter Notebook 7,085 1,074 Updated Aug 6, 2024

bbruceyuan / LLMs-Zero-to-Hero

从无名小卒到大模型（LLM）大英雄~ 欢迎关注后续！！！

Jupyter Notebook 761 50 Updated Feb 22, 2025

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,040 92 Updated Mar 1, 2025

dvlab-research / VisionZip

Official repository for VisionZip (CVPR 2025)

Python 245 11 Updated Feb 27, 2025

FoundationVision / UniTok

A Unified Tokenizer for Visual Generation and Understanding

Python 120 3 Updated Feb 28, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 936 47 Updated Feb 28, 2025

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,231 2,713 Updated Mar 3, 2025

maybeLx / MVSFormerPlusPlus

Codes of MVSFormer++: Revealing the Devil in Transformer’s Details for Multi-View Stereo (ICLR2024)

Python 205 7 Updated Jan 9, 2025

Westlake-AGI-Lab / Distill-Any-Depth

The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"

Python 192 6 Updated Mar 2, 2025

feifeibear / long-context-attention

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 441 32 Updated Feb 19, 2025

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,372 114 Updated Feb 28, 2025

crockwell / far

[CVPR 2024 - Highlight] FAR: Flexible, Accurate and Robust 6DoF Relative Camera Pose Estimation

Python 129 9 Updated Mar 11, 2024

NVIDIA / TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,258 2,161 Updated Feb 1, 2025