XDaoHong

2 followers · 0 following

Stars

14 results for source starred repositories

pytorch / FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ · 1,270 stars · 550 forks · Updated Mar 13, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python · 5,838 stars · 507 forks · Updated Mar 13, 2025

facebookresearch / generative-recommenders

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python · 928 stars · 173 forks · Updated Mar 13, 2025

facebookresearch / dlrm

An implementation of a deep learning recommendation model (DLRM)

Python · 3,842 stars · 852 forks · Updated Oct 11, 2024

pytorch / torchrec

PyTorch domain library for recommendation systems

Python · 2,050 stars · 487 forks · Updated Mar 13, 2025

deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,828 stars · 502 forks · Updated Sep 25, 2024

microsoft / vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

C · 309 stars · 23 forks · Updated Feb 20, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python · 11,738 stars · 2,640 forks · Updated Mar 13, 2025

foundation-model-stack / fms-fsdp

🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.

Python · 227 stars · 37 forks · Updated Mar 6, 2025

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python · 141,141 stars · 28,269 forks · Updated Mar 13, 2025

pytorch / torchtitan

A PyTorch native library for large model training

Python · 3,443 stars · 309 forks · Updated Mar 13, 2025

bytedance / ByteMLPerf

AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.

Python · 226 stars · 69 forks · Updated Mar 12, 2025

apache / seatunnel

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

Java · 8,340 stars · 1,923 forks · Updated Mar 13, 2025

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python · 87,847 stars · 23,575 forks · Updated Mar 13, 2025
