Skip to content
View XDaoHong's full-sized avatar

Block or report XDaoHong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,271 549 Updated Mar 12, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,831 507 Updated Mar 12, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 928 173 Updated Mar 13, 2025

An implementation of a deep learning recommendation model (DLRM)

Python 3,840 852 Updated Oct 11, 2024

Pytorch domain library for recommendation systems

Python 2,050 487 Updated Mar 13, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,829 502 Updated Sep 25, 2024

Dynamic Memory Management for Serving LLMs without PagedAttention

C 308 23 Updated Feb 20, 2025

Ongoing research training transformer models at scale

Python 11,729 2,640 Updated Mar 13, 2025

🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.

Python 227 37 Updated Mar 6, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 141,097 28,257 Updated Mar 13, 2025

A PyTorch native library for large model training

Python 3,441 309 Updated Mar 13, 2025

AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.

Python 226 69 Updated Mar 12, 2025

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

Java 8,341 1,923 Updated Mar 12, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 87,826 23,575 Updated Mar 13, 2025
Showing results