Stars
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
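As a rough intuition for what "fine-grained scaling" buys over a single per-tensor scale, here is a toy NumPy sketch (not DeepGEMM's kernel code): each 128-element block gets its own scale before a crude FP8-style rounding, so one outlier no longer crushes the precision of the whole matrix. The block size, the E4M3 max constant, and the integer-rounding stand-in are assumptions for illustration.

```python
# Toy illustration of fine-grained (per-block) scaling vs. per-tensor scaling.
# This is NOT DeepGEMM's kernel code; constants and rounding are stand-ins.
import numpy as np

FP8_E4M3_MAX = 448.0   # largest finite value in the FP8 E4M3 format
BLOCK = 128            # per-block scaling granularity (an assumption here)

def fake_fp8(x):
    # Crude stand-in for FP8 rounding: clamp to the representable range and
    # round so the limited precision shows up in the result.
    return np.clip(np.round(x), -FP8_E4M3_MAX, FP8_E4M3_MAX)

def quant_dequant_per_block(x):
    # Fine-grained scaling: every BLOCK-wide slice of the last dim gets its own scale.
    m, k = x.shape
    blocks = x.reshape(m, k // BLOCK, BLOCK)
    scale = np.abs(blocks).max(axis=-1, keepdims=True) / FP8_E4M3_MAX + 1e-12
    return (fake_fp8(blocks / scale) * scale).reshape(m, k)

def quant_dequant_per_tensor(x):
    # Coarse scaling for comparison: one scale for the whole matrix.
    scale = np.abs(x).max() / FP8_E4M3_MAX + 1e-12
    return fake_fp8(x / scale) * scale

rng = np.random.default_rng(0)
a = rng.normal(size=(256, 1024))
a[0, 0] = 1e4                    # a single outlier ruins a per-tensor scale
b = rng.normal(size=(1024, 512))

ref = a @ b
err_block = np.abs(quant_dequant_per_block(a) @ b - ref).mean()
err_tensor = np.abs(quant_dequant_per_tensor(a) @ b - ref).mean()
print(f"mean |error|: per-block {err_block:.4f}  per-tensor {err_tensor:.4f}")
```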
DeepEP: an efficient expert-parallel communication library
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
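For a taste of the kind of material such a book covers, here is a compact value-iteration sketch on a made-up two-state MDP; the transition and reward numbers are invented purely for illustration.

```python
# Value iteration on a hypothetical 2-state, 2-action MDP, illustrating the
# Bellman optimality backup V(s) <- max_a sum_s' P(s'|s,a) [r(s,a) + gamma V(s')].
# All numbers below are invented for illustration only.
import numpy as np

gamma = 0.9
P = np.array([  # P[s, a, s'] transition probabilities
    [[0.8, 0.2], [0.1, 0.9]],
    [[0.5, 0.5], [0.0, 1.0]],
])
R = np.array([  # R[s, a] expected immediate reward
    [1.0, 0.0],
    [0.0, 2.0],
])

V = np.zeros(2)
for _ in range(1000):
    Q = R + gamma * (P @ V)        # Q[s, a] = R[s, a] + gamma * sum_s' P[s,a,s'] V[s']
    V_new = Q.max(axis=1)          # greedy Bellman optimality backup
    if np.abs(V_new - V).max() < 1e-8:
        break
    V = V_new

print("optimal state values:", V, "greedy policy:", Q.argmax(axis=1))
```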
Tensors and Dynamic neural networks in Python with strong GPU acceleration
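A minimal sketch of what that one-line description promises: tensors, a dynamically built (define-by-run) network, autograd, and GPU placement when available. The tiny regression model and shapes are illustrative only.

```python
# Minimal PyTorch example: tensors, a dynamic network, autograd, optional GPU.
import torch
import torch.nn as nn

device = "cuda" if torch.cuda.is_available() else "cpu"

model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 1)).to(device)
opt = torch.optim.SGD(model.parameters(), lr=1e-2)

x = torch.randn(64, 16, device=device)           # a batch of random inputs
y = torch.randn(64, 1, device=device)            # random regression targets

for step in range(100):
    loss = nn.functional.mse_loss(model(x), y)   # graph is built dynamically each step
    opt.zero_grad()
    loss.backward()                              # autograd computes all gradients
    opt.step()

print(f"final loss: {loss.item():.4f} on {device}")
```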
A minimal GPU design in Verilog to learn how GPUs work from the ground up
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Modeling, training, eval, and inference code for OLMo
Development repository for the Triton language and compiler
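To show what "the Triton language" looks like in practice, here is a minimal element-wise add kernel in the style of the official tutorials; it assumes a CUDA GPU and the triton package are available.

```python
# Minimal Triton kernel: element-wise vector add, launched from Python.
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)                        # which block this program handles
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements                        # guard the tail of the array
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

def add(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    out = torch.empty_like(x)
    n = x.numel()
    grid = lambda meta: (triton.cdiv(n, meta["BLOCK_SIZE"]),)
    add_kernel[grid](x, y, out, n, BLOCK_SIZE=1024)
    return out

a = torch.randn(10_000, device="cuda")
b = torch.randn(10_000, device="cuda")
assert torch.allclose(add(a, b), a + b)
```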
High-speed Large Language Model Serving for Local Deployment
Data preparation code for CrystalCoder 7B LLM
Pre-training code for CrystalCoder 7B LLM
Data processing for and with foundation models!
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Training and serving large-scale neural networks with auto parallelization.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
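A small usage sketch of the library's pipeline API; the checkpoint is whatever the library selects by default for the task and is downloaded on first use.

```python
# Quick Transformers usage: a ready-made sentiment-analysis pipeline.
from transformers import pipeline

classifier = pipeline("sentiment-analysis")
print(classifier("Starred repositories make a surprisingly good reading list."))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```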
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
DLRover: An Automatic Distributed Deep Learning System
DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation by the LF AI & Data Foundation.