🎯
Focusing
  • Rice University
  • Houston, United States

35 stars written in Python

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 139,744 28,025 Updated Feb 19, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,843 4,249 Updated Feb 19, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,521 6,025 Updated Feb 19, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 21,935 1,920 Updated Jan 23, 2025

Fully open reproduction of DeepSeek-R1

Python 20,691 1,804 Updated Feb 19, 2025

Universal LLM Deployment Engine with ML Compilation

Python 20,006 1,666 Updated Feb 12, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,795 377 Updated Jul 11, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,438 300 Updated Feb 19, 2025

Training and serving large-scale neural networks with auto parallelization.

Python 3,102 361 Updated Dec 9, 2023

Line-by-line profiling for Python

Python 2,860 125 Updated Jan 30, 2025

PyTorch native quantization and sparsity for training and inference

Python 1,847 218 Updated Feb 19, 2025

Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)

Python 1,617 193 Updated Dec 30, 2024

Answers to the exercises, programming assignments, and labs for Computer Networking: A Top-Down Approach

Python 1,414 225 Updated Jan 27, 2022

Implementation of Communication-Efficient Learning of Deep Networks from Decentralized Data

Python 1,319 454 Updated May 7, 2024

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,271 103 Updated Feb 10, 2025

Rotary Transformer

Python 894 52 Updated Mar 21, 2022

Local model support for Microsoft's graphrag using ollama (llama3, mistral, gemma2, phi3): LLM & embedding extraction

Python 892 141 Updated Sep 30, 2024

My learning notes/codes for ML SYS.

Python 793 38 Updated Feb 19, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 739 35 Updated Feb 19, 2025

Implements harmful/harmless refusal removal using pure HF Transformers

Python 550 79 Updated Jun 12, 2024
Python 314 40 Updated Apr 2, 2024

Deep Learning Energy Measurement and Optimization

Python 239 30 Updated Feb 5, 2025

Push-Button End-to-End Testing of Kubernetes Operators and Controllers

Python 125 43 Updated Feb 14, 2025

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Python 102 16 Updated Oct 15, 2024

Google TPU optimizations for transformers models

Python 98 24 Updated Jan 21, 2025
Python 87 11 Updated Oct 9, 2024
Python 72 5 Updated May 4, 2021
Python 60 20 Updated Oct 25, 2022

Tele-LLMs is a series of open-source large language models specialized in telecommunications

Python 19 3 Updated Sep 10, 2024