Wonderful-Me

Follow

🎯

Focusing

Infinity Wonderful-Me

🎯

Focusing

Follow

CS PhD Student at Rice University

15 followers · 24 following

Rice University
Houston, United States

Achievements

Achievements

Stars

87 results for source starred repositories

guinmoon / LLMFarm

llama and other large language models on iOS and MacOS offline using GGML library.

Swift 1,597 110 Updated Jan 27, 2025

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 20,006 1,666 Updated Feb 12, 2025

chaoyanghe / Awesome-Federated-Learning

FedML - The Research and Production Integrated Federated Learning Library: https://fedml.ai

1,942 331 Updated Sep 3, 2022

MoonshotAI / MoBA

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 736 35 Updated Feb 19, 2025

DefTruth / ffpa-attn-mma

📚FFPA: Yet another Faster Flash Prefill Attention with O(1)⚡️SRAM complexity for headdim > 256, 1.8x~3x↑🎉faster than SDPA EA.

Cuda 106 5 Updated Feb 19, 2025

ZJU-LLMs / Foundations-of-LLMs

7,003 575 Updated Jan 14, 2025

GATECH-EIC / ShiftAddLLM

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Python 102 16 Updated Oct 15, 2024

microsoft / T-MAC

Low-bit LLM inference on CPU with lookup table

C++ 681 52 Updated Jan 9, 2025

pytorch / executorch

On-device AI across mobile, embedded and edge for PyTorch

C++ 2,519 449 Updated Feb 19, 2025

apple / corenet

CoreNet: A library for training deep neural networks

Jupyter Notebook 7,004 546 Updated Oct 14, 2024

ml-explore / mlx

MLX: An array framework for Apple silicon

C++ 19,164 1,096 Updated Feb 19, 2025

ZhuiyiTechnology / roformer

Rotary Transformer

Python 894 52 Updated Mar 21, 2022

moonbit-community / XMLParser

MoonBit 4 2 Updated Feb 19, 2025

pyutils / line_profiler

Line-by-line profiling for Python

Python 2,860 125 Updated Jan 30, 2025

xlab-uiuc / acto

Push-Button End-to-End Testing of Kubernetes Operators and Controllers

Python 125 43 Updated Feb 14, 2025

FMInference / DejaVu

Python 314 40 Updated Apr 2, 2024

ggml-org / llama.cpp

LLM inference in C/C++

C++ 74,752 10,805 Updated Feb 19, 2025

openai / openai-realtime-agents

This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.

TypeScript 4,990 515 Updated Feb 13, 2025

TheAiSingularity / graphrag-local-ollama

Local models support for Microsoft's graphrag using ollama (llama3, mistral, gemma2 phi3)- LLM & Embedding extraction

Python 892 141 Updated Sep 30, 2024

stanford-oval / storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 21,935 1,920 Updated Jan 23, 2025

SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving for Local Deployment

C++ 8,107 423 Updated Feb 19, 2025

csguoh / Awesome-Mamba-in-Low-Level-Vision

A paper list of recent mamba efforts for low-level vision.

243 9 Updated Feb 13, 2025

ml-energy / zeus

Deep Learning Energy Measurement and Optimization

Python 239 30 Updated Feb 5, 2025

NVIDIA / DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,279 630 Updated Feb 19, 2025

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 2,048 115 Updated Feb 19, 2025

mlc-ai / xgrammar

Fast, Flexible and Portable Structured Generation

C++ 706 44 Updated Feb 19, 2025

UbiquitousLearning / mllm

Fast Multimodal LLM on Mobile Devices

C++ 696 80 Updated Feb 9, 2025

Yangyi-Chen / Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

590 40 Updated Feb 19, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,438 300 Updated Feb 19, 2025

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,271 103 Updated Feb 10, 2025