Skip to content
View marwage's full-sized avatar

Block or report marwage

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Dynamic resources changes for multi-dimensional parallelism training

Go 22 1 Updated Nov 11, 2024

Fully open reproduction of DeepSeek-R1

Python 17,177 1,407 Updated Feb 7, 2025

Golang bindings for Nvidia Datacenter GPU Manager (DCGM)

C 102 29 Updated Feb 6, 2025

NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs

C++ 450 62 Updated Feb 5, 2025

Recipes to scale inference-time compute of open models

Python 982 93 Updated Jan 16, 2025

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

10,261 1,665 Updated Aug 31, 2023

Use your Neovim like using Cursor AI IDE!

Lua 9,604 374 Updated Feb 7, 2025

A low-latency & high-throughput serving engine for LLMs

Python 304 39 Updated Jan 31, 2025

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,175 378 Updated Jan 27, 2025

[ATC '24] Metis: Fast automatic distributed training on heterogeneous GPUs (https://www.usenix.org/conference/atc24/presentation/um)

Python 25 12 Updated Nov 18, 2024

Minimal, single page, smooth-scrolling theme for Hugo static site generator.

HTML 678 265 Updated Jan 30, 2025

Microsoft Azure Traces

Jupyter Notebook 879 150 Updated Feb 6, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,103 46 Updated Nov 16, 2024

Official inference library for Mistral models

Jupyter Notebook 9,917 884 Updated Nov 12, 2024

📺 Discover the latest machine learning / AI courses on YouTube.

16,222 1,941 Updated Jan 22, 2024

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python 843 40 Updated Jun 27, 2024

PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight

Python 129 7 Updated Oct 13, 2023

Official inference repo for FLUX.1 models

Python 19,976 1,392 Updated Feb 6, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 99,755 16,219 Updated Feb 7, 2025

nnScaler: Compiling DNN models for Parallel Training

Python 87 13 Updated Jan 10, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,404 5,623 Updated Feb 7, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 27,223 3,426 Updated Jul 23, 2024

An open source implementation of CLIP.

Python 10,921 1,033 Updated Jan 4, 2025

Generative Models by Stability AI

Python 25,218 2,792 Updated Sep 4, 2024

Universal LLM Deployment Engine with ML Compilation

Python 19,884 1,649 Updated Feb 6, 2025

Reference implementations of MLPerf™ inference benchmarks

Python 1,298 542 Updated Feb 6, 2025

LLM inference in C/C++

C++ 73,363 10,578 Updated Feb 7, 2025

Minimalistic large language model 3D-parallelism training

Python 1,424 143 Updated Feb 5, 2025

Python package for dataset imports from UCI ML Repository

Jupyter Notebook 272 108 Updated Aug 6, 2024
Next