- Google
- Seattle, WA
- patemotter.com
Stars
JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future -- PRs welcome).
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently.
A plugin loader for the Steam Deck.
Official inference library for Mistral models
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
Generative Models by Stability AI
Simple, safe way to store and distribute tensors
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Fast and memory-efficient exact attention
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models.
Linear solvers in JAX and Equinox. https://docs.kidger.site/lineax
Hardware accelerated, batchable and differentiable optimizers in JAX.
Copybara: A tool for transforming and moving code between repositories.
The official Python library for the Google Gemini API
Orbax provides common checkpointing and persistence utilities for JAX users.
Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/
Performance-portable, length-agnostic SIMD with runtime dispatch
A complete guide to start and improve in machine learning (ML) and artificial intelligence (AI) in 2025 without ANY background in the field, and stay up-to-date with the latest news and state-of-the-art techniques.
A simple, performant, and scalable JAX LLM!
Flax is a neural network library for JAX that is designed for flexibility.