Organizations

@LighthouseHPC @googlers


JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Python 261 32 Updated Jan 9, 2025
Python 262 23 Updated Jul 11, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 40,994 4,380 Updated Jul 28, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,119 1,053 Updated Jan 8, 2025

A plugin loader for the Steam Deck.

TypeScript 4,946 174 Updated Jan 4, 2025

Official inference library for Mistral models

Jupyter Notebook 9,840 870 Updated Nov 12, 2024

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 5,748 523 Updated Dec 14, 2024

Generative Models by Stability AI

Python 25,045 2,777 Updated Sep 4, 2024

Simple, safe way to store and distribute tensors

Python 2,988 204 Updated Jan 9, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,175 4,185 Updated Jan 9, 2025

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,937 176 Updated Nov 20, 2024

Night theme for Zotero UI and PDF

SCSS 2,442 39 Updated Dec 2, 2024

Fast and memory-efficient exact attention

Python 14,995 1,413 Updated Jan 10, 2025

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Python 2,090 149 Updated Aug 1, 2024

Linear solvers in JAX and Equinox. https://docs.kidger.site/lineax

Python 390 24 Updated Jan 5, 2025

Hardware accelerated, batchable and differentiable optimizers in JAX.

Python 942 69 Updated Sep 17, 2024

Copybara: A tool for transforming and moving code between repositories.

Java 2,199 264 Updated Jan 9, 2025
Python 126 30 Updated Jan 10, 2025

The official Python library for the Google Gemini API

Python 1,942 384 Updated Dec 20, 2024
Python 272 27 Updated Dec 24, 2024

Orbax provides common checkpointing and persistence utilities for JAX users

Python 323 38 Updated Jan 10, 2025

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,297 66 Updated Jan 8, 2025
Python 180 42 Updated Dec 20, 2024

Performance-portable, length-agnostic SIMD with runtime dispatch

C++ 4,329 326 Updated Jan 9, 2025

A complete guide to start and improve in machine learning (ML), artificial intelligence (AI) in 2025 without ANY background in the field and stay up-to-date with the latest news and state-of-the-ar…

4,519 584 Updated Jan 1, 2025

A simple, performant and scalable Jax LLM!

Python 1,578 309 Updated Jan 10, 2025

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 6,250 660 Updated Jan 10, 2025

Trax — Deep Learning with Clear Code and Speed

Python 8,130 821 Updated Jan 8, 2025