crcrpar
  • NVIDIA
  • Tokyo

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31,863 4,844 Updated Dec 14, 2024

OpTree: Optimized PyTree Utilities

Python 156 7 Updated Dec 12, 2024

cuDNN fuzzers

Python 3 Updated Nov 1, 2024

Efficient GPU support for LLM inference with x-bit quantization (e.g., FP6, FP5).

Cuda 217 16 Updated Oct 28, 2024

Efficient Triton Kernels for LLM Training

Python 3,822 229 Updated Dec 13, 2024

Official repository for "AM-RADIO: Reduce All Domains Into One"

Jupyter Notebook 844 36 Updated Dec 10, 2024

A native PyTorch Library for large model training

Python 2,754 222 Updated Dec 14, 2024

PyTorch native quantization and sparsity for training and inference

Python 1,666 184 Updated Dec 14, 2024

NVIDIA Math Libraries for the Python Ecosystem

Cython 214 12 Updated Nov 19, 2024

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,421 305 Updated Oct 19, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 6,407 640 Updated Dec 13, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,890 1,024 Updated Dec 11, 2024

A central repository to keep track of the status of work on and support for free-threaded CPython (see PEP 703), with a focus on the scientific and ML/AI ecosystem

165 22 Updated Dec 13, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,860 4,167 Updated Dec 14, 2024

An innovative superfamily of fonts for code

TypeScript 14,529 249 Updated Jul 15, 2024

Oceanic Next theme for neovim

Vim Script 1,147 141 Updated Apr 11, 2024

JAX-Toolbox

Jupyter Notebook 268 50 Updated Dec 14, 2024
Python 4 2 Updated Jan 26, 2022

Clean & Elegant Color Scheme inspired by Atom One and Material

Vim Script 911 57 Updated Nov 25, 2024

WholeGraph - large scale Graph Neural Networks

Cuda 101 37 Updated Nov 25, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,906 1,084 Updated Dec 9, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 27,172 3,019 Updated Dec 14, 2024

Open alternative files for the current buffer

Lua 398 30 Updated Nov 30, 2024

NeoSolarized colorscheme for NeoVim with full transparency

Lua 177 16 Updated Jun 20, 2024

🌙 A better color scheme for the late night coder

Vim Script 774 65 Updated Aug 1, 2023

Interactively select and swap function arguments, list elements, and much more. Powered by tree-sitter.

Lua 509 22 Updated Aug 10, 2024

A pleasant, mild, dark (n)vim theme.

Vim Script 73 11 Updated May 10, 2023

Another attempt of a flat Gruvbox theme for Neovim

Lua 244 22 Updated Aug 7, 2024

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 1,961 156 Updated Mar 27, 2024

Dim inactive windows in Neovim using window-local highlight namespaces.

Lua 349 10 Updated Nov 7, 2024