Skip to content
View ndkling's full-sized avatar

Block or report ndkling

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

External repo for the EFAAR benchmarking paper

Jupyter Notebook 18 4 Updated Mar 4, 2025

Examples in the MLX framework

Python 7,054 995 Updated Mar 4, 2025

MLX: An array framework for Apple silicon

C++ 19,402 1,104 Updated Mar 5, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 39,577 5,640 Updated Mar 5, 2025

Discord server https://discord.gg/HrV52MgSC2 QQ频道 https://pd.qq.com/s/1dwwmkgq4

TypeScript 1,023 74 Updated Feb 27, 2025

Kolmogorov Arnold Networks

Jupyter Notebook 15,468 1,458 Updated Jan 19, 2025

Real-time Fallacy Detection using OpenAI whisper and ChatGPT/LLaMA/Mistral

Python 112 12 Updated Dec 10, 2023

LLM inference in C/C++

C++ 75,848 10,971 Updated Mar 5, 2025

userspace daemon to combine joy-cons from the hid-nintendo kernel driver

C++ 402 72 Updated Feb 21, 2024

8-bit CUDA functions for PyTorch Rocm compatible

Python 39 7 Updated Mar 26, 2024

List USB devices and reset a USB device from the command line

Python 111 39 Updated Jun 19, 2023

Repository to download, process, and visualize local climate data from ERA5

R 4 3 Updated Mar 21, 2022

Go ahead and axolotl questions

Python 8,786 968 Updated Mar 4, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,208 4,278 Updated Mar 5, 2025

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Python 9,482 540 Updated Sep 7, 2024

ENSO-ASC: ENSO deep learning forecast model with a multivariate air-sea coupler

Python 14 8 Updated Mar 16, 2023

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 69,680 7,500 Updated Mar 5, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,292 6,036 Updated Mar 5, 2025

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,046 162 Updated Mar 27, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,734 508 Updated Jan 21, 2025

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Python 679 43 Updated Aug 13, 2024

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python 2,832 222 Updated Sep 30, 2023

A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.

Python 308 15 Updated Aug 22, 2023

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,802 233 Updated Mar 3, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,270 835 Updated Jun 10, 2024

An autonomous AI agent extension for Oobabooga's web ui

Python 175 13 Updated Sep 7, 2023

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 87,534 23,515 Updated Mar 5, 2025

Oobabooga extension for Bark TTS

Python 118 15 Updated Nov 23, 2023

Universal LLM Deployment Engine with ML Compilation

Python 20,114 1,676 Updated Mar 3, 2025

AMD ROCm™ Software - GitHub Home

Shell 5,028 408 Updated Mar 5, 2025
Next