Puzzles for learning Triton

Jupyter Notebook 1,196 92 Updated Nov 18, 2024

A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.

24 4 Updated Dec 8, 2024

Cataloging released Triton kernels.

142 7 Updated Aug 26, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,765 631 Updated Dec 11, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 35,170 4,340 Updated Nov 18, 2024

The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices

Python 1,958 325 Updated Dec 2, 2024

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

Python 19,217 1,350 Updated Dec 12, 2024

Efficient Triton Kernels for LLM Training

Python 3,813 228 Updated Dec 13, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,886 1,025 Updated Dec 11, 2024

This repository is a comprehensive collection of research papers, annotations, and concise summaries in the field of Natural Language Processing (NLP). It focuses on machine learning and deep learn…

7 2 Updated Feb 10, 2024

Marble Blast, but in a higher dimension

C# 3 Updated Aug 13, 2024
Python 4 Updated Nov 15, 2024

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 165 31 Updated Nov 17, 2024

📕 A clone of @rygorous series of posts on the graphics pipeline.

90 23 Updated Mar 7, 2021

GPUOcelot: A dynamic compilation framework for PTX

C++ 153 12 Updated Dec 12, 2024

NumPy & SciPy for GPU

Python 9,574 859 Updated Dec 13, 2024

Data Compression, Lossless implementation

Python 1 Updated Dec 27, 2022

A fully C++ deep learning framework.

C++ 45 4 Updated Jul 24, 2024

Documented and Unit Tested educational Deep Learning framework with Autograd from scratch.

Python 105 2 Updated Apr 10, 2024

My Rice Setup

Shell 956 121 Updated Aug 7, 2024

Desktop Backup Client for Borg Backup

Python 2,046 136 Updated Dec 12, 2024

A list of awesome beginners-friendly projects.

69,881 7,028 Updated Dec 3, 2024

Python Data Science Handbook: full text in Jupyter Notebooks

Jupyter Notebook 43,419 17,974 Updated Jun 26, 2024

Deduplicating archiver with compression and authenticated encryption.

Python 11,292 749 Updated Nov 25, 2024

Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

Jupyter Notebook 22,370 15,222 Updated Dec 22, 2023

PyCon 2015 Pandas tutorial materials

Jupyter Notebook 1,037 693 Updated Apr 12, 2024

Simple mathematical art

C++ 589 88 Updated Oct 31, 2019

The simplest way to run LLaMA on your local machine

CSS 13,101 1,416 Updated Jun 18, 2024

Lenia - Mathematical Life Forms

Python 3,558 223 Updated Jul 19, 2024

Probabilistic language based on pattern matching and constraint propagation, 153 examples

C# 7,551 317 Updated Nov 13, 2024