🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,053 989 Updated Dec 13, 2024

LeapLabTHU / Agent-Attention

Official repository of Agent Attention (ECCV2024)

Python 551 37 Updated Nov 17, 2024

lucidrains / linear-attention-transformer

Transformer based on a variant of attention that is linear complexity in respect to sequence length

Python 707 67 Updated May 5, 2024

pprp / Awesome-LLM-Quantization

Awesome list for LLM quantization

Python 134 10 Updated Dec 12, 2024

facebookresearch / SpinQuant

Code repo for the paper "SpinQuant LLM quantization with learned rotations"

Python 183 17 Updated Nov 11, 2024

NVIDIA / cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 6,583 1,859 Updated Jul 26, 2024

prprbr / awesome-lifelong-continual-learning

A list of papers, blogs, datasets and software in the field of lifelong/continual machine learning

280 44 Updated Mar 6, 2021

jadore801120 / attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,932 1,987 Updated Apr 16, 2024

AlexanderMath / fasth

Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.

Python 70 10 Updated Jul 25, 2024

linux-surface / linux-surface

Linux Kernel for Surface Devices

Shell 5,284 226 Updated Dec 11, 2024

NVIDIA / cutlass

CUDA Templates for Linear Algebra Subroutines

C++ 5,822 1,005 Updated Dec 11, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 14,621 1,371 Updated Dec 13, 2024

hhb072 / OrthogonalTransformer

Python 10 Updated Oct 20, 2023

davidtvs / pytorch-lr-finder

A learning rate range test implementation in PyTorch

Python 929 120 Updated Dec 1, 2024

Goutam-Kelam / LayerOut

A new regularization technique that freezes the layers of the deep neural networks stochastically.

Python 4 Updated Jan 6, 2021

VaticanCameos99 / knowledge-distillation-for-unet

An implementation of Knowledge distillation for segmentation, to train a small (student) UNet from a larger (teacher) UNet thereby reducing the size of the network while achieving performance simil…

Python 51 13 Updated May 7, 2020

lezcano / expRNN

Optimization with orthogonal constraints and on general manifolds

Python 126 21 Updated Jul 13, 2020

cdluminate / cdluminate

TeX 5 Updated Dec 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can Goksen cangoksen

Achievements

Achievements

Highlights

Block or report cangoksen

Stars

muhac / docker-jupyter-pytorch

QingruZhang / AdaLoRA

atfortes / Awesome-LLM-Reasoning

atfortes / Awesome-Controllable-Diffusion

lucidrains / local-attention

mit-han-lab / streaming-llm

tomaarsen / attention_sinks

karpathy / LLM101n

tmabraham / diffusion_reading_group

Dao-AILab / fast-hadamard-transform

MattShannon / bandmat

openai / blocksparse

huggingface / accelerate