Skip to content
View Li-05's full-sized avatar

Block or report Li-05

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📰 Must-read papers and blogs on Speculative Decoding ⚡️

564 26 Updated Jan 16, 2025

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Python 1,023 107 Updated Apr 19, 2024

A curated list for Efficient Large Language Models

Python 1,401 104 Updated Dec 30, 2024

Ongoing research training transformer models at scale

Python 11,170 2,494 Updated Jan 23, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,959 345 Updated Dec 24, 2024

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,357 220 Updated Mar 20, 2024

Proper implementation of ResNet-s for CIFAR10/100 in pytorch that matches description of the original paper.

Python 1,249 334 Updated Jun 18, 2024

Accuracy 77%. Large batch deep learning optimizer LARS for ImageNet with PyTorch and ResNet, using Horovod for distribution. Optional accumulated gradient and NVIDIA DALI dataloader.

Python 38 8 Updated Jun 1, 2021

PyTorch深度学习快速入门教程(绝对通俗易懂!)

Python 2,966 656 Updated Jun 7, 2022

cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集,极市团队整理

12,462 2,298 Updated Apr 25, 2024