Stars
📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉
Transformer related optimization, including BERT, GPT
A high-throughput and memory-efficient inference and serving engine for LLMs
A curated list of awesome edge AI computing / system research.
Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"
本项目分享了中山大学计算机学院本科和研究生阶段的课程资料、笔记、期末考试卷和其他实用的相关资源。希望对同学们的学习有所帮助❤️,如果喜欢记得给个star🌟
Survey Paper List - Efficient LLM and Foundation Models
"Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版