Skip to content
View ranpin's full-sized avatar

Block or report ranpin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

国科大硕士/博士学位论文LeTeX模板, 以《中国科学院大学研究生学位论文撰写规范指导意见》(校发学位字[2022]40号, 附件1) 作为撰写要求

TeX 16 5 Updated Mar 5, 2025
Python 6 Updated Feb 24, 2025

LaTeX Thesis Template for the University of Chinese Academy of Sciences

TeX 3,576 943 Updated Feb 29, 2024

黑马程序员最新Java项目实战《苍穹外卖》,最适合新手的SpringBoot+SSM企业级项目实战 相比于瑞吉外卖苍穹外卖的业务更加真实完整,用户端改为微信小程序,登录改为了微信登录,加入了统计报表,来单提醒,客户催单,订单管理等功能,业务实现了闭环。技术选型更加丰富和实用。可以认为是增强版瑞吉外卖

JavaScript 541 136 Updated Jul 13, 2023

推荐系统入门教程,在线阅读地址:https://datawhalechina.github.io/fun-rec/

Jupyter Notebook 5,299 890 Updated Feb 22, 2025

Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案

Python 338 41 Updated Nov 17, 2024

本项目为书籍《大模型RAG实战》的代码以及资料汇总。

Jupyter Notebook 142 20 Updated Nov 18, 2024

向量检索与 RAG 实践:技术、实现与应用

93 16 Updated Nov 5, 2024

大模型/LLM推理和部署理论与实践

192 31 Updated Feb 9, 2025

EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]

Python 1,024 92 Updated Aug 13, 2023

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

Cuda 1,076 64 Updated Feb 28, 2025
Jupyter Notebook 138 6 Updated Jun 2, 2023

Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.

Python 122 13 Updated Feb 25, 2025
Python 3 Updated Sep 23, 2024

Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Python 207 9 Updated Aug 19, 2024

Development repository for the Triton language and compiler

MLIR 14,763 1,845 Updated Mar 8, 2025

Simplify your onnx model

C++ 3,995 389 Updated Sep 3, 2024

Inference optimization of the ViT model using TensorRT, NVIDIA's high-performance deep learning inference platform. TensorRT is designed to maximize the efficiency of deep learning models during in…

2 Updated Aug 17, 2024
Python 5 Updated Feb 12, 2025

TensorRT 2022 亚军方案,tensorrt加速mobilevit模型

Python 61 6 Updated Jun 22, 2022

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 714 64 Updated Dec 30, 2024

how to optimize some algorithm in cuda.

Cuda 1,957 174 Updated Mar 5, 2025

A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deploym…

Python 768 56 Updated Mar 3, 2025

校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

C++ 2,756 311 Updated Oct 26, 2024

This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation

Python 26 1 Updated Jan 15, 2024
Next