-
HKUST
Highlights
- Pro
Stars
📰 Must-read papers and blogs on Speculative Decoding ⚡️
A self-learning tutorail for CUDA High Performance Programing.
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
Tips for Writing a Research Paper using LaTeX
Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"
PyTorch implementation of Language model compression with weighted low-rank factorization
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
A Langchain email agent that responds to incoming email. Email service with AWS SES.
😎 Awesome list of tools and projects with the awesome LangChain framework
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
✨✨Latest Advances on Multimodal Large Language Models
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Homework solutions for CMU 10-414/714 – Deep Learning Systems: Algorithms and Implementation
XQUIC Library released by Alibaba is a cross-platform implementation of QUIC and HTTP/3 protocol.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.