-
VITA Public
Forked from VITA-MLLM/VITA✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
Python Other UpdatedJan 12, 2025 -
bert4torch Public
Forked from Tongjilibo/bert4torchAn elegent pytorch implement of transformers
Python MIT License UpdatedDec 29, 2024 -
Machine-Learning Public
Forked from xbeat/Machine-LearningCross Beat (xbe.at) - Your hub for python, machine learning and AI tutorials. Explore Python tutorials, AI insights, and more.
Creative Commons Zero v1.0 Universal UpdatedDec 21, 2024 -
graphrag Public
Forked from microsoft/graphragA modular graph-based Retrieval-Augmented Generation (RAG) system
Python MIT License UpdatedDec 18, 2024 -
unsloth Public
Forked from unslothai/unslothFinetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Python Apache License 2.0 UpdatedOct 18, 2024 -
AISystem Public
Forked from chenzomi12/AISystemAISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Jupyter Notebook Apache License 2.0 UpdatedOct 15, 2024 -
DeepSpeedExamples Public
Forked from deepspeedai/DeepSpeedExamplesExample models using DeepSpeed
Python Apache License 2.0 UpdatedAug 13, 2024 -
UnderstandingDeepLearning-ZH-CN Public
Forked from guowenfei-mathsfan/UnderstandingDeepLearning-ZH-CNUnderstandingDeepLearing中文翻译
TeX UpdatedJul 21, 2024 -
onnx-tensorrt Public
Forked from onnx/onnx-tensorrtONNX-TensorRT: TensorRT backend for ONNX
C++ Apache License 2.0 UpdatedJul 17, 2024 -
how-to-optim-algorithm-in-cuda Public
Forked from BBuf/how-to-optim-algorithm-in-cudahow to optimize some algorithm in cuda.
Cuda UpdatedJul 1, 2024 -
-
cuda-mode-lectures Public
Forked from gpu-mode/lecturesMaterial for cuda-mode lectures
Jupyter Notebook Apache License 2.0 UpdatedJun 13, 2024 -
Accelerate-Model-Training-with-PyTorch-2.X Public
Forked from PacktPublishing/Accelerate-Model-Training-with-PyTorch-2.XAccelerate Model Training with PyTorch 2.X, published by Packt
Jupyter Notebook MIT License UpdatedJun 12, 2024 -
gpu-optimization-workshop Public
Forked from mlops-discord/gpu-optimization-workshopSlides, notes, and materials for the workshop (cuda-mode)
UpdatedJun 1, 2024 -
SpeculativeDecodingPapers Public
Forked from hemingkx/SpeculativeDecodingPapers📰 Must-read papers and blogs on Speculative Decoding ⚡️
Apache License 2.0 UpdatedApr 30, 2024 -
llm-resource Public
Forked from liguodongiot/llm-resourceLLM全栈优质资源汇总
Shell Apache License 2.0 UpdatedApr 2, 2024 -
onnxruntime-inference-examples Public
Forked from microsoft/onnxruntime-inference-examplesExamples for using ONNX Runtime for machine learning inferencing.
Python MIT License UpdatedMar 10, 2024 -
DDP-practice Public
Forked from rickyang1114/DDP-practiceA demo of image classification with PyTorch DDP (DistributedDataParallel) and AMP (Automatic Mixed Precision) modules.
Python UpdatedMar 2, 2024 -
CUDA-From-Correctness-To-Performance-Code Public
Forked from interestingLSY/CUDA-From-Correctness-To-Performance-CodeCodes & examples for "CUDA - From Correctness to Performance"
C++ Apache License 2.0 UpdatedFeb 18, 2024 -
Awesome-LLM-Compression Public
Forked from HuangOwen/Awesome-LLM-CompressionAwesome LLM compression research papers and tools.
MIT License UpdatedFeb 17, 2024 -
transfomers-silicon-research Public
Forked from aliemo/transfomers-silicon-researchResearch and Materials on Hardware implementation of Transformer Model
Jupyter Notebook MIT License UpdatedFeb 16, 2024 -
LLMSurvey Public
Forked from RUCAIBox/LLMSurveyThe official GitHub page for the survey paper "A Survey of Large Language Models".
Python UpdatedJan 10, 2024 -
-
intel-extension-for-transformers Public
Forked from intel/intel-extension-for-transformers⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
C++ Apache License 2.0 UpdatedNov 16, 2023 -
-
Selective_Context Public
Forked from liyucheng09/Selective_ContextCompress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.
-
udlbook Public
Forked from udlbook/udlbookUnderstanding Deep Learning - Simon J.D. Prince
Jupyter Notebook Other UpdatedOct 14, 2023 -
MyTinySTL Public
Forked from Alinshans/MyTinySTLAchieve a tiny STL in C++11
C++ Other UpdatedOct 10, 2023 -
puck Public
Forked from baidu/puckPuck is a high-performance ANN search engine
Jupyter Notebook Apache License 2.0 UpdatedSep 19, 2023 -
d2l-zh Public
Forked from d2l-ai/d2l-zh《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Python Apache License 2.0 UpdatedSep 17, 2023