- KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon
Stars
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Large Language Model (LLM) Systems Paper List
My learning notes/codes for ML SYS.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
A generative world for general-purpose robotics & embodied AI learning.
[KAIST CS632] Road damage detection using YOLOv8 on Xilinx FPGA, repair estimation with vLLM-Serve Phi-3.5 FAISS RAG, and data management via GS1 EPCISv2 and React dashboard
SGLang is a fast serving framework for large language models and vision language models.
Efficient LLM Inference over Long Sequences
References content from the OLCF CUDA Training Series. (https://github.com/olcf/cuda-training-series)
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Vitis AI is Xilinx’s development stack for AI inference on Xilinx hardware platforms, including both edge devices and Alveo cards.
[ICISTS 2024 Hackafair Gold Award for 1st Place] GenEraser: A Customizable Profanity Filtering Service for Each Community with Automatically Updating Filters using RAG
[2024 Social Problem-Solving Volunteer Idea Hackathon 1st Place Winner] Generative Volunteer Reward System for Creating "My Own Characters" using GenAI, GreenThread
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
SeungjaeLim / ALLeX
Forked from JJong84/ALLeX
[2024 SPARCS Science Hackathon Top Award (1st)] Web-based XR labs for ALL types of eXperiments, ALLeX
Learn CUDA Programming, published by Packt
The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".
CUDATracePreload is a dynamic tracing tool for CUDA and NCCL API calls.