Skip to content
View SeungjaeLim's full-sized avatar
  • KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon
  • 20:54 (UTC +09:00)

Highlights

  • Pro

Organizations

@homey2023 @PlaceEYE

Block or report SeungjaeLim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 71,157 36,939 Updated Feb 17, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,896 485 Updated Feb 21, 2025

Large Language Model (LLM) Systems Paper List

780 28 Updated Feb 20, 2025

My learning notes/codes for ML SYS.

Python 870 42 Updated Feb 21, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 31,166 2,570 Updated Feb 21, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 23,966 2,062 Updated Feb 21, 2025

[KAIST CS632] Road damage detection using YOLOv8 on Xilinx FPGA, repair estimation with vLLM-Serve Phi-3.5 FAISS RAG, and data management via GS1 EPCISv2 and React dashboard

Python 1 Updated Dec 19, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 10,425 1,018 Updated Feb 21, 2025

Efficient LLM Inference over Long Sequences

Python 359 19 Updated Feb 14, 2025

References content from the OLCF CUDA Training Series. (https://github.com/olcf/cuda-training-series)

Cuda 2 1 Updated Nov 21, 2024
Python 24 5 Updated Aug 31, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 1 1 Updated Nov 19, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,870 4,629 Updated Feb 21, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,661 128 Updated Jan 17, 2025

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

Python 3,596 280 Updated Feb 16, 2025
JavaScript 1 Updated Oct 30, 2024

Vitis AI is Xilinx’s development stack for AI inference on Xilinx hardware platforms, including both edge devices and Alveo cards.

Python 1,538 642 Updated Sep 12, 2024

Official inference framework for 1-bit LLMs

C++ 12,745 894 Updated Feb 18, 2025

[ICISTS 2024 Hackafair Gold Award for 1st Place] GenEraser: A Customizable Profanity Filtering Service for Each Community with Automatically Updating Filters using RAG

Python 1 Updated Oct 21, 2024

[2024 Social Problem-Solving Volunteer Idea Hackathon 1st Place Winner] Generative Volunteer Reward System for Creating "My Own Characters" using GenAI, GreenThread

JavaScript 1 Updated Aug 29, 2024

한국어 데이터 세트 링크

871 102 Updated Oct 14, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,025 6,469 Updated Jan 9, 2025

[2024 SPARCS Science Hackathon Top Award(1st)] Web-based XR labs for ALL types of eXperiments, ALLeX

TypeScript 2 Updated Oct 21, 2024

Learn CUDA Programming, published by Packt

Cuda 1,104 248 Updated Dec 30, 2023

SPARCS Science Hackathon 2024 1st Prize

TypeScript 2 2 Updated Oct 21, 2024

The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".

244 15 Updated Jan 21, 2025

AI tech blog

6 Updated Jan 21, 2025

CUDATracePreload is a dynamic tracing tool for CUDA and NCCL API calls.

C++ 3 Updated Dec 6, 2023
Next