
Collect every awesome work about r1!

Python 15 Updated Feb 1, 2025

Optimizing inference proxy for LLMs

Python 1,982 156 Updated Jan 31, 2025

RAG paper study notes

40 2 Updated Jan 24, 2025
Python 365 44 Updated Jul 6, 2023

High-speed downloads from a mirror site using HuggingFace's official download tool.

Python 937 84 Updated Oct 12, 2024

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

239 7 Updated Jan 7, 2025

OpenLLMWiki: Docs of OpenLLMAI. Survey, reproduction and domain/task adaptation of open source chatgpt alternatives/implementations. PiXiu-貔貅 means fortune.

255 15 Updated Dec 10, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,702 880 Updated Jan 28, 2025

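Chinese-language repository for Llama3 and Llama3.1 (accompanying book in progress... interesting weights from fine-tuned and modified versions by community members and vendors & tutorial videos and docs for training, inference, evaluation, and deployment)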

Python 4,119 340 Updated Sep 16, 2024

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 4,490 586 Updated Dec 26, 2024
Python 2,452 286 Updated Jan 31, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 44,858 4,789 Updated Jan 22, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,178 405 Updated Jan 30, 2025

personal chatgpt

Jupyter Notebook 335 62 Updated Dec 16, 2024

CRUXEval: Code Reasoning, Understanding, and Execution Evaluation

Python 123 14 Updated Oct 11, 2024

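Roadmaps, tutorials, and practical resources for machine learning, deep learning, natural language processing, computer vision, various algorithms, and other AI-related technologies. Notes include: Machine Learning in Action, 剑指Offer (Coding Interviews), cs231n, cs131, Andrew Ng's Machine Learning, cs224n, and Python Natural Language Processing in Practice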

Python 565 143 Updated Nov 14, 2020

Scalable toolkit for efficient model alignment

Python 699 86 Updated Jan 31, 2025

👨‍💻 An awesome and curated list of best code-LLM for research.

1,112 64 Updated Dec 10, 2024

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

2,024 130 Updated Jan 26, 2025

A library for advanced large language model reasoning

Python 1,696 149 Updated Jan 31, 2025

Tutorial4RL: Tutorial for Reinforcement Learning. An introductory tutorial on reinforcement learning.

130 12 Updated Mar 27, 2024

Deep Reinforcement Learning

3,496 599 Updated Dec 10, 2022

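This project shares the technical principles behind large models along with practical experience (large-model engineering and real-world application deployment)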

HTML 13,477 1,516 Updated Jan 15, 2025

Chinese llama2, from pre-training to reinforcement learning

Python 85 14 Updated Oct 19, 2023

Inference code for LLaMA models

Python 113 26 Updated Aug 13, 2023

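"A Guide to Using Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying domestic and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment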

Jupyter Notebook 11,823 1,342 Updated Jan 25, 2025

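Natural language processing study notes: principles and examples of machine learning and deep learning, based on the TensorFlow and PyTorch frameworks; detailed explanations of the latest pre-trained models such as Transformer, BERT, and ALBERT, with source code; and a range of NLP tasks built on pre-trained models. Model deployment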

Jupyter Notebook 373 68 Updated Jun 19, 2020
Python 5 Updated Jul 14, 2024