Skip to content
View ZhouJiangbing's full-sized avatar

Block or report ZhouJiangbing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 3,607 1,065 Updated Jul 9, 2024

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 5,584 1,226 Updated Feb 15, 2025

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Python 776 120 Updated Mar 6, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,100 136 Updated Mar 3, 2025

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

524 24 Updated Nov 29, 2024

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

60 19 Updated Oct 16, 2023

A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

360 20 Updated Apr 24, 2024

Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"

Python 72 11 Updated Aug 27, 2024

Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.

139 12 Updated Mar 27, 2024

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

328 17 Updated Sep 12, 2024

A collection of LLM with RL papers

264 10 Updated Apr 24, 2024

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Python 2,132 183 Updated May 15, 2024

FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。

HTML 1,903 283 Updated May 8, 2024

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 3,898 216 Updated Mar 12, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

18,837 1,816 Updated Sep 19, 2024

HuggingLLM, Hugging Future.

Jupyter Notebook 2,935 380 Updated Mar 8, 2025

本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/

Jupyter Notebook 7,037 805 Updated Mar 12, 2025

从0到1动手学习大模型技术

42 2 Updated Mar 25, 2024

《动手学大模型Dive into LLMs》系列编程实践教程

4,564 416 Updated Sep 20, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,745 1,891 Updated Apr 30, 2024

Diffusion Models, Recommender Systems, Recommendation, Diff4Rec

43 7 Updated Feb 18, 2025

Awesome-LLM: a curated list of Large Language Model

22,031 1,803 Updated Mar 4, 2025

大语言模型相关开源项目汇总

7 Updated Jan 25, 2025

Survey: A collection of AWESOME papers and resources on the large language model (LLM) related recommender system topics.

1,217 68 Updated Mar 5, 2025

🧑‍🚀 全世界最好的LLM资料总结(数据处理、模型训练、模型部署、o1 模型、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.

4,048 424 Updated Mar 12, 2025

《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣

CSS 3,153 211 Updated Mar 5, 2025

AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games

Python 131 8 Updated Dec 20, 2024
Next