Skip to content
View yyz666ai's full-sized avatar

Block or report yyz666ai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch深度学习快速入门教程(绝对通俗易懂!)

Python 3,005 659 Updated Feb 9, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 9,487 1,233 Updated Feb 1, 2025

Fully open reproduction of DeepSeek-R1

Python 18,288 1,534 Updated Feb 10, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

18,100 1,736 Updated Sep 19, 2024

LLMs-from-scratch项目中文翻译

Jupyter Notebook 202 37 Updated Feb 8, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,483 147 Updated Feb 8, 2025

An logseq to anki syncing plugin with superpowers - image occlusion, card direction, incremental cards, and a lot more.

TypeScript 486 34 Updated Feb 9, 2025

《剑指 Offer》 Python, Java, C++ 解题代码,LeetBook《图解算法数据结构》配套代码仓

Java 6,916 802 Updated May 11, 2024

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Java 108,763 13,544 Updated Feb 10, 2025

An annotated implementation of the Transformer paper.

Jupyter Notebook 5,974 1,270 Updated Apr 7, 2024

mindspore added to ray-rllib

2 Updated Dec 20, 2024

🎼 一款结构化的 Markdown 引擎,支持 Go 和 JavaScript。A structured Markdown engine that supports Go and JavaScript.

Go 1,388 139 Updated Feb 3, 2025

A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

TypeScript 28,750 1,862 Updated Feb 10, 2025
Jupyter Notebook 40 15 Updated Feb 7, 2025

仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理

Jupyter Notebook 2,225 318 Updated Aug 15, 2024
Jupyter Notebook 314 85 Updated Apr 29, 2024

The official Meta Llama 3 GitHub site

Python 28,245 3,265 Updated Jan 26, 2025

https://hnlp.boyuai.com

Jupyter Notebook 55 9 Updated Oct 4, 2024

Lightweight version of MAPPO to help you quickly migrate to your local environment.

Python 571 86 Updated Jan 4, 2025

Notebooks for the O'Reilly book "Learning Ray"

Jupyter Notebook 274 69 Updated Apr 25, 2024

A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

Python 319 74 Updated Apr 8, 2021
Python 91 28 Updated Nov 13, 2020

Tutorial for Ray

Python 17 3 Updated Mar 31, 2024

This is suite of the hands-on training materials that shows how to scale CV, NLP, time-series forecasting workloads with Ray.

Jupyter Notebook 369 69 Updated Feb 13, 2024

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Python 445 73 Updated Jul 21, 2023

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,416 314 Updated Jul 18, 2024

A high-performance, scalable MindSpore reinforcement learning framework.

Python 44 8 Updated Jul 1, 2024
Next