yyz666ai

yyz666 yyz666ai

2 followers · 3 following

Lists (3)

Stars

xiaotudui / pytorch-tutorial

A quick-start tutorial for deep learning with PyTorch (absolutely easy to understand!)

Python 3,005 659 Updated Feb 9, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 9,487 1,233 Updated Feb 1, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 18,288 1,534 Updated Feb 10, 2025

deepseek-ai / DeepSeek-R1

70,914 9,133 Updated Feb 8, 2025

HqWu-HITCS / Awesome-Chinese-LLM

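A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are relatively cheap to train. It covers base models, domain-specific fine-tuning and applications, datasets, tutorials, and more.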

18,100 1,736 Updated Sep 19, 2024

MLNLP-World / LLMs-from-scratch-CN

Chinese translation of the LLMs-from-scratch project

Jupyter Notebook 202 37 Updated Feb 8, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,483 147 Updated Feb 8, 2025

debanjandhar12 / logseq-anki-sync

A Logseq-to-Anki syncing plugin with superpowers: image occlusion, card direction, incremental cards, and a lot more.

TypeScript 486 34 Updated Feb 9, 2025

krahets / LeetCode-Book

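Python, Java, and C++ solution code for 《剑指 Offer》 (Coding Interviews), and the companion code repository for the LeetBook 《图解算法数据结构》 (Illustrated Algorithms and Data Structures).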

Java 6,916 802 Updated May 11, 2024

ZJU-LLMs / Foundations-of-LLMs

2,993 304 Updated Jan 14, 2025

deepseek-ai / DeepSeek-V3

Python 81,956 13,030 Updated Feb 8, 2025

krahets / hello-algo

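Hello Algo: a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; an English version is in progress.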

Java 108,763 13,544 Updated Feb 10, 2025

harvardnlp / annotated-transformer

An annotated implementation of the Transformer paper.

Jupyter Notebook 5,974 1,270 Updated Apr 7, 2024

Po11uxx / ray-rllib-mindspore

MindSpore support added to Ray RLlib

2 Updated Dec 20, 2024

88250 / lute

🎼 A structured Markdown engine that supports Go and JavaScript.

Go 1,388 139 Updated Feb 3, 2025

siyuan-note / siyuan

A privacy-first, self-hosted, fully open-source personal knowledge management application, written in TypeScript and Go.

TypeScript 28,750 1,862 Updated Feb 10, 2025

AlibabaCloudDocs / aliyun_acp_learning

Jupyter Notebook 40 15 Updated Feb 7, 2025

datawhalechina / llms-from-scratch-cn

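Build a large language model from scratch with only basic Python knowledge; build GLM4, Llama3, and RWKV6 step by step from scratch to gain a deep understanding of how large models work.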

Jupyter Notebook 2,225 318 Updated Aug 15, 2024

waylandzhang / Transformer-from-scratch

Jupyter Notebook 314 85 Updated Apr 29, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 28,245 3,265 Updated Jan 26, 2025

boyu-ai / Hands-on-NLP

https://hnlp.boyuai.com

Jupyter Notebook 55 9 Updated Oct 4, 2024

tinyzqh / light_mappo

Lightweight version of MAPPO to help you quickly migrate to your local environment.

Python 571 86 Updated Jan 4, 2025

maxpumperla / learning_ray

Notebooks for the O'Reilly book "Learning Ray"

Jupyter Notebook 274 69 Updated Apr 25, 2024

philtabor / Multi-Agent-Deep-Deterministic-Policy-Gradients

A PyTorch implementation of the multi-agent deep deterministic policy gradient (MADDPG) algorithm

Python 319 74 Updated Apr 8, 2021

wjh720 / QPLEX

Python 91 28 Updated Nov 13, 2020

OpenRL-Lab / Ray_Tutorial

Tutorial for Ray

Python 17 3 Updated Mar 31, 2024

ray-project / ray-educational-materials

A suite of hands-on training materials that show how to scale CV, NLP, and time-series forecasting workloads with Ray.

Jupyter Notebook 369 69 Updated Feb 13, 2024

marlbenchmark / off-policy

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Python 445 73 Updated Jul 21, 2023

marlbenchmark / on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,416 314 Updated Jul 18, 2024

mindspore-lab / mindrl

A high-performance, scalable MindSpore reinforcement learning framework.

Python 44 8 Updated Jul 1, 2024

Lists (3)

AI_Learning

paper

python package
