Skip to content
View yangminsi's full-sized avatar

Block or report yangminsi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
22 results for source starred repositories
Clear filter

[ICML 2020] On the Noisy Gradient Descent that Generalizes as SGD

Python 4 2 Updated Jun 27, 2020

PyTorch Implementation of Momentum-Based Policy Gradient Methods

Python 8 2 Updated Aug 12, 2020

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Jupyter Notebook 3,321 557 Updated May 25, 2024

这是一个vue仓库

JavaScript 307 143 Updated Oct 14, 2017

微信中的知乎--微信小程序 demo // Zhihu in Wechat

JavaScript 1,997 633 Updated Oct 27, 2022

Promise based HTTP client for the browser and node.js

JavaScript 106,483 11,029 Updated Mar 11, 2025

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,154 872 Updated Mar 24, 2023

Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"

Python 342 92 Updated Nov 22, 2018

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,100 4,892 Updated Aug 1, 2024

📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, in…

C++ 35,620 8,031 Updated Mar 19, 2024

📚 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计

179,290 51,206 Updated Aug 21, 2024

java项目实战练习

Java 3,344 1,479 Updated Aug 8, 2024

Reinforcement learning with the implementation of the emphatic TD of Sutton & al. (2015)

Python 1 Updated Jan 29, 2019

Experiment code for our project on actor-critic algorithms with emphatic weightings.

Jupyter Notebook 8 Updated Jul 6, 2023

Reinforcement learning resources curated

9,000 1,829 Updated May 25, 2023

莫烦Python Website source code

HTML 580 217 Updated Sep 21, 2020

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,030 6,085 Updated Jul 13, 2023

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

Jupyter Notebook 43,546 14,910 Updated Jul 26, 2024

牛客网刷题记录

Jupyter Notebook 1 1 Updated Mar 29, 2019

CS231课程笔记翻译 https://zhuanlan.zhihu.com/intelligentunit

497 135 Updated Apr 3, 2017
Python 2 Updated Jul 26, 2020

deep-learning-practice

Python 4 Updated Sep 15, 2019