Skip to content
View yangminsi's full-sized avatar

Block or report yangminsi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICML 2020] On the Noisy Gradient Descent that Generalizes as SGD

Python 4 2 Updated Jun 27, 2020

PyTorch Implementation of Momentum-Based Policy Gradient Methods

Python 8 2 Updated Aug 12, 2020

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Jupyter Notebook 3,302 554 Updated May 25, 2024

这是一个vue仓库

JavaScript 307 144 Updated Oct 14, 2017

微信中的知乎--微信小程序 demo // Zhihu in Wechat

JavaScript 1,981 635 Updated Oct 27, 2022

Promise based HTTP client for the browser and node.js

JavaScript 106,005 10,983 Updated Dec 18, 2024

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,032 858 Updated Mar 24, 2023

Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"

Python 343 91 Updated Nov 22, 2018

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 15,903 4,879 Updated Aug 1, 2024

A toolkit for developing and comparing reinforcement learning algorithms.

Python 1 Updated Sep 17, 2019

📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, in…

C++ 35,086 7,996 Updated Mar 19, 2024

📚 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计

178,176 51,169 Updated Aug 21, 2024

🏦 银行笔试面试经验分享及资料分享(help you pass the bank interview, and get a amazing bank offer!)

1 Updated May 7, 2020

source code for the paper "Policy Search by Target Distribution Learning for Continuous Control"

Python 1 Updated Feb 3, 2020

java项目实战练习

Java 3,265 1,469 Updated Aug 8, 2024

Reinforcement learning with the implementation of the emphatic TD of Sutton & al. (2015)

Python 1 Updated Jan 29, 2019

Experiment code for our project on actor-critic algorithms with emphatic weightings.

Jupyter Notebook 8 Updated Jul 6, 2023

The most cited deep learning papers

TeX 1 Updated Feb 24, 2019

Reinforcement learning resources curated

8,888 1,832 Updated May 25, 2023

 Now we have become very big, Different from the original idea. Collect premium software in various categories.

JavaScript 1 Updated Jan 23, 2019

pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行

Jupyter Notebook 1 Updated Jan 24, 2019

莫烦Python Website source code

HTML 577 217 Updated Sep 21, 2020

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 20,758 6,054 Updated Jul 13, 2023

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

Jupyter Notebook 43,456 14,931 Updated Jul 26, 2024

牛客网刷题记录

Jupyter Notebook 1 1 Updated Mar 29, 2019

CS231课程笔记翻译 https://zhuanlan.zhihu.com/intelligentunit

481 134 Updated Apr 3, 2017
Python 2 Updated Jul 26, 2020

CNN-RNN中文文本分类,基于tensorflow

Python 2 Updated Aug 27, 2018

BiLstm+CNN+CRF 法律文档(合同类案件)领域分词(100篇标注样本)

Python 2 Updated Aug 27, 2018

deep-learning-practice

Python 4 Updated Sep 15, 2019