-
Gaoling School of Artificial Intelligence, Renmin University of China
Highlights
- Pro
-
KuaiComt.github.io Public
A Joint Video and Comment Recommendation Dataset
CSS Other UpdatedJan 24, 2025 -
-
LAST Public
Do Not Wait: Learning Re-Ranking Model Without User Feedback At Serving Time in E-Commerce (RecSys 2024)
-
UOEP Public
Reinforcing Long-Term Performance in Recommender Systems with User-Oriented Exploration Policy (SIGIR 2024)
-
Controllable Multi-Objective Re-ranking with Policy Hypernetworks (SIGKDD 2023)
-
ruc_gsai_rl Public
这是中国人民大学高瓴人工智能学院本科课程《强化学习》的期末项目安排,项目内容是训练一个适用于国标麻将的强化学习智能体。
-
-
The source code of ``Counteracting Duration Bias in Video Recommendation via Counterfactual Watch Time''. In KDD'24
Python UpdatedMay 24, 2024 -
EasyRL4Rec Public
Forked from chongminggao/EasyRL4RecJupyter Notebook MIT License UpdatedMar 19, 2024 -
WSDM2022-PTUPCDR Public
Forked from easezyc/WSDM2022-PTUPCDRThis is the official implementation of our paper Personalized Transfer of User Preferences for Cross-domain Recommendation (PTUPCDR), which has been accepted by WSDM2022.
Python UpdatedJul 17, 2023 -
FCPO Public
Forked from TobyGE/FCPOcode for ‘Towards Long-term Fairness in Recommendation’
Python GNU General Public License v3.0 UpdatedApr 23, 2023 -
LOMPO Public
Forked from rmrafailov/LOMPOOfficial Codebase for Offline Reinforcement Learning from Images with Latent Space Models
Python UpdatedMay 15, 2022 -
pareto-hypernetworks Public
Forked from AvivNavon/pareto-hypernetworksOfficial implementation of Learning The Pareto Front With HyperNetworks [ICLR 2021]
-
-
IHGNN Public
Forked from CDboyOne/IHGNNPyTorch implementation of Interactive Hypergraph Neural Network for Personalized Product Search (IHGNN)
Python MIT License UpdatedFeb 27, 2022 -
-
LibRerank Public
Forked from LibRerank-Community/LibRerankLibRerank is a toolkit for re-ranking algorithms. There are a number of re-ranking algorithms, such as PRM, DLCM, GSF, miDNN, SetRank, EGRerank, Seq2Slate.
Python MIT License UpdatedFeb 21, 2022 -
postgraduate-recommendation Public
Forked from NJU-SE-15-share-review/postgraduate-recommendation南大软院保研攻略
Python UpdatedFeb 19, 2022 -
DouZero Public
Forked from kwai/DouZero[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI
-
rlcard Public
Forked from datamllab/rlcardReinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
Python MIT License UpdatedDec 16, 2021 -
alpha-zero-general Public
Forked from suragnair/alpha-zero-generalA clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Jupyter Notebook MIT License UpdatedDec 5, 2021 -
attention-is-all-you-need-pytorch Public
Forked from jadore801120/attention-is-all-you-need-pytorchA PyTorch implementation of the Transformer model in "Attention is All You Need".
Python MIT License UpdatedNov 23, 2021 -
rlkit Public
Forked from rail-berkeley/rlkitCollection of reinforcement learning algorithms
Python MIT License UpdatedOct 30, 2021 -
King-of-Pigeon Public
Forked from yuezih/King-of-Pigeon欢迎 follow me 或 star this repository ,有机会将会更新其它有趣的模板。
Python MIT License UpdatedOct 27, 2021 -
-
ocr.pytorch Public
Forked from courao/ocr.pytorchA pure pytorch implemented ocr project including text detection and recognition
Python MIT License UpdatedSep 3, 2021 -
multiagent-particle-envs Public
Forked from openai/multiagent-particle-envsCode for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Python MIT License UpdatedAug 30, 2021 -
a-PyTorch-Tutorial-to-Image-Captioning Public
Forked from sgrvinod/a-PyTorch-Tutorial-to-Image-CaptioningShow, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Python MIT License UpdatedJul 26, 2021 -
StarCraft Public
Forked from starry-sky6688/MARL-AlgorithmsImplementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
Python UpdatedJun 15, 2021 -
maddpg Public
Forked from openai/maddpgCode for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Python MIT License UpdatedMay 31, 2021