This repository contains papers and other resources for the Research Methods class:
- Playing Atari with Deep Reinforcement Learning, Algorithm: DQN.
- Asynchronous Methods for Deep Reinforcement Learning
- Grandmaster level in StarCraft II using multi-agent reinforcement learning
- The Ingredients of Real World Robotic Reinforcement Learning
- Concrete Problems in AI Safety
- Deep Reinforcement Learning that matters
- Benchmarking Reinforcement Learning Algorithms on Real-World Robots
- Dueling Network Architectures for Deep Reinforcement Learning
- DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
- Deep reinforcement learning from human preferences
- Strategic Attentive Writer for Learning Macro-Actions
- Prioritized Experience Replay
- Learning to Paint With Model-based Deep Reinforcement Learning
- A Review of Cooperative Multi-Agent Deep Reinforcement Learning
- Machine Learning for Combinatorial Optimization: a Methodological Tour d’Horizon
- Deep Reinforcement Learning for Autonomous Driving: A Survey
- Resource Management with Deep Reinforcement Learning
- Deep Reinforcement Learning with Double Q-learning
- Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
- Language Models are Few-Shot Learners
- Towards a Human-like Open-Domain Chatbot
- Neural Combinatorial Optimization with Reinforcement Learning
- Multi-Agent Reinforcement Learning for Dynamic Routing Games: A Unified Paradigm
- Deep Reinforcement Learning for Cyber Security
- Learning Combinatorial Optimization Algorithms over Graphs
- Challenges of Real-World Reinforcement Learning
- Deep reinforcement learning for time series: playing idealized trading games
- Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving
- Unsupervised Doodling and Painting with Improved SPIRAL
- Dream to Control: Learning Behaviors by Latent Imagination
- Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning
- Proximal Policy Optimization Algorithms
- Deterministic Policy Gradient Algorithms
- QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
- Solving Rubik's Cube with a Robot Hand
- A Hybrid of Deep Reinforcement Learning and Local Search for the Vehicle Routing Problems
- Continuous control with deep reinforcement learning
- Variational Intrinsic Control
- FeUdal Networks for Hierarchical Reinforcement Learning
- Learning Dexterous In-Hand Manipulation
- The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
- Effective approaches to attention-based neural machine translation
- Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
- Neural Approaches to Conversational AI
- ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
- Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
- Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
- Listen, attend and spell: A neural network for large vocabulary conversational speech recognition
- Restricted Boltzmann Machines for Collaborative Filtering
- Analysis Methods in Neural Language Processing: A Survey
- Jukebox: A Generative Model for Music
- RoBERTa: A Robustly Optimized BERT Pretraining Approach
- XLNet: Generalized Autoregressive Pretraining for Language Understanding
- StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding
- Gated Graph Sequence Neural Networks
- Graph attention networks
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
- Pay attention to MLPs
- DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition
- Visualizing and Understanding Convolutional Networks
- Face Detection with a 3D Model
- Taming Transformers for High-Resolution Image Synthesis
- Deep learning-enabled medical computer vision
- Densely Connected Convolutional Networks
- Computer Vision for Autonomous Vehicles: Problems, Datasets and State of the Art
- Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning
- Spherical CNNs
- Adversarial Examples that Fool both Computer Vision and Time-Limited Humans
- A Closed-form Solution to Photorealistic Image Stylization
- Adam: A Method for Stochastic Optimization
- Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
- Deep Learning using Rectified Linear Units (ReLU)
- Deep Learning in Label-free Cell Classification
- Generative Pretraining from Pixels
- Deep Learning for Deepfakes Creation and Detection: A Survey
- Very deep convolutional networks for large-scale image recognition
- Anycost GANs for Interactive Image Synthesis and Editing
- Deep-Anomaly: Fully Convolutional Neural Network for Fast Anomaly Detection in Crowded Scenes
- Zero-Shot Text-to-Image Generation
- Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
- Learning Transferable Visual Models From Natural Language Supervision
- Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
- Attention is all you need
- Deep Learning for Hate Speech Detection in Tweets
- Deep content-based music recommendation
- Sequence to sequence model
- Emotion-Cause Pair Extraction: A New Task To Emotion Analysis In Texts
- A Neural Conversational Model
- Neural machine translation by jointly learning to align and translate
- High Fidelity Speech Synthesis with Adversarial Networks
- WinoGrande: An Adversarial Winograd Schema Challenge at Scale
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- Onsets and Frames: Dual-Objective Piano Transcription
- Reformer: The Efficient Transformer
- Animating Pictures with Eulerian Motion Fields
- Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews
- Convolutional sequence to sequence learning
- ImageNet Classification with Deep Convolutional Neural Networks
- Deep Residual Learning for Image Recognition
- You Only Look Once: Unified, Real-Time Object Detection
- Old Photo Restoration via Deep Latent Space Translation
- Generative Adversarial Networks
- Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation