Projects
Reading
People
Chat
SU\G
(đž)
/K·U
Projects
Reading
People
Chat
Sign Up
Light
Dark
System
Mehdi Mirza
Follow
Share
Generating author description...
All published works
Action
Title
Year
Authors
+
PDF
Chat
Generative Adversarial Networks
2022
Ian J. Goodfellow
Jean Pouget-Abadie
Mehdi Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
+
Evaluating model-based planning and planner amortization for continuous control
2021
Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
Mehdi Mirza
Alessandro Davide Ialongo
Yuval Tassa
Jost Tobias Springenberg
Abbas Abdolmaleki
Nicolas Heess
Josh Merel
+
PDF
Chat
Generative adversarial networks
2020
Ian Goodfellow
Jean Pouget-Abadie
Mehdi Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
+
Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban.
2020
PĂ©ter Karkus
Mehdi Mirza
Arthur Guez
Andrew Jaegle
Timothy Lillicrap
Lars Buesing
Nicolas Heess
Théophane Weber
+
Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
2020
Mehdi Mirza
Andrew Jaegle
Jonathan J. Hunt
Arthur Guez
Saran Tunyasuvunakool
Alistair Muldal
Théophane Weber
PĂ©ter Karkus
SĂ©bastien RacaniĂšre
Lars Buesing
+
PDF
Chat
Optimizing agent behavior over long time scales by transporting value
2019
Chia-Chun Hung
Timothy Lillicrap
Josh Abramson
Yan Wu
Mehdi Mirza
Federico Carnevale
Arun Ahuja
Greg Wayne
+
An investigation of model-free planning
2019
Arthur Guez
Mehdi Mirza
Karol Gregor
Rishabh Kabra
SĂ©bastien RacaniĂšre
Théophane Weber
David Raposo
Adam Santoro
Laurent Orseau
Tom Eccles
+
Optimizing Agent Behavior over Long Time Scales by Transporting Value
2018
Chia-Chun Hung
Timothy Lillicrap
Josh Abramson
Yan Wu
Mehdi Mirza
Federico Carnevale
Arun Ahuja
Greg Wayne
+
Unsupervised Predictive Memory in a Goal-Directed Agent
2018
Greg Wayne
Chia-Chun Hung
Amos David
Mehdi Mirza
Arun Ahuja
Agnieszka GrabskaâBarwiĆska
Jack W. Rae
Piotr Mirowski
Joel Z. Leibo
Adam Santoro
+
Probing Physics Knowledge Using Tools from Developmental Psychology
2018
Luis Piloto
Ari Weinstein
Dhruva Tb
Arun Ahuja
Mehdi Mirza
Greg Wayne
Amos David
Chia-Chun Hung
Matthew Botvinick
+
Asynchronous Methods for Deep Reinforcement Learning
2016
Volodymyr Mnih
AdriĂ PuigdomĂšnech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
+
Theano: A Python framework for fast computation of mathematical expressions
2016
The Theano Development Team
Rami AlâRfou
Guillaume Alain
Amjad Almahairi
Christof Angermueller
Dzmitry Bahdanau
Nicolas Ballas
Frédéric Bastien
Justin Bayer
Anatoly Belikov
+
Generalizable Features From Unsupervised Learning
2016
Mehdi Mirza
Aaron Courville
Yoshua Bengio
+
Asynchronous Methods for Deep Reinforcement Learning
2016
Volodymyr Mnih
AdriĂ PuigdomĂšnech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
+
PDF
Chat
EmoNets: Multimodal deep learning approaches for emotion recognition in video
2015
Samira Ebrahimi Kahou
Xavier Bouthillier
Pascal Lamblin
Ăaǧlar GĂŒlçehre
Vincent Michalski
Kishore Konda
SĂ©bastien Jean
Pierre Froumenty
Yann Dauphin
Nicolas Boulanger-Lewandowski
+
EmoNets: Multimodal deep learning approaches for emotion recognition in video
2015
Samira Ebrahimi Kahou
Xavier Bouthillier
Pascal Lamblin
Ăaǧlar GĂŒlçehre
Vincent Michalski
Kishore Konda
SĂ©bastien Jean
Pierre Froumenty
Yann Dauphin
Nicolas Boulanger-Lewandowski
+
PDF
Chat
Challenges in representation learning: A report on three machine learning contests
2014
Ian Goodfellow
Dumitru Erhan
Pierre Carrier
Aaron Courville
Mehdi Mirza
Ben Hamner
Will Cukierski
Yichuan Tang
David S. Thaler
DongâHyun Lee
+
Conditional Generative Adversarial Nets
2014
Mehdi Mirza
Simon Osindero
+
An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks
2014
Ian Goodfellow
Mehdi Mirza
Xiao Da
Aaron Courville
Yoshua Bengio
+
Pylearn2: a machine learning research library
2013
Ian Goodfellow
David Warde-Farley
Pascal Lamblin
Vincent Dumoulin
Mehdi Mirza
Razvan Pascanu
James Bergstra
Frédéric Bastien
Yoshua Bengio
+
Challenges in Representation Learning: A report on three machine learning contests
2013
Ian Goodfellow
Dumitru Erhan
Pierre Carrier
Aaron Courville
Mehdi Mirza
Ben Hamner
Will Cukierski
Yichuan Tang
David S. Thaler
DongâHyun Lee
+
PDF
Chat
Challenges in Representation Learning: A Report on Three Machine Learning Contests
2013
Ian Goodfellow
Dumitru Erhan
Pierre Carrier
Aaron Courville
Mehdi Mirza
Ben Hamner
Will Cukierski
Yichuan Tang
David S. Thaler
DongâHyun Lee
+
An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks
2013
Ian Goodfellow
Mehdi Mirza
Xiao Da
Aaron Courville
Yoshua Bengio
+
Maxout Networks
2013
Ian Goodfellow
David Warde-Farley
Mehdi Mirza
Aaron Courville
Yoshua Bengio
Common Coauthors
Coauthor
Papers Together
Yoshua Bengio
13
Aaron Courville
12
Ian Goodfellow
9
Timothy Lillicrap
8
David Warde-Farley
7
Greg Wayne
5
James Bergstra
5
Pascal Lamblin
4
David Silver
4
Arun Ahuja
4
Dumitru Erhan
4
Pascal Vincent
3
Jingjing Xie
3
Christopher Pal
3
Nicolas Heess
3
Yann N. Dauphin
3
SĂ©bastien Jean
3
Radu Tudor Ionescu
3
Chetan Ramaiah
3
Xiaojie Wang
3
Will Cukierski
3
Vincent Michalski
3
Xavier Bouthillier
3
Tim Harley
3
Arthur Guez
3
Théophane Weber
3
Bing Xu
3
Ruifan Li
3
Ćukasz Romaszko
3
DongâHyun Lee
3
Marius Popescu
3
Josh Abramson
3
Samira Ebrahimi Kahou
3
Cristian Grozea
3
Dimitris Athanasakis
3
Roland Memisevic
3
Yichuan Tang
3
Nicolas Boulanger-Lewandowski
3
David S. Thaler
3
Yingbo Zhou
3
Fangxiang Feng
3
Ăaǧlar GĂŒlçehre
3
Ben Hamner
3
Koray Kavukcuoglu
3
Pierre Carrier
3
John ShaweâTaylor
3
Maxim Milakov
3
Chia-Chun Hung
3
Frédéric Bastien
2
SĂ©bastien RacaniĂšre
2
Commonly Cited References
Action
Title
Year
Authors
# of times referenced
+
Improving neural networks by preventing co-adaptation of feature detectors
2012
Geoffrey E. Hinton
Nitish Srivastava
Alex Krizhevsky
Ilya Sutskever
Ruslan Salakhutdinov
7
+
Pylearn2: a machine learning research library
2013
Ian Goodfellow
David Warde-Farley
Pascal Lamblin
Vincent Dumoulin
Mehdi Mirza
Razvan Pascanu
James Bergstra
Frédéric Bastien
Yoshua Bengio
6
+
PDF
Chat
Deep Residual Learning for Image Recognition
2016
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
5
+
High-Dimensional Continuous Control Using Generalized Advantage Estimation
2015
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
4
+
Theano: new features and speed improvements
2012
Frédéric Bastien
Pascal Lamblin
Razvan Pascanu
James Bergstra
Ian J. Goodfellow
Arnaud Bergeron
Nicolas Bouchard
David Warde-Farley
Yoshua Bengio
4
+
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
2015
Sergey Ioffe
Christian Szegedy
4
+
PDF
Chat
Speech recognition with deep recurrent neural networks
2013
Alex Graves
Abdelrahman Mohamed
Geoffrey E. Hinton
4
+
Adam: A Method for Stochastic Optimization
2014
Diederik P. Kingma
Jimmy Ba
3
+
A guide to convolution arithmetic for deep learning
2016
Vincent Dumoulin
Francesco Visin
3
+
Constructing Hierarchical Image-tags Bimodal Representations for Word Tags Alternative Choice
2013
Fangxiang Feng
Ruifan Li
Xiaojie Wang
3
+
Emergence of Locomotion Behaviours in Rich Environments
2017
Nicolas Heess
Dhruva Tb
Sriram Srinivasan
Jay Lemmon
Josh Merel
Greg Wayne
Yuval Tassa
Tom Erez
Ziyu Wang
S. M. Ali Eslami
3
+
Deep Learning using Linear Support Vector Machines
2013
Yichuan Tang
3
+
Deep Generative Stochastic Networks Trainable by Backprop
2013
Yoshua Bengio
Ăric Thibodeau-Laufer
Guillaume Alain
Jason Yosinski
3
+
Asynchronous Methods for Deep Reinforcement Learning
2016
Volodymyr Mnih
AdriĂ PuigdomĂšnech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
3
+
Imagination-Augmented Agents for Deep Reinforcement Learning
2017
SĂ©bastien RacaniĂšre
Théophane Weber
David Reichert
Lars Buesing
Arthur Guez
Danilo Jimenez Rezende
AdriĂ PuigdomĂšnech Badia
Oriol Vinyals
Nicolas Heess
Yujia Li
3
+
Unsupervised Predictive Memory in a Goal-Directed Agent
2018
Greg Wayne
Chia-Chun Hung
Amos David
Mehdi Mirza
Arun Ahuja
Agnieszka GrabskaâBarwiĆska
Jack W. Rae
Piotr Mirowski
Joel Z. Leibo
Adam Santoro
3
+
PDF
Chat
The Arcade Learning Environment: An Evaluation Platform for General Agents
2013
Marc G. Bellemare
Yavar Naddaf
Joel Veness
Michael Bowling
3
+
Distributed Deep Q-Learning
2015
Hao Yi Ong
Kevin Chavez
Augustus Hong
2
+
Exploring Model-based Planning with Policy Networks
2020
Tingwu Wang
Jimmy Ba
2
+
Better Mixing via Deep Representations
2012
Yoshua Bengio
Grégoire Mesnil
Yann Dauphin
Salah Rifai
2
+
IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks
2019
Youngwoon Lee
Edward S. Hu
Zhengyu Yang
Alex Yin
Joseph J. Lim
2
+
PDF
Chat
dm_control: Software and tasks for continuous control
2020
Saran Tunyasuvunakool
Alistair Muldal
Yotam Doron
Siqi Liu
Steven Bohez
Josh Merel
Tom Erez
Timothy Lillicrap
Nicolas Heess
Yuval Tassa
2
+
PDF
Chat
Mastering Atari, Go, chess and shogi by planning with a learned model
2020
Julian Schrittwieser
Ioannis Antonoglou
Thomas Hubert
Karen Simonyan
Laurent Sifre
Simon Schmitt
Arthur Guez
Edward Lockhart
Demis Hassabis
Thore Graepel
2
+
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
2019
Abhishek Gupta
Vikash Kumar
Corey Lynch
Sergey Levine
Karol Hausman
2
+
Value Prediction Network
2017
Junhyuk Oh
Satinder Singh
Honglak Lee
2
+
PDF
Chat
Infinite-Horizon Policy-Gradient Estimation
2001
J. Baxter
Peter L. Bartlett
2
+
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
2015
Xingjian Shi
Zhourong Chen
Hao Wang
DitâYan Yeung
Wai Kin Wong
Wangâchun Woo
2
+
PDF
Chat
The Option-Critic Architecture
2017
PierreâLuc Bacon
Jean Harb
Doina Precup
2
+
PDF
Chat
ModDrop: Adaptive Multi-Modal Gesture Recognition
2015
Natalia Neverova
Christian Wolf
Graham W. Taylor
Florian Nebout
2
+
Hierarchical Visuomotor Control of Humanoids.
2018
Josh Merel
Arun Ahuja
Vu Pham
Saran Tunyasuvunakool
Siqi Liu
Dhruva Tirumala
Nicolas Heess
Greg Wayne
2
+
HOGWILD!: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent
2011
Feng Niu
Benjamin Recht
Christopher RĂ©
Stephen J. Wright
2
+
Representation Learning: A Review and New Perspectives
2012
Yoshua Bengio
Aaron Courville
Pascal Vincent
2
+
Reverse Curriculum Generation for Reinforcement Learning
2017
Carlos Florensa
David Held
Markus Wulfmeier
Michael Zhang
Pieter Abbeel
2
+
On the convergence of markovian stochastic algorithms with rapidly decreasing ergodicity rates
1999
Laurent Younes
2
+
Reinforcement Learning with Unsupervised Auxiliary Tasks
2016
Max Jaderberg
Volodymyr Mnih
Wojciech Marian Czarnecki
Tom Schaul
Joel Z. Leibo
David Silver
Koray Kavukcuoglu
2
+
Large-Scale Feature Learning With Spike-and-Slab Sparse Coding
2012
Ian Goodfellow
Aaron Courville
Yoshua Bengio
2
+
Massively Parallel Methods for Deep Reinforcement Learning
2015
Arun Sukumaran Nair
P. Srinivasan
Sam Blackwell
Cagdas Alcicek
Rory Fearon
Alessandro De Maria
Vedavyas Panneershelvam
Mustafa Suleyman
Charles Beattie
Stig Petersen
2
+
Trust Region Policy Optimization
2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
2
+
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation
2017
Ivaylo Popov
Nicolas Heess
Timothy Lillicrap
Roland Hafner
Gabriel Barth-Maron
Matej VecerĂk
Thomas Lampe
Yuval Tassa
Tom Erez
Martin Riedmiller
2
+
Been There, Done That: Meta-Learning with Episodic Recall
2018
Samuel Ritter
Jane X. Wang
Zeb KurthâNelson
Siddhant M. Jayakumar
Charles D. Blundell
Razvan Pascanu
Matthew Botvinick
2
+
PDF
Chat
Increasing the Action Gap: New Operators for Reinforcement Learning
2016
Marc G. Bellemare
Georg Ostrovski
Arthur Guez
Philip S. Thomas
RĂ©mi Munos
2
+
The role of spatio-temporal synchrony in the encoding of motion
2013
Kishore Konda
Roland Memisevic
Vincent Michalski
2
+
Neural Machine Translation by Jointly Learning to Align and Translate
2015
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
2
+
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
2018
Kurtland Chua
Roberto Calandra
Rowan McAllister
Sergey Levine
2
+
End-to-End Training of Deep Visuomotor Policies
2015
Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
1
+
PDF
Chat
The NumPy Array: A Structure for Efficient Numerical Computation
2011
Stéfan van der Walt
Steven C. Colbert
Gaël Varoquaux
1
+
Neural Machine Translation by Jointly Learning to Align and Translate
2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
1
+
One weird trick for parallelizing convolutional neural networks
2014
Alex Krizhevsky
1
+
A Convolutional Neural Network for Modelling Sentences
2014
Nal Kalchbrenner
Edward Grefenstette
Phil Blunsom
1
+
Action-Conditional Video Prediction using Deep Networks in Atari Games
2015
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
1