Mehdi Mirza

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat Generative Adversarial Networks 2022 Ian J. Goodfellow
Jean Pouget-Abadie
Mehdi Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
+ Evaluating model-based planning and planner amortization for continuous control 2021 Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
Mehdi Mirza
Alessandro Davide Ialongo
Yuval Tassa
Jost Tobias Springenberg
Abbas Abdolmaleki
Nicolas Heess
Josh Merel
+ PDF Chat Generative adversarial networks 2020 Ian Goodfellow
Jean Pouget-Abadie
Mehdi Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
+ Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban. 2020 PĂ©ter Karkus
Mehdi Mirza
Arthur Guez
Andrew Jaegle
Timothy Lillicrap
Lars Buesing
Nicolas Heess
Théophane Weber
+ Physically Embedded Planning Problems: New Challenges for Reinforcement Learning 2020 Mehdi Mirza
Andrew Jaegle
Jonathan J. Hunt
Arthur Guez
Saran Tunyasuvunakool
Alistair Muldal
Théophane Weber
PĂ©ter Karkus
SĂ©bastien RacaniĂšre
Lars Buesing
+ PDF Chat Optimizing agent behavior over long time scales by transporting value 2019 Chia-Chun Hung
Timothy Lillicrap
Josh Abramson
Yan Wu
Mehdi Mirza
Federico Carnevale
Arun Ahuja
Greg Wayne
+ An investigation of model-free planning 2019 Arthur Guez
Mehdi Mirza
Karol Gregor
Rishabh Kabra
SĂ©bastien RacaniĂšre
Théophane Weber
David Raposo
Adam Santoro
Laurent Orseau
Tom Eccles
+ Optimizing Agent Behavior over Long Time Scales by Transporting Value 2018 Chia-Chun Hung
Timothy Lillicrap
Josh Abramson
Yan Wu
Mehdi Mirza
Federico Carnevale
Arun Ahuja
Greg Wayne
+ Unsupervised Predictive Memory in a Goal-Directed Agent 2018 Greg Wayne
Chia-Chun Hung
Amos David
Mehdi Mirza
Arun Ahuja
Agnieszka Grabska‐BarwiƄska
Jack W. Rae
Piotr Mirowski
Joel Z. Leibo
Adam Santoro
+ Probing Physics Knowledge Using Tools from Developmental Psychology 2018 Luis Piloto
Ari Weinstein
Dhruva Tb
Arun Ahuja
Mehdi Mirza
Greg Wayne
Amos David
Chia-Chun Hung
Matthew Botvinick
+ Asynchronous Methods for Deep Reinforcement Learning 2016 Volodymyr Mnih
AdriĂ  PuigdomĂšnech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
+ Theano: A Python framework for fast computation of mathematical expressions 2016 The Theano Development Team
Rami Al‐Rfou
Guillaume Alain
Amjad Almahairi
Christof Angermueller
Dzmitry Bahdanau
Nicolas Ballas
Frédéric Bastien
Justin Bayer
Anatoly Belikov
+ Generalizable Features From Unsupervised Learning 2016 Mehdi Mirza
Aaron Courville
Yoshua Bengio
+ Asynchronous Methods for Deep Reinforcement Learning 2016 Volodymyr Mnih
AdriĂ  PuigdomĂšnech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
+ PDF Chat EmoNets: Multimodal deep learning approaches for emotion recognition in video 2015 Samira Ebrahimi Kahou
Xavier Bouthillier
Pascal Lamblin
Çaǧlar GĂŒlçehre
Vincent Michalski
Kishore Konda
SĂ©bastien Jean
Pierre Froumenty
Yann Dauphin
Nicolas Boulanger-Lewandowski
+ EmoNets: Multimodal deep learning approaches for emotion recognition in video 2015 Samira Ebrahimi Kahou
Xavier Bouthillier
Pascal Lamblin
Çaǧlar GĂŒlçehre
Vincent Michalski
Kishore Konda
SĂ©bastien Jean
Pierre Froumenty
Yann Dauphin
Nicolas Boulanger-Lewandowski
+ PDF Chat Challenges in representation learning: A report on three machine learning contests 2014 Ian Goodfellow
Dumitru Erhan
Pierre Carrier
Aaron Courville
Mehdi Mirza
Ben Hamner
Will Cukierski
Yichuan Tang
David S. Thaler
Dong‐Hyun Lee
+ Conditional Generative Adversarial Nets 2014 Mehdi Mirza
Simon Osindero
+ An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks 2014 Ian Goodfellow
Mehdi Mirza
Xiao Da
Aaron Courville
Yoshua Bengio
+ Pylearn2: a machine learning research library 2013 Ian Goodfellow
David Warde-Farley
Pascal Lamblin
Vincent Dumoulin
Mehdi Mirza
Razvan Pascanu
James Bergstra
Frédéric Bastien
Yoshua Bengio
+ Challenges in Representation Learning: A report on three machine learning contests 2013 Ian Goodfellow
Dumitru Erhan
Pierre Carrier
Aaron Courville
Mehdi Mirza
Ben Hamner
Will Cukierski
Yichuan Tang
David S. Thaler
Dong‐Hyun Lee
+ PDF Chat Challenges in Representation Learning: A Report on Three Machine Learning Contests 2013 Ian Goodfellow
Dumitru Erhan
Pierre Carrier
Aaron Courville
Mehdi Mirza
Ben Hamner
Will Cukierski
Yichuan Tang
David S. Thaler
Dong‐Hyun Lee
+ An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks 2013 Ian Goodfellow
Mehdi Mirza
Xiao Da
Aaron Courville
Yoshua Bengio
+ Maxout Networks 2013 Ian Goodfellow
David Warde-Farley
Mehdi Mirza
Aaron Courville
Yoshua Bengio
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ Improving neural networks by preventing co-adaptation of feature detectors 2012 Geoffrey E. Hinton
Nitish Srivastava
Alex Krizhevsky
Ilya Sutskever
Ruslan Salakhutdinov
7
+ Pylearn2: a machine learning research library 2013 Ian Goodfellow
David Warde-Farley
Pascal Lamblin
Vincent Dumoulin
Mehdi Mirza
Razvan Pascanu
James Bergstra
Frédéric Bastien
Yoshua Bengio
6
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
5
+ High-Dimensional Continuous Control Using Generalized Advantage Estimation 2015 John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
4
+ Theano: new features and speed improvements 2012 Frédéric Bastien
Pascal Lamblin
Razvan Pascanu
James Bergstra
Ian J. Goodfellow
Arnaud Bergeron
Nicolas Bouchard
David Warde-Farley
Yoshua Bengio
4
+ Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift 2015 Sergey Ioffe
Christian Szegedy
4
+ PDF Chat Speech recognition with deep recurrent neural networks 2013 Alex Graves
Abdelrahman Mohamed
Geoffrey E. Hinton
4
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
3
+ A guide to convolution arithmetic for deep learning 2016 Vincent Dumoulin
Francesco Visin
3
+ Constructing Hierarchical Image-tags Bimodal Representations for Word Tags Alternative Choice 2013 Fangxiang Feng
Ruifan Li
Xiaojie Wang
3
+ Emergence of Locomotion Behaviours in Rich Environments 2017 Nicolas Heess
Dhruva Tb
Sriram Srinivasan
Jay Lemmon
Josh Merel
Greg Wayne
Yuval Tassa
Tom Erez
Ziyu Wang
S. M. Ali Eslami
3
+ Deep Learning using Linear Support Vector Machines 2013 Yichuan Tang
3
+ Deep Generative Stochastic Networks Trainable by Backprop 2013 Yoshua Bengio
Éric Thibodeau-Laufer
Guillaume Alain
Jason Yosinski
3
+ Asynchronous Methods for Deep Reinforcement Learning 2016 Volodymyr Mnih
AdriĂ  PuigdomĂšnech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
3
+ Imagination-Augmented Agents for Deep Reinforcement Learning 2017 SĂ©bastien RacaniĂšre
Théophane Weber
David Reichert
Lars Buesing
Arthur Guez
Danilo Jimenez Rezende
AdriĂ  PuigdomĂšnech Badia
Oriol Vinyals
Nicolas Heess
Yujia Li
3
+ Unsupervised Predictive Memory in a Goal-Directed Agent 2018 Greg Wayne
Chia-Chun Hung
Amos David
Mehdi Mirza
Arun Ahuja
Agnieszka Grabska‐BarwiƄska
Jack W. Rae
Piotr Mirowski
Joel Z. Leibo
Adam Santoro
3
+ PDF Chat The Arcade Learning Environment: An Evaluation Platform for General Agents 2013 Marc G. Bellemare
Yavar Naddaf
Joel Veness
Michael Bowling
3
+ Distributed Deep Q-Learning 2015 Hao Yi Ong
Kevin Chavez
Augustus Hong
2
+ Exploring Model-based Planning with Policy Networks 2020 Tingwu Wang
Jimmy Ba
2
+ Better Mixing via Deep Representations 2012 Yoshua Bengio
Grégoire Mesnil
Yann Dauphin
Salah Rifai
2
+ IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks 2019 Youngwoon Lee
Edward S. Hu
Zhengyu Yang
Alex Yin
Joseph J. Lim
2
+ PDF Chat dm_control: Software and tasks for continuous control 2020 Saran Tunyasuvunakool
Alistair Muldal
Yotam Doron
Siqi Liu
Steven Bohez
Josh Merel
Tom Erez
Timothy Lillicrap
Nicolas Heess
Yuval Tassa
2
+ PDF Chat Mastering Atari, Go, chess and shogi by planning with a learned model 2020 Julian Schrittwieser
Ioannis Antonoglou
Thomas Hubert
Karen Simonyan
Laurent Sifre
Simon Schmitt
Arthur Guez
Edward Lockhart
Demis Hassabis
Thore Graepel
2
+ Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning 2019 Abhishek Gupta
Vikash Kumar
Corey Lynch
Sergey Levine
Karol Hausman
2
+ Value Prediction Network 2017 Junhyuk Oh
Satinder Singh
Honglak Lee
2
+ PDF Chat Infinite-Horizon Policy-Gradient Estimation 2001 J. Baxter
Peter L. Bartlett
2
+ Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting 2015 Xingjian Shi
Zhourong Chen
Hao Wang
Dit‐Yan Yeung
Wai Kin Wong
Wang‐chun Woo
2
+ PDF Chat The Option-Critic Architecture 2017 Pierre‐Luc Bacon
Jean Harb
Doina Precup
2
+ PDF Chat ModDrop: Adaptive Multi-Modal Gesture Recognition 2015 Natalia Neverova
Christian Wolf
Graham W. Taylor
Florian Nebout
2
+ Hierarchical Visuomotor Control of Humanoids. 2018 Josh Merel
Arun Ahuja
Vu Pham
Saran Tunyasuvunakool
Siqi Liu
Dhruva Tirumala
Nicolas Heess
Greg Wayne
2
+ HOGWILD!: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent 2011 Feng Niu
Benjamin Recht
Christopher RĂ©
Stephen J. Wright
2
+ Representation Learning: A Review and New Perspectives 2012 Yoshua Bengio
Aaron Courville
Pascal Vincent
2
+ Reverse Curriculum Generation for Reinforcement Learning 2017 Carlos Florensa
David Held
Markus Wulfmeier
Michael Zhang
Pieter Abbeel
2
+ On the convergence of markovian stochastic algorithms with rapidly decreasing ergodicity rates 1999 Laurent Younes
2
+ Reinforcement Learning with Unsupervised Auxiliary Tasks 2016 Max Jaderberg
Volodymyr Mnih
Wojciech Marian Czarnecki
Tom Schaul
Joel Z. Leibo
David Silver
Koray Kavukcuoglu
2
+ Large-Scale Feature Learning With Spike-and-Slab Sparse Coding 2012 Ian Goodfellow
Aaron Courville
Yoshua Bengio
2
+ Massively Parallel Methods for Deep Reinforcement Learning 2015 Arun Sukumaran Nair
P. Srinivasan
Sam Blackwell
Cagdas Alcicek
Rory Fearon
Alessandro De Maria
Vedavyas Panneershelvam
Mustafa Suleyman
Charles Beattie
Stig Petersen
2
+ Trust Region Policy Optimization 2015 John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
2
+ Data-efficient Deep Reinforcement Learning for Dexterous Manipulation 2017 Ivaylo Popov
Nicolas Heess
Timothy Lillicrap
Roland Hafner
Gabriel Barth-Maron
Matej VecerĂ­k
Thomas Lampe
Yuval Tassa
Tom Erez
Martin Riedmiller
2
+ Been There, Done That: Meta-Learning with Episodic Recall 2018 Samuel Ritter
Jane X. Wang
Zeb Kurth‐Nelson
Siddhant M. Jayakumar
Charles D. Blundell
Razvan Pascanu
Matthew Botvinick
2
+ PDF Chat Increasing the Action Gap: New Operators for Reinforcement Learning 2016 Marc G. Bellemare
Georg Ostrovski
Arthur Guez
Philip S. Thomas
RĂ©mi Munos
2
+ The role of spatio-temporal synchrony in the encoding of motion 2013 Kishore Konda
Roland Memisevic
Vincent Michalski
2
+ Neural Machine Translation by Jointly Learning to Align and Translate 2015 Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
2
+ Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models 2018 Kurtland Chua
Roberto Calandra
Rowan McAllister
Sergey Levine
2
+ End-to-End Training of Deep Visuomotor Policies 2015 Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
1
+ PDF Chat The NumPy Array: A Structure for Efficient Numerical Computation 2011 Stéfan van der Walt
Steven C. Colbert
Gaël Varoquaux
1
+ Neural Machine Translation by Jointly Learning to Align and Translate 2014 Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
1
+ One weird trick for parallelizing convolutional neural networks 2014 Alex Krizhevsky
1
+ A Convolutional Neural Network for Modelling Sentences 2014 Nal Kalchbrenner
Edward Grefenstette
Phil Blunsom
1
+ Action-Conditional Video Prediction using Deep Networks in Atari Games 2015 Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
1