Anna Harutyunyan

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat Three Dogmas of Reinforcement Learning 2024 David Abel
Mark K. Ho
Anna Harutyunyan
+ An Analysis of Quantile Temporal-Difference Learning 2023 Mark Rowland
Rémi Munos
Mohammad Gheshlaghi Azar
Yunhao Tang
Georg Ostrovski
Anna Harutyunyan
Karl Tuyls
Marc G. Bellemare
Will Dabney
+ DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm 2023 Yunhao Tang
Tadashi Kozuno
Mark Rowland
Anna Harutyunyan
Rémi Munos
Bernardo Ávila Pires
Michal Vaľko
+ Bootstrapped Representations in Reinforcement Learning 2023 Charline Le Lan
Stephen Tu
Mark Rowland
Anna Harutyunyan
Rishabh Agarwal
Marc G. Bellemare
Will Dabney
+ On the Expressivity of Markov Reward 2021 David Abel
Will Dabney
Anna Harutyunyan
Mark K. Ho
Michael L. Littman
Doina Precup
Satinder Singh
+ Model-Free Counterfactual Credit Assignment 2021 Thomas Mesnard
Théophane Weber
Fabio Viola
Shantanu Thakoor
Alaa Saade
Anna Harutyunyan
Will Dabney
Tom Stepleton
Nicolas Heess
Marcus Hütter
+ On the Expressivity of Markov Reward 2021 David Abel
Will Dabney
Anna Harutyunyan
Mark K. Ho
Michael L. Littman
Doina Precup
Satinder Singh
+ Counterfactual Credit Assignment in Model-Free Reinforcement Learning 2020 Thomas Mesnard
Théophane Weber
Fabio Viola
Shantanu Thakoor
Alaa Saade
Anna Harutyunyan
Will Dabney
Tom Stepleton
Nicolas Heess
Arthur Guez
+ Useful Policy Invariant Shaping from Arbitrary Advice 2020 Paniz Behboudian
Yash Satsangi
Matthew E. Taylor
Anna Harutyunyan
Michael Bowling
+ Counterfactual Credit Assignment in Model-Free Reinforcement Learning 2020 Thomas Mesnard
Théophane Weber
Fabio Viola
Shantanu Thakoor
Alaa Saade
Anna Harutyunyan
Will Dabney
Tom Stepleton
Nicolas Heess
Arthur Guez
+ Useful Policy Invariant Shaping from Arbitrary Advice 2020 Paniz Behboudian
Yash Satsangi
Matthew E. Taylor
Anna Harutyunyan
Michael Bowling
+ Hindsight Credit Assignment 2019 Anna Harutyunyan
Will Dabney
Thomas Mesnard
Mohammad Gheshlaghi Azar
Bilal Piot
Nicolas Heess
Hado P. van Hasselt
Gregory Wayne
Satinder Singh
Doina Precup
+ Hindsight Credit Assignment 2019 Anna Harutyunyan
Will Dabney
Thomas Mesnard
Mohammad Gheshlaghi Azar
Bilal Piot
Nicolas Heess
Hado van Hasselt
Greg Wayne
Satinder Singh
Doina Precup
+ Conditional Importance Sampling for Off-Policy Learning 2019 Mark Rowland
Anna Harutyunyan
Hado van Hasselt
Diana Borsa
Tom Schaul
Rémi Munos
Will Dabney
+ The Termination Critic. 2019 Anna Harutyunyan
Will Dabney
Diana Borsa
Nicolas Heess
Rémi Munos
Doina Precup
+ Hindsight Credit Assignment 2019 Anna Harutyunyan
Will Dabney
Thomas Mesnard
Mohammad Gheshlaghi Azar
Bilal Piot
Nicolas Heess
Hado van Hasselt
Greg Wayne
Satinder Singh
Doina Precup
+ Conditional Importance Sampling for Off-Policy Learning 2019 Mark Rowland
Anna Harutyunyan
Hado van Hasselt
Diana Borsa
Tom Schaul
Rémi Munos
Will Dabney
+ The Termination Critic 2019 Anna Harutyunyan
Will Dabney
Diana Borsa
Nicolas Heess
Rémi Munos
Doina Precup
+ PDF Chat Reinforcement Learning in POMDPs With Memoryless Options and Option-Observation Initiation Sets 2018 Denis Steckelmacher
Diederik M. Roijers
Anna Harutyunyan
Peter Vrancx
Hélène Plisnier
Ann Nowé
+ PDF Chat Learning With Options That Terminate Off-Policy 2018 Anna Harutyunyan
Peter Vrancx
Pierre-Luc Bacon
Doina Precup
Ann Nowé
+ Learning with Options that Terminate Off-Policy 2017 Anna Harutyunyan
Peter Vrancx
Pierre‐Luc Bacon
Doina Precup
Ann Nowé
+ Reinforcement Learning in POMDPs with Memoryless Options and Option-Observation Initiation Sets 2017 Denis Steckelmacher
Diederik M. Roijers
Anna Harutyunyan
Peter Vrancx
Hélène Plisnier
Ann Nowé
+ Safe and efficient off-policy reinforcement learning 2016 Rémi Munos
Thomas Stepleton
Anna Harutyunyan
Marc G. Bellemare
+ Q($\lambda$) with Off-Policy Corrections 2016 Anna Harutyunyan
Marc G. Bellemare
Tom Stepleton
Rémi Munos
+ PDF Chat Q($$\lambda $$) with Off-Policy Corrections 2016 Anna Harutyunyan
Marc G. Bellemare
Tom Stepleton
Rémi Munos
+ Safe and Efficient Off-Policy Reinforcement Learning 2016 Rémi Munos
Tom Stepleton
Anna Harutyunyan
Marc G. Bellemare
+ Q($λ$) with Off-Policy Corrections 2016 Anna Harutyunyan
Marc G. Bellemare
Tom Stepleton
Rémi Munos
+ Off-Policy Reward Shaping with Ensembles 2015 Anna Harutyunyan
Tim Brys
Peter Vrancx
Ann Nowé
+ Off-Policy Shaping Ensembles in Reinforcement Learning 2014 Anna Harutyunyan
Tim Brys
Peter Vrancx
Ann Nowé
+ Off-Policy Shaping Ensembles in Reinforcement Learning 2014 Anna Harutyunyan
Tim Brys
Peter Vrancx
Ann Nowé
+ Boundary-to-boundary flows in planar graphs 2013 Glencora Borradaile
Anna Harutyunyan
+ Maximum st-flow in directed planar graphs via shortest paths 2013 Glencora Borradaile
Anna Harutyunyan
+ PDF Chat Boundary-to-Boundary Flows in Planar Graphs 2013 Glencora Borradaile
Anna Harutyunyan
+ PDF Chat Maximum st-Flow in Directed Planar Graphs via Shortest Paths 2013 Glencora Borradaile
Anna Harutyunyan
+ Boundary-to-boundary flows in planar graphs 2013 Glencora Borradaile
Anna Harutyunyan
+ Maximum st-flow in directed planar graphs via shortest paths 2013 Glencora Borradaile
Anna Harutyunyan
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ Asynchronous Methods for Deep Reinforcement Learning 2016 Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
5
+ PDF Chat The Option-Critic Architecture 2017 Pierre‐Luc Bacon
Jean Harb
Doina Precup
3
+ Emphatic Temporal-Difference Learning 2015 Ashique Rupam Mahmood
Huizhen Yu
Martha White
Richard S. Sutton
3
+ Generalized emphatic temporal difference learning: bias-variance analysis 2016 Assaf Hallak
Aviv Tamar
Rémi Munos
Shie Mannor
2
+ PDF Chat Maximum st-Flow in Directed Planar Graphs via Shortest Paths 2013 Glencora Borradaile
Anna Harutyunyan
2
+ Trust Region Policy Optimization 2015 John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
2
+ PDF Chat The Arcade Learning Environment: An Evaluation Platform for General Agents 2013 Marc G. Bellemare
Yavar Naddaf
Joel Veness
Michael Bowling
2
+ Learning continuous control policies by stochastic value gradients 2015 Nicolas Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
2
+ PDF Chat Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis 2016 Assaf Hallak
Aviv Tamar
Rémi Munos
Shie Mannor
2
+ Prioritized Experience Replay 2015 Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
2
+ Safe and Efficient Off-Policy Reinforcement Learning 2016 Rémi Munos
Tom Stepleton
Anna Harutyunyan
Marc G. Bellemare
2
+ Reinforcement Learning Neural Turing Machines 2015 Wojciech Zaremba
Ilya Sutskever
1
+ PDF Chat Efficient Planning under Uncertainty with Macro-actions 2011 Ran He
Emma Brunskill
Nicholas Roy
1
+ The optimal reward baseline for gradient-based reinforcement learning 2001 Lex Weaver
Nigel Tao
1
+ Asynchronous Methods for Deep Reinforcement Learning 2016 Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
1
+ Q($\lambda$) with Off-Policy Corrections 2016 Anna Harutyunyan
Marc G. Bellemare
Tom Stepleton
Rémi Munos
1
+ A Deep Hierarchical Approach to Lifelong Learning in Minecraft 2016 Chen Tessler
Shahar Givony
Tom Zahavy
Daniel J. Mankowitz
Shie Mannor
1
+ Safe and efficient off-policy reinforcement learning 2016 Rémi Munos
Thomas Stepleton
Anna Harutyunyan
Marc G. Bellemare
1
+ Strategic Attentive Writer for Learning Macro-Actions 2016 Alexander -
Vezhnevets
Volodymyr Mnih
John Agapiou
Simon Osindero
Alex Graves
Oriol Vinyals
Koray Kavukcuoglu
1
+ Reinforcement Learning with Unsupervised Auxiliary Tasks 2016 Max Jaderberg
Volodymyr Mnih
Wojciech Marian Czarnecki
Tom Schaul
Joel Z. Leibo
David Silver
Koray Kavukcuoglu
1
+ Variational Intrinsic Control 2016 Karol Gregor
Danilo Jimenez Rezende
Daan Wierstra
1
+ A Matrix Splitting Perspective on Planning with Options 2016 Pierre‐Luc Bacon
Doina Precup
1
+ PDF Chat Multi-Step Reinforcement Learning: A Unifying Algorithm 2018 Kristopher De Asis
Juan Hernandez-Garcia
Gerhard Holland
Richard S. Sutton
1
+ PDF Chat Adversarial Discriminative Domain Adaptation 2017 Eric Tzeng
Judy Hoffman
Kate Saenko
Trevor Darrell
1
+ Convergent Tree Backup and Retrace with Function Approximation 2017 Abdelaziz Touati
Pierre‐Luc Bacon
Doina Precup
Pascal Vincent
1
+ Hindsight policy gradients 2017 Paulo Rauber
Filipe Mutz
Juergen Schmidhuber
1
+ Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines 2018 Cathy Wu
Aravind Rajeswaran
Yan Duan
Vikash Kumar
Alexandre M. Bayen
Sham M. Kakade
Igor Mordatch
Pieter Abbeel
1
+ Model-Based Planning with Discrete and Continuous Actions 2017 Mikael Henaff
WILLIAM F. WHITNEY
Yann LeCun
1
+ Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings 2018 John D. Co-Reyes
YuXuan Liu
Abhishek Gupta
Benjamin Eysenbach
Pieter Abbeel
Sergey Levine
1
+ Representation Learning with Contrastive Predictive Coding 2018 Aäron van den Oord
Yazhe Li
Oriol Vinyals
1
+ Policy optimization via importance sampling 2018 Alberto Maria Metelli
Matteo Papini
Francesco Faccio
Marcello Restelli
1
+ Optimizing Agent Behavior over Long Time Scales by Transporting Value 2018 Chia-Chun Hung
Timothy Lillicrap
Josh Abramson
Yan Wu
Mehdi Mirza
Federico Carnevale
Arun Ahuja
Greg Wayne
1
+ Model-Based Reinforcement Learning for Atari 2019 Łukasz Kaiser
Mohammad Babaeizadeh
Piotr Miłoś
Błażej Osiński
Roy H. Campbell
Konrad Czechowski
Dumitru Erhan
Chelsea Finn
Piotr Kozakowski
Sergey Levine
1
+ Credit Assignment Techniques in Stochastic Computation Graphs. 2019 Théophane Weber
Nicolas Heess
Lars Buesing
David Silver
1
+ Gradient Estimation Using Stochastic Computation Graphs 2015 John Schulman
Nicolas Heess
Théophane Weber
Pieter Abbeel
1
+ Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models 2019 Michael Oberst
David Sontag
1
+ Off-Policy Actor-Critic 2012 Thomas Degris
Martha White
Richard S. Sutton
1
+ Benchmarking Model-Based Reinforcement Learning 2019 Tingwu Wang
Xuchan Bao
Ignasi Clavera
Jerrick Hoang
Yeming Wen
Eric Langlois
Shunshi Zhang
Guodong Zhang
Pieter Abbeel
Jimmy Ba
1
+ Doubly Robust Off-policy Value Evaluation for Reinforcement Learning 2015 Nan Jiang
Lihong Li
1
+ Hindsight policy gradients 2017 Paulo Rauber
Avinash Ummadisingu
Filipe Mutz
Jürgen Schmidhuber
1
+ PDF Chat When Waiting Is Not an Option: Learning Options With a Deliberation Cost 2018 Jean Harb
Pierre‐Luc Bacon
Martin Klissarov
Doina Precup
1
+ Meta Learning Shared Hierarchies 2017 Kevin Frans
Jonathan Ho
Xi Chen
Pieter Abbeel
John Schulman
1
+ Unifying task specification in reinforcement learning 2016 Martha White
1
+ Attention is All you Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
1
+ Diversity is All You Need: Learning Skills without a Reward Function 2018 Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
1
+ Distributional Reinforcement Learning With Quantile Regression 2017 Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos
1
+ Continuous control with deep reinforcement learning 2016 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
1
+ Importance Sampling Policy Evaluation with an Estimated Behavior Policy 2018 Josiah P. Hanna
Scott Niekum
Peter Stone
1
+ Recall Traces: Backtracking Models for Efficient Reinforcement Learning 2018 Anirudh Goyal
Philémon Brakel
William Fedus
Soumye Singhal
Timothy Lillicrap
Sergey Levine
Hugo Larochelle
Yoshua Bengio
1
+ PDF Chat A Deep Hierarchical Approach to Lifelong Learning in Minecraft 2017 Chen Tessler
Shahar Givony
Tom Zahavy
Daniel J. Mankowitz
Shie Mannor
1