Johan Ferret

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat Humanity's Last Exam 2025 Long Phan
Alice Gatti
Ziwen Han
Nathaniel Li
Josephina Hu
Hugh Zhang
Shuangshuang Shi
Michael Y. Choi
Arjun Agrawal
Asmita Chopra
+ PDF Chat Diversity-Rewarded CFG Distillation 2024 Geoffrey Cideron
Andrea Agostinelli
Johan Ferret
Sertan Girgin
Romuald Élie
Olivier Bachem
Sarah Perrin
Alexandre Ramé
+ PDF Chat Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL 2024 Eduardo Pignatelli
Johan Ferret
Tim Rockäschel
Edward Grefenstette
Davide Paglieri
Samuel Coward
Laura Toni
+ PDF Chat Gemma 2: Improving Open Language Models at a Practical Size 2024 Gemma Team
Morgane Rivière
Shreya Pathak
Pier Giuseppe Sessa
Cassidy Hardin
Surya Bhupatiraju
Léonard Hussenot
Thomas Mesnard
Bobak Shahriari
Alexandre Ramé
+ PDF Chat Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning 2024 Kaiwen Wang
Rahul Kidambi
Ryan Sullivan
A. Agarwal
Christoph Dann
Andrea Michi
Marco Gelmi
Yunxuan Li
Raghav Gupta
Avinava Dubey
+ PDF Chat BOND: Aligning LLMs with Best-of-N Distillation 2024 Pier Giuseppe Sessa
Robert Dadashi
Léonard Hussenot
Johan Ferret
Nino Vieillard
Alexandre Ramé
Bobak Shariari
Sarah Perrin
Abe Friesen
Geoffrey Cideron
+ PDF Chat WARP: On the Benefits of Weight Averaged Rewarded Policies 2024 Alexandre Ramé
Johan Ferret
Nino Vieillard
Robert Dadashi
Léonard Hussenot
Pierre-Louis Cedoz
Pier Giuseppe Sessa
Sertan Girgin
Arthur Douillard
Olivier Bachem
+ PDF Chat RecurrentGemma: Moving Past Transformers for Efficient Open Language Models 2024 Aleksandar Botev
Soham De
Samuel Smith
Anushan Fernando
George-Cristian Muraru
Ruba Haroun
Leonard Berrada
Razvan Pascanu
Pier Giuseppe Sessa
Robert Dadashi
+ PDF Chat Gemma: Open Models Based on Gemini Research and Technology 2024 Gemma Team
Thomas Mesnard
Cassidy Hardin
Robert Dadashi
Surya Bhupatiraju
Shreya Pathak
Laurent Sifre
Morgane Rivière
Mihir Kale
Juliette Love
+ PDF Chat Direct Language Model Alignment from Online AI Feedback 2024 Shangmin Guo
Biao Zhang
Tianlin Liu
Tianqi Liu
Misha Khalman
Felipe Llinares
Alexandre Ramé
Thomas Mesnard
Yao Zhao
Bilal Piot
+ WARM: On the Benefits of Weight Averaged Reward Models 2024 Alexandre Ramé
Nino Vieillard
Léonard Hussenot
Robert Dadashi
Geoffrey Cideron
Olivier Bachem
Johan Ferret
+ Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback 2023 Paul Roit
Johan Ferret
Lior Shani
Roee Aharoni
Geoffrey Cideron
Robert Dadashi
Matthieu Geist
Sertan Girgin
Léonard Hussenot
Orgad Keller
+ Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback 2023 Paul Roit
Johan Ferret
Lior Shani
Roee Aharoni
Geoffrey Cideron
Robert Dadashi
Matthieu Geist
Sertan Girgin
Léonard Hussenot
Orgad Keller
+ A Survey of Temporal Credit Assignment in Deep Reinforcement Learning 2023 Eduardo Pignatelli
Johan Ferret
Matthieu Geist
Thomas Mesnard
Hado van Hasselt
Laura Toni
+ Gemini: A Family of Highly Capable Multimodal Models 2023 Gemini Team
Rohan Anil
Sebastian Borgeaud
Jean-Baptiste Alayrac
Jiahui Yu
Radu Soricut
Johan Schalkwyk
Andrew M. Dai
Anja Hauth
Katie Millican
+ Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act 2022 Alexis Jacq
Johan Ferret
Olivier Pietquin
Matthieu Geist
+ PDF Chat There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning 2021 Nathan Grinsztajn
Johan Ferret
Olivier Pietquin
Pierre‐Marie Preux
Matthieu Geist
+ More Efficient Exploration with Symbolic Priors on Action Sequence Equivalences 2021 Toby Johnstone
Nathan Grinsztajn
Johan Ferret
Pierre‐Marie Preux
+ PDF Chat Adversarially Guided Actor-Critic 2021 Yannis Flet-Berliac
Johan Ferret
Olivier Pietquin
Pierre‐Marie Preux
Matthieu Geist
+ PDF Chat Adversarially Guided Actor-Critic 2021 Yannis Flet-Berliac
Johan Ferret
Olivier Pietquin
Pierre‐Marie Preux
Matthieu Geist
+ PDF Chat Self-Imitation Advantage Learning 2021 Johan Ferret
Olivier Pietquin
Matthieu Geist
+ Adversarially Guided Actor-Critic 2021 Yannis Flet-Berliac
Johan Ferret
Olivier Pietquin
Pierre‐Marie Preux
Matthieu Geist
+ More Efficient Exploration with Symbolic Priors on Action Sequence Equivalences 2021 Toby Johnstone
Nathan Grinsztajn
Johan Ferret
Pierre‐Marie Preux
+ There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning 2021 Nathan Grinsztajn
Johan Ferret
Olivier Pietquin
Pierre‐Marie Preux
Matthieu Geist
+ Self-Attentional Credit Assignment for Transfer in Reinforcement Learning 2020 Johan Ferret
Raphaël Marinier
Matthieu Geist
Olivier Pietquin
+ Self-Imitation Advantage Learning 2020 Johan Ferret
Olivier Pietquin
Matthieu Geist
+ Acme: A Research Framework for Distributed Reinforcement Learning 2020 Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
Danila Sinopalnikov
Piotr Stańczyk
Sabela Ramos
Anton Raichuk
Damien Vincent
+ Credit Assignment as a Proxy for Transfer in Reinforcement Learning. 2019 Johan Ferret
Raphaël Marinier
Matthieu Geist
Olivier Pietquin
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ Proximal Policy Optimization Algorithms 2017 John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
5
+ Munchausen Reinforcement Learning 2020 Nino Vieillard
Olivier Pietquin
Matthieu Geist
4
+ Exploration by Random Network Distillation 2018 Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
4
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
4
+ Go-Explore: a New Approach for Hard-Exploration Problems 2019 Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
4
+ Noisy Networks For Exploration 2018 Meire Fortunato
Mohammad Gheshlaghi Azar
Bilal Piot
Jacob Menick
Ian Osband
Alexander Graves
Vlad Mnih
Rémi Munos
Demis Hassabis
Olivier Pietquin
3
+ Episodic Curiosity through Reachability 2018 Nikolay Savinov
Anton Raichuk
Raphaël Marinier
Damien Vincent
Marc Pollefeys
Timothy Lillicrap
Sylvain Gelly
3
+ Generative Adversarial Imitation Learning 2016 Jonathan Ho
Stefano Ermon
3
+ A Theory of Regularized Markov Decision Processes 2019 Matthieu Geist
Bruno Scherrer
Olivier Pietquin
3
+ Illuminating Generalization in Deep Reinforcement Learning through Procedural Level Generation 2018 Niels Justesen
Rubén Rodríguez Torrado
Philip Bontrager
Ahmed Khalifa
Julian Togelius
Sebastian Risi
2
+ PDF Chat Self-Imitation Advantage Learning 2021 Johan Ferret
Olivier Pietquin
Matthieu Geist
2
+ Connecting Generative Adversarial Networks and Actor-Critic Methods 2016 David Pfau
Oriol Vinyals
2
+ Only Relevant Information Matters: Filtering Out Noisy Samples To Boost RL 2020 Yannis Flet-Berliac
Pierre‐Marie Preux
2
+ Towards Principled Methods for Training Generative Adversarial Networks 2017 Martín Arjovsky
Léon Bottou
2
+ Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration 2020 Seungyul Han
Youngchul Sung
2
+ PDF Chat The Arcade Learning Environment: An Evaluation Platform for General Agents 2013 Marc G. Bellemare
Yavar Naddaf
Joel Veness
Michael Bowling
2
+ Self-Attentional Credit Assignment for Transfer in Reinforcement Learning 2020 Johan Ferret
Raphaël Marinier
Matthieu Geist
Olivier Pietquin
2
+ Asynchronous Methods for Deep Reinforcement Learning 2016 Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
2
+ Concrete Problems in AI Safety 2016 Dario Amodei
Chris Olah
Jacob Steinhardt
Paul F. Christiano
John Schulman
Dan Mané
2
+ Distributional Smoothing with Virtual Adversarial Training 2016 Takeru Miyato
Shin‐ichi Maeda
Masanori Koyama
Ken Nakae
Shin Ishii
2
+ Generalization and Regularization in DQN 2018 Jesse Farebrother
Marlos C. Machado
Michael Bowling
2
+ PDF Chat Attention Augmented Convolutional Networks 2019 Irwan Bello
Barret Zoph
Quoc V. Le
Ashish Vaswani
Jonathon Shlens
2
+ Observational Overfitting in Reinforcement Learning 2019 Xingyou Song
Yiding Jiang
Yilun Du
Behnam Neyshabur
2
+ Diversity is All You Need: Learning Skills without a Reward Function 2018 Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
2
+ PDF Chat Learning Value Functions in Deep Policy Gradients using Residual Variance 2021 Yannis Flet-Berliac
Reda Ouhamma
Odalric-Ambrym Maillard
Pierre‐Marie Preux
2
+ DARLA: Improving Zero-Shot Transfer in Reinforcement Learning 2017 Irina Higgins
Arka Pal
Andrei A. Rusu
Löıc Matthey
Christopher Burgess
Alexander Pritzel
Matthew Botvinick
Charles Blundell
Alexander Lerchner
2
+ Leverage the Average: an Analysis of Regularization in RL 2020 Nino Vieillard
Tadashi Kozuno
Bruno Scherrer
Olivier Pietquin
Rémi Munos
Matthieu Geist
2
+ Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) 2015 Djork-Arné Clevert
Thomas Unterthiner
Sepp Hochreiter
2
+ Safe and Efficient Off-Policy Reinforcement Learning 2016 Rémi Munos
Tom Stepleton
Anna Harutyunyan
Marc G. Bellemare
2
+ Understanding the impact of entropy on policy optimization 2018 Zafarali Ahmed
Nicolas Le Roux
Mohammad Norouzi
Dale Schuurmans
2
+ Deep Recurrent Q-Learning for Partially Observable MDPs 2015 Matthew Hausknecht
Peter Stone
2
+ High-Dimensional Continuous Control Using Generalized Advantage Estimation 2015 John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
2
+ Sample Efficient Actor-Critic with Experience Replay 2016 Ziyu Wang
Victor Bapst
Nicolas Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
2
+ A Study on Overfitting in Deep Reinforcement Learning 2018 Chiyuan Zhang
Oriol Vinyals
Rémi Munos
Samy Bengio
2
+ Continuous control with deep reinforcement learning 2016 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
2
+ A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning 2018 Amy Zhang
Nicolas Ballas
Joëlle Pineau
2
+ Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks 2017 Chelsea Finn
Pieter Abbeel
Sergey Levine
2
+ Adversarial Machine Learning at Scale 2016 Alexey Kurakin
Ian Goodfellow
Samy Bengio
2
+ Agent57: Outperforming the Atari Human Benchmark 2020 Adrià Puigdomènech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
2
+ PDF Chat ViZDoom: A Doom-based AI research platform for visual reinforcement learning 2016 Michał Kempka
Marek Wydmuch
Grzegorz Runc
Jakub Toczek
Wojciech Jaśkowski
2
+ IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures 2018 Lasse Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymir Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
1
+ Maximum a Posteriori Policy Optimisation 2018 Abbas Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
Nicolas Heess
Martin Riedmiller
1
+ Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening 2016 Frank He
Yang Liu
Alexander G. Schwing
Jian Peng
1
+ AI Safety Gridworlds 2017 Jan Leike
Miljan Martic
Victoria Krakovna
Pedro A. Ortega
Tom Everitt
Andrew Lefrancq
Laurent Orseau
Shane Legg
1
+ Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation 2017 Yuhuai Wu
Elman Mansimov
S. Matthew Liao
Roger Grosse
Jimmy Ba
1
+ PDF Chat Self-Supervised Video Representation Learning with Odd-One-Out Networks 2017 Basura Fernando
Hakan Bilen
Efstratios Gavves
Stephen Jay Gould
1
+ Deep reinforcement learning with double Q-Learning 2016 Hado van Hasselt
Arthur Guez
David Silver
1
+ Imagination-Augmented Agents for Deep Reinforcement Learning 2017 Théophane Weber
Sébastien Racanière
David Reichert
Lars Buesing
Arthur Guez
Danilo Jimenez Rezende
Adrià Puigdomènech Badia
Oriol Vinyals
Nicolas Heess
Yujia Li
1
+ Learning to reinforcement learn 2016 Jane X. Wang
Zeb Kurth‐Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z. Leibo
Rémi Munos
Charles Blundell
Dharshan Kumaran
Matt Botvinick
1
+ Provably Efficient Maximum Entropy Exploration 2018 Elad Hazan
Sham M. Kakade
Karan Singh
Abby Van Soest
1