Human-centric Dialog Training via Offline Reinforcement Learning

Type: Preprint

Publication Date: 2020-01-01

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2010.05848

Locations

  • arXiv (Cornell University)
  • DSpace@MIT (Massachusetts Institute of Technology)
  • DataCite API

Similar Works

Action Title Year Authors
+ Human-centric Dialog Training via Offline Reinforcement Learning 2020 Natasha Jaques
Judy Hanwen Shen
Asma Ghandeharioun
Craig Ferguson
Àgata Lapedriza
N. S. Jones
Shixiang Gu
Rosalind W. Picard
+ Human-centric dialog training via offline reinforcement learning 2020 Natasha Jaques
Judy Hanwen Shen
Asma Ghandeharioun
Craig Ferguson
Àgata Lapedriza
Noah Jones
Shixiang Gu
Rosalind W. Picard
+ Hierarchical Reinforcement Learning for Open-Domain Dialog 2020 Abdelrhman Saleh
Natasha Jaques
Asma Ghandeharioun
Judy Hanwen Shen
Rosalind W. Picard
+ Hierarchical Reinforcement Learning for Open-Domain Dialog 2019 Abdelrhman Saleh
Natasha Jaques
Asma Ghandeharioun
Judy Hanwen Shen
Rosalind W. Picard
+ Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog 2019 Natasha Jaques
Asma Ghandeharioun
Judy Hanwen Shen
Craig Ferguson
Àgata Lapedriza
Noah Jones
Shixiang Gu
Rosalind W. Picard
+ Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk 2024 Dennis Ulmer
Elman Mansimov
Kaixiang Lin
Justin Sun
Xibin Gao
Yi Zhang
+ Learning to Dialogue via Complex Hindsight Experience Replay. 2018 Keting Lu
Shiqi Zhang
Xiaoping Chen
+ On the Effectiveness of Offline RL for Dialogue Response Generation 2023 Paloma Sodhi
Felix Wu
Ethan R. Elenberg
Kilian Q. Weinberger
Ryan McDonald
+ Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization 2023 Zhoujian Sun
Chenyang Zhao
Zhengxing Huang
Nai Ding
+ Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations 2024 Joey Hong
Jessica Lin
Anca D. Dragan
Sergey Levine
+ Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management 2023 Dhawal Gupta
Yinlam Chow
Mohammad Ghavamzadeh
Craig Boutilier
+ Sample-efficient Deep Reinforcement Learning for Dialog Control 2016 Kavosh Asadi
J. D. Williams
+ Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems 2020 Ziming Li
Julia Kiseleva
Maarten de Rijke
+ CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning 2022 Siddharth Verma
Justin Fu
Mengjiao Yang
Sergey Levine
+ Iterative policy learning in end-to-end trainable task-oriented neural dialog models 2017 Bing Liu
Ian Lane
+ Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models 2017 Bing Liu
Ian Lane
+ Online Sequence-to-Sequence Reinforcement Learning for Open-Domain Conversational Agents 2016 Nabiha Asghar
Pascal Poupart
Jiang Xin
Hang Li
+ Learning from Naturally Occurring Feedback 2024 Shachar Don-Yehiya
Leshem Choshen
Omri Abend
+ NaRLE: Natural Language Models using Reinforcement Learning with Emotion Feedback 2021 R Zhou
Soham Deshmukh
Jeremiah Greer
Charles Lee
+ A Deep Reinforcement Learning Chatbot (Short Version) 2018 Iulian Vlad Serban
Chinnadhurai Sankar
Mathieu Germain
Saizheng Zhang
Zhouhan Lin
Sandeep Subramanian
Taesup Kim
Michael Pieper
Sarath Chandar
Nan Rosemary Ke

Works That Cite This (0)

Works Cited by This (0)
