Tianyu Gao

Follow

Generating author description...

Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ A Survey of Available Corpora for Building Data-Driven Dialogue Systems 2015 Iulian Vlad Serban
Ryan Lowe
Peter Henderson
Laurent Charlin
Joëlle Pineau
1
+ Proximal Policy Optimization Algorithms 2017 John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
1
+ Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog 2019 Natasha Jaques
Asma Ghandeharioun
Judy Hanwen Shen
Craig Ferguson
Àgata Lapedriza
Noah Jones
Shixiang Gu
Rosalind W. Picard
1
+ Learning from Dialogue after Deployment: Feed Yourself, Chatbot! 2019 Braden Hancock
Antoine Bordes
Pierre-Emmanuel Mazaré
Jason Weston
1
+ ParlAI: A Dialog Research Software Platform 2017 Alexander Miller
Will Feng
Dhruv Batra
Antoine Bordes
Adam Fisch
Jiasen Lu
Devi Parikh
Jason Weston
1
+ Neural Text Generation with Unlikelihood Training 2019 Sean Welleck
Ilia Kulikov
Stephen Roller
Emily Dinan
Kyunghyun Cho
Jason Weston
1
+ Better Rewards Yield Better Summaries: Learning to Summarise Without References 2019 Florian Böhm
Yang Gao
Christian M. Meyer
Ori Shapira
Ido Dagan
Iryna Gurevych
1
+ Build it Break it Fix it for Dialogue Safety: Robustness from Adversarial Human Attack 2019 Emily Dinan
Samuel Humeau
Bharath Chintagunta
Jason Weston
1
+ Fine-Tuning Language Models from Human Preferences 2019 Daniel M. Ziegler
Nisan Stiennon
Jeffrey Wu
T. B. Brown
Alec Radford
Dario Amodei
Paul F. Christiano
Geoffrey Irving
1
+ Plug and Play Language Models: A Simple Approach to Controlled Text Generation 2019 Sumanth Dathathri
Andrea Madotto
Janice Lan
Jane Hung
Eric Frank
Piero Molino
Jason Yosinski
Rosanne Liu
1
+ The Pushshift Reddit Dataset 2020 Jason Baumgartner
Savvas Zannettou
Brian Keegan
Megan Squire
Jeremy Blackburn
1
+ Can You Put it All Together: Evaluating Conversational Agents’ Ability to Blend Skills 2020 Eric M. Smith
Mary Williamson
Kurt Shuster
Jason Weston
Y-Lan Boureau
1
+ Negative Training for Neural Dialogue Response Generation 2020 Tianxing He
James Glass
1
+ GeDi: Generative Discriminator Guided Sequence Generation 2021 Ben Krause
Akhilesh Gotmare
Bryan McCann
Nitish Shirish Keskar
Shafiq Joty
Richard Socher
Nazneen Fatema Rajani
1
+ RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models 2020 Samuel Gehman
Suchin Gururangan
Maarten Sap
Yejin Choi
Noah A. Smith
1
+ The Pile: An 800GB Dataset of Diverse Text for Language Modeling 2021 Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
Charles Wilmer Foster
Jason Phang
Horace He
Anish Thite
Noa Nabeshima
1
+ Recipes for Building an Open-Domain Chatbot 2021 Stephen Roller
Emily Dinan
Naman Goyal
Da Young Ju
Mary Williamson
Yinhan Liu
Jing Xu
Myle Ott
Eric M. Smith
Y-Lan Boureau
1
+ Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering 2021 Gautier Izacard
Édouard Grave
1
+ PDF Chat FUDGE: Controlled Text Generation With Future Discriminators 2021 Kevin Yang
Dan Klein
1
+ I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling 2021 Yixin Nie
Mary Williamson
Mohit Bansal
Douwe Kiela
Jason Weston
1
+ PDF Chat Internet-Augmented Dialogue Generation 2022 Mojtaba Komeili
Kurt Shuster
Jason Weston
1
+ PDF Chat Beyond Goldfish Memory: Long-Term Open-Domain Conversation 2022 Jing Xu
Arthur Szlam
Jason Weston
1
+ Recursively Summarizing Books with Human Feedback 2021 Jeff Wu
Long Ouyang
Daniel M. Ziegler
Nisan Stiennon
Ryan Lowe
Jan Leike
Paul F. Christiano
1
+ PaLM: Scaling Language Modeling with Pathways 2022 Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
Paul Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
1
+ Jam or Cream First? Modeling Ambiguity in Neural Machine Translation with SCONES 2022 Felix Stahlberg
Shankar Kumar
1
+ Training language models to follow instructions with human feedback 2022 Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
1
+ Am I Me or You? State-of-the-Art Dialogue Models Cannot Maintain an Identity 2021 Kurt Shuster
Jack Urbanek
Arthur Szlam
Jason Weston
1
+ A Simple Contrastive Learning Objective for Alleviating Neural Text Degeneration 2022 Shaojie Jiang
Ruqing Zhang
Svitlana Vakulenko
Maarten de Rijke
1
+ Quark: Controllable Text Generation with Reinforced Unlearning 2022 Ximing Lu
Sean Welleck
Liwei Jiang
Jack Hessel
Lianhui Qin
Peter West
Prithviraj Ammanabrolu
Yejin Choi
1
+ DIRECTOR: Generator-Classifiers For Supervised Language Modeling 2022 Kushal Arora
Kurt Shuster
Sainbayar Sukhbaatar
Jason Weston
1
+ Anticipating Safety Issues in E2E Conversational AI: Framework and Tooling 2021 Emily Dinan
Gavin Abercrombie
A. Stevie Bergman
Shannon Spruit
Dirk Hovy
Y-Lan Boureau
Verena Rieser
1
+ Learning to summarize from human feedback 2020 Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
1
+ The Second Conversational Intelligence Challenge (ConvAI2) 2019 Emily Dinan
Varvara Logacheva
Valentin Malykh
Alexander Miller
Kurt Shuster
Jack Urbanek
Douwe Kiela
Arthur Szlam
Iulian Vlad Serban
Ryan Lowe
1
+ BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage 2022 Kurt Shuster
Jing Xu
Mojtaba Komeili
Da Young Ju
Eric M. Smith
Stephen Roller
Megan Ung
Moya Chen
Kushal Arora
Joshua Lane
1
+ Language Models are Few-Shot Learners 2020 T. B. Brown
Benjamin F. Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
1
+ PyTorch: An Imperative Style, High-Performance Deep Learning Library 2019 Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James T. Bradbury
Gregory Chanan
Trevor Killeen
Zeming Lin
Natalia Gimelshein
Luca Antiga
1
+ Learning from data in the mixed adversarial non-adversarial case: Finding the helpers and ignoring the trolls 2022 Da Young Ju
Jing Xu
Y-Lan Boureau
Jason Weston
1
+ Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback 2022 Jing Xu
Megan Ung
Mojtaba Komeili
Kushal Arora
Y-Lan Boureau
Jason Weston
1
+ A General Language Assistant as a Laboratory for Alignment 2021 Amanda Askell
Yuntao Bai
Anna Chen
Dawn Drain
Deep Ganguli
Tom Henighan
Andy Jones
Nicholas Joseph
Ben Mann
Nova DasSarma
1
+ WebGPT: Browser-assisted question-answering with human feedback 2021 Reiichiro Nakano
Jacob Hilton
Suchir Balaji
Jeff Wu
Long Ouyang
Christina Kim
Christopher Hesse
Shantanu Jain
Vineet Kosaraju
William H. Saunders
1
+ Deep reinforcement learning from human preferences 2017 Paul Christiano
Jan Leike
T. B. Brown
Miljan Martic
Shane Legg
Dario Amodei
1
+ Attention Is All You Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
1