Tongshuang Wu

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat Improving Automated Feedback Systems for Tutor Training in Low-Resource Scenarios through Data Augmentation 2025 Chentianye Xu
Jionghao Lin
Tongshuang Wu
Vincent Aleven
Kenneth R. Koedinger
+ PDF Chat Tool Learning with Foundation Models 2024 Yujia Qin
Shengding Hu
Yankai Lin
Weize Chen
Ning Ding
Ganqu Cui
Zheni Zeng
Xuanhe Zhou
Y.-Y. Huang
Chaojun Xiao
+ PDF Chat Orbit: A Framework for Designing and Evaluating Multi-objective Rankers 2024 Chenyang Yang
Tesi Xiao
Michael Shavlovsky
Christian KĂ€stner
Tongshuang Wu
+ PDF Chat HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation 2024 Zirui Wang
Xinlei Zhao
Simon Stepputtis
Woojun Kim
Tongshuang Wu
Katia Sycara
Yaqi Xie
+ PDF Chat What You Say = What You Want? Teaching Humans to Articulate Requirements for LLMs 2024 Qianou Ma
Weirui Peng
Hua Shen
Kenneth R. Koedinger
Tongshuang Wu
+ PDF Chat What Is Wrong with My Model? Identifying Systematic Problems with Semantic Data Slicing 2024 Chenyang Yang
Yao Hong
Grace A. Lewis
Tongshuang Wu
Christian KĂ€stner
+ A large-scale audit of dataset licensing and attribution in AI 2024 Shayne Longpre
Robert Mahari
Anthony Chen
Naana Obeng-Marnu
Damien Sileo
William Brannon
Niklas Muennighoff
Nathan Khazam
Jad Kabbara
Kartik Perisetla
+ PDF Chat SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning 2024 Chenyang Zhao
Xueying Jia
Vijay Viswanathan
Tongshuang Wu
Graham Neubig
+ PDF Chat Synthetic Multimodal Question Generation 2024 Ian Wu
Sravan Jayanthi
Vijay Viswanathan
Simon Rosenberg
Sina Pakazad
Tongshuang Wu
Graham Neubig
+ PDF Chat WebCanvas: Benchmarking Web Agents in Online Environments 2024 Yichen Pan
Dehan Kong
Sida Zhou
Cheng Cui
Yifei Leng
Bing Jiang
Hangyu Liu
Yanyi Shang
Shuyan Zhou
Tongshuang Wu
+ Wikibench: Community-Driven Data Curation for AI Evaluation on Wikipedia 2024 Tzu-Sheng Kuo
Aaron Halfaker
Zirui Cheng
Jiwoo Kim
Meng-Hsin Wu
Tongshuang Wu
Kenneth Holstein
Haiyi Zhu
+ Selenite: Scaffolding Online Sensemaking with Comprehensive Overviews Elicited from Large Language Models 2024 Michael Xieyang Liu
Tongshuang Wu
Tianying Chen
Franklin Mingzhe Li
Aniket Kittur
Brad A. Myers
+ PDF Chat Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness 2024 Xinran Zhao
Tong Chen
Sihao Chen
Hongming Zhang
Tongshuang Wu
+ PDF Chat Better Synthetic Data by Retrieving and Transforming Existing Datasets 2024 Saumya Gandhi
Ritu Gala
Vijay Viswanathan
Tongshuang Wu
Graham Neubig
+ PDF Chat Evaluating Mathematical Reasoning Beyond Accuracy 2024 Shijie Xia
Xuefeng Li
Yixin Liu
Tongshuang Wu
Pengfei Liu
+ PDF Chat Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models 2024 Xinran Zhao
Hongming Zhang
Xiaoman Pan
Wenlin Yao
Dong Yu
Tongshuang Wu
Jianshu Chen
+ PDF Chat Large Language Models Enable Few-Shot Clustering 2024 Vijay Viswanathan
Kiril Gashteovski
Kiril Gashteovski
Carolin Lawrence
Tongshuang Wu
Graham Neubig
+ Do LLMs Exhibit Human-like Response Biases? A Case Study in Survey Design 2024 Lindia Tjuatja
Valerie Chen
Tongshuang Wu
Ameet Talwalkwar
Graham Neubig
+ Synergi: A Mixed-Initiative System for Scholarly Synthesis and Sensemaking 2023 Hyeonsu B Kang
Tongshuang Wu
Joseph Chee Chang
Aniket Kittur
+ PDF Chat ConvXAI : Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing 2023 Hua Shen
Chieh-Yang Huang
Tongshuang Wu
Ting-Hao Huang
+ Parachute: Evaluating Interactive Human-LM Co-writing Systems 2023 Hua Shen
Tongshuang Wu
+ Tool Learning with Foundation Models 2023 Yujia Qin
Shengding Hu
Yankai Lin
Weize Chen
Ning Ding
Ganqu Cui
Zheni Zeng
Yufei Huang
Chaojun Xiao
Chi Han
+ Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation 2023 Patrick Fernandes
Aman Madaan
Emmy Liu
AntĂłnio Farinhas
Pedro Henrique Martins
Amanda Bertsch
José G. C. de Souza
Shuyan Zhou
Tongshuang Wu
Graham Neubig
+ ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing 2023 Hua Shen
Chieh-Yang Huang
Tongshuang Wu
Ting-Hao Huang
+ BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases 2023 Yiming Zhang
Sravani Nanduri
Liwei Jiang
Tongshuang Wu
Maarten Sap
+ DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions 2023 Vijay Viswanathan
Luyu Gao
Tongshuang Wu
Pengfei Liu
Graham Neubig
+ Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses 2023 Logan Stapleton
Jordan Taylor
Sarah Jane Fox
Tongshuang Wu
Haiyi Zhu
+ Is AI the better programming partner? Human-Human Pair Programming vs. Human-AI pAIr Programming 2023 Qianou
Ma -
Tongshuang Wu
Kenneth R. Koedinger
+ Large Language Models Enable Few-Shot Clustering 2023 Vijay Viswanathan
Kiril Gashteovski
Carolin Lawrence
Tongshuang Wu
Graham Neubig
+ LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs 2023 Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
Wenxing Deng
Ziqi Ding
Bill Guo
Sireesh Gururaja
Tzu-Sheng Kuo
+ DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions 2023 Vijay Viswanathan
Luyu Gao
Tongshuang Wu
Pengfei Liu
Graham Neubig
+ Prompt2Model: Generating Deployable Models from Natural Language Instructions 2023 Vijay Viswanathan
Chenyang Zhao
Amanda Bertsch
Tongshuang Wu
Graham Neubig
+ Selenite: Scaffolding Online Sensemaking with Comprehensive Overviews Elicited from Large Language Models 2023 Michael Xieyang Liu
Tongshuang Wu
Tianying Chen
Franklin Mingzhe Li
Aniket Kittur
Brad A. Myers
+ HypoCompass: Large-Language-Model-based Tutor for Hypothesis Construction in Debugging for Novices 2023 Qianou Ma
Hua Shen
Kenneth R. Koedinger
Tongshuang Wu
+ From Nuisance to News Sense: Augmenting the News with Cross-Document Evidence and Context 2023 Jeremiah Milbauer
Ziqi Ding
Zhijin Wu
Tongshuang Wu
+ Beyond Testers' Biases: Guiding Model Testing with Knowledge Bases using LLMs 2023 Chenyang Yang
Rishabh Rustogi
Rachel Brower–Sinning
Grace A. Lewis
Christian KĂ€stner
Tongshuang Wu
+ The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI 2023 Shayne Longpre
Robert Mahari
Anthony Chen
Naana Obeng-Marnu
Damien Sileo
William Brannon
Niklas Muennighoff
Nathan Khazam
Jad Kabbara
Kartik Perisetla
+ Measuring Adversarial Datasets 2023 Yuanchen Bai
Raoyi Huang
Vijay Viswanathan
Tzu-Sheng Kuo
Tongshuang Wu
+ PDF Chat BiasX: “Thinking Slow” in Toxic Content Moderation with Explanations of Implied Social Biases 2023 Yiming Zhang
Sravani Nanduri
Liwei Jiang
Tongshuang Wu
Maarten Sap
+ Beyond Testers’ Biases: Guiding Model Testing with Knowledge Bases using LLMs 2023 Chenyang Yang
Rishabh Rustogi
Rachel Brower–Sinning
Grace A. Lewis
Christian Kaestner
Tongshuang Wu
+ Prompt2Model: Generating Deployable Models from Natural Language Instructions 2023 Vijay Viswanathan
Chenyang Zhao
Amanda Bertsch
Tongshuang Wu
Graham Neubig
+ PDF Chat Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation 2023 Patrick Fernandes
Aman Madaan
Emmy Liu
AntĂłnio Farinhas
Pedro Henrique Martins
Amanda Bertsch
José G. C. de Souza
Shuyan Zhou
Tongshuang Wu
Graham Neubig
+ Measuring Adversarial Datasets 2023 Yuanchen Bai
Raoyi Huang
Vijay Viswanathan
Tzu-Sheng Kuo
Tongshuang Wu
+ PDF Chat AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts 2022 Tongshuang Wu
Michael Terry
Carrie J. Cai
+ PDF Chat StoryBuddy: A Human-AI Collaborative Chatbot for Parent-Child Interactive Storytelling with Flexible Parental Involvement 2022 Zheng Zhang
Ying Xu
Yanhao Wang
Bingsheng Yao
Daniel Ritchie
Tongshuang Wu
Mo Yu
Dakuo Wang
Toby Jia-Jun Li
+ PDF Chat Pretty Princess vs. Successful Leader: Gender Roles in Greeting Card Messages 2022 Jiao Sun
Tongshuang Wu
Yue Jiang
Ronil Awalegaonkar
Xi Lin
Diyi Yang
+ PDF Chat PromptChainer: Chaining Large Language Model Prompts through Visual Programming 2022 Tongshuang Wu
Ellen Jiang
Aaron Donsbach
Jeff Gray
Alejandra Molina
Michael Terry
Carrie J. Cai
+ PDF Chat Tailor: Generating and Perturbing Text with Semantic Controls 2022 Alexis Ross
Tongshuang Wu
Hao Peng
Matthew N. Peters
Matt Gardner
+ Are Shortest Rationales the Best Explanations for Human Understanding? 2022 Hua Shen
Tongshuang Wu
Wenbo Guo
Ting-Hao Huang
+ PDF Chat Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset for Narrative Comprehension 2022 Ying Xu
Dakuo Wang
Mo Yu
Daniel Ritchie
Bingsheng Yao
Tongshuang Wu
Zheng Zhang
Toby Jia-Jun Li
Nora Bradford
Branda Sun
+ PromptChainer: Chaining Large Language Model Prompts through Visual Programming 2022 Tongshuang Wu
Ellen Jiang
Aaron Donsbach
Jeff Gray
Alejandra Molina
Michael Terry
Carrie J. Cai
+ Are Shortest Rationales the Best Explanations for Human Understanding? 2022 Hua Shen
Tongshuang Wu
Wenbo Guo
Ting-Hao Huang
+ PDF Chat It is AI’s Turn to Ask Humans a Question: Question-Answer Pair Generation for Children’s Story Books 2022 Bingsheng Yao
Dakuo Wang
Tongshuang Wu
Zheng Zhang
Toby Li
Mo Yu
Ying Xu
+ Towards Natural Language-Based Visualization Authoring 2022 Yun Wang
Zhitao Hou
Leixian Shen
Tongshuang Wu
Jiaqi Wang
He Huang
Haidong Zhang
Dongmei Zhang
+ Fantastic Questions and Where to Find Them: FairytaleQA -- An Authentic Dataset for Narrative Comprehension 2022 Ying Xu
Dakuo Wang
Mo Yu
Daniel Ritchie
Bingsheng Yao
Tongshuang Wu
Zheng Zhang
Toby Jia-Jun Li
Nora Bradford
Branda Sun
+ PDF Chat Towards Natural Language-Based Visualization Authoring 2022 Yun Wang
Zhitao Hou
Leixian Shen
Tongshuang Wu
Jiaqi Wang
He Huang
Haidong Zhang
Dongmei Zhang
+ Capabilities for Better ML Engineering 2022 Chenyang Yang
Rachel Brower–Sinning
Grace A. Lewis
Christian KĂ€stner
Tongshuang Wu
+ Decisions that Explain Themselves: A User-Centric Deep Reinforcement Learning Explanation System 2022 Xiaoran Wu
Zihan Yan
Chongjie Zhang
Tongshuang Wu
+ PDF Chat NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation 2021 Kaustubh Dhole
Varun Gangal
Sebastian Gehrmann
Aadesh Gupta
Zhenhao Li
Saad Mahamood
Abinaya Mahendiran
Simon Mille
Ashish Srivastava
Samson Tan
+ It is AI's Turn to Ask Human a Question: Question and Answer Pair Generation for Children Storybooks in FairytaleQA Dataset. 2021 Bingsheng Yao
Dakuo Wang
Tongshuang Wu
Tran Manh Hoang
Branda Sun
Toby Jia-Jun Li
Mo Yu
Ying Xu
+ PDF Chat DeHumor: Visual Analytics for Decomposing Humor 2021 Xingbo Wang
Ming Yao
Tongshuang Wu
Haipeng Zeng
Yong Wang
Huamin Qu
+ PDF Chat Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance 2021 Gagan Bansal
Tongshuang Wu
Joyce Zhou
Raymond Fok
Besmira Nushi
Ece Kamar
Marco TĂșlio Ribeiro
Daniel S. Weld
+ Polyjuice: Automated, General-purpose Counterfactual Generation. 2021 Tongshuang Wu
Marco TĂșlio Ribeiro
Jeffrey Heer
Daniel S. Weld
+ Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models 2021 Tongshuang Wu
Marco TĂșlio Ribeiro
Jeffrey Heer
Daniel S. Weld
+ Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models 2021 Tongshuang Wu
Marco TĂșlio Ribeiro
Jeffrey Heer
Daniel S. Weld
+ NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation 2021 Kaustubh Dhole
Varun Gangal
Sebastian Gehrmann
Aadesh Gupta
Zhenhao Li
Saad Mahamood
Abinaya Mahendiran
Simon Mille
Ashish Shrivastava
Samson Tan
+ AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts 2021 Tongshuang Wu
Michael Terry
Carrie J. Cai
+ Tailor: Generating and Perturbing Text with Semantic Controls 2021 Alexis Ross
Tongshuang Wu
Hao Peng
Matthew E. Peters
Matt Gardner
+ It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story Books 2021 Bingsheng Yao
Dakuo Wang
Tongshuang Wu
Zheng Zhang
Toby Jia-Jun Li
Mo Yu
Ying Xu
+ Pretty Princess vs. Successful Leader: Gender Roles in Greeting Card Messages 2021 Jiao Sun
Tongshuang Wu
Yue Jiang
Ronil Awalegaonkar
Xi Victoria Lin
Diyi Yang
+ Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance 2020 Gagan Bansal
Tongshuang Wu
Joyce Zhou
Raymond Fok
Besmira Nushi
Ece Kamar
Marco TĂșlio Ribeiro
Daniel S. Weld
+ Beyond Accuracy: Behavioral Testing of NLP models with CheckList 2020 Marco TĂșlio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
+ Beyond Accuracy: Behavioral Testing of NLP Models with CheckList 2020 Marco TĂșlio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
+ Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance 2020 Gagan Bansal
Tongshuang Wu
Joyce Zhou
Raymond Fok
Besmira Nushi
Ece Kamar
Marco TĂșlio Ribeiro
Daniel S. Weld
+ Beyond Accuracy: Behavioral Testing of NLP models with CheckList 2020 Marco TĂșlio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
+ Technology-Enabled Disinformation: Summary, Lessons, and Recommendations 2018 John Garland Akers
Gagan Bansal
Gabriel Cadamuro
Christine Chen
Quanze Chen
Lucy Lin
Phoebe Mulcaire
Rajalakshmi Nandakumar
Matthew S. Rockett
Lucy Simko
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat SQuAD: 100,000+ Questions for Machine Comprehension of Text 2016 Pranav Rajpurkar
Jian Zhang
Konstantin Lopyrev
Percy Liang
8
+ RoBERTa: A Robustly Optimized BERT Pretraining Approach 2019 Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
Mike Lewis
Luke Zettlemoyer
Veselin Stoyanov
7
+ Adversarial Example Generation with Syntactically Controlled Paraphrase Networks 2018 Mohit Iyyer
John Wieting
Kevin Gimpel
Luke Zettlemoyer
6
+ Stress Test Evaluation for Natural Language Inference 2018 Aakanksha Naik
Abhilasha Ravichander
Norman Sadeh
Carolyn Penstein Rosé
Graham Neubig
6
+ PDF Chat Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance 2021 Gagan Bansal
Tongshuang Wu
Joyce Zhou
Raymond Fok
Besmira Nushi
Ece Kamar
Marco TĂșlio Ribeiro
Daniel S. Weld
5
+ Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior? 2020 Peter Hase
Mohit Bansal
5
+ DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter 2019 Victor Sanh
Lysandre Debut
Julien Chaumond
Thomas Wolf
5
+ Language Models are Few-Shot Learners 2020 T. B. Brown
Benjamin F. Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
5
+ Probing What Different NLP Tasks Teach Machines about Function Word Comprehension 2019 Najoung Kim
Roma Patel
Adam Poliak
Patrick Xia
Alex Wang
Tom McCoy
Ian Tenney
Alexis Ross
Tal Linzen
Benjamin Van Durme
5
+ Explaining NLP Models via Minimal Contrastive Editing (MiCE) 2020 Alexis Ross
Ana Marasović
Matthew E. Peters
4
+ Explanation in artificial intelligence: Insights from the social sciences 2018 Tim Miller
4
+ Generate Your Counterfactuals: Towards Controlled Counterfactual Generation for Text. 2021 Nishtha Madaan
Inkit Padhi
Naveen Panwar
Diptikalyan Saha
4
+ The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations 2015 Felix Hill
Antoine Bordes
Sumit Chopra
Jason Weston
4
+ Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference 2019 Tom McCoy
Ellie Pavlick
Tal Linzen
4
+ A large annotated corpus for learning natural language inference 2015 Samuel R. Bowman
Gabor Angeli
Christopher Potts
Christopher D. Manning
4
+ Evaluating Models’ Local Decision Boundaries via Contrast Sets 2020 Matt Gardner
Yoav Artzi
Victoria Basmov
Jonathan Berant
Ben Bogin
Sihao Chen
Pradeep Dasigi
Dheeru Dua
Yanai Elazar
Ananth Gottumukkala
3
+ Attention Is All You Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Ɓukasz Kaiser
Illia Polosukhin
3
+ Annotation Artifacts in Natural Language Inference Data 2018 Suchin Gururangan
Swabha Swayamdipta
Omer Levy
Roy Schwartz
Samuel R. Bowman
Noah A. Smith
3
+ MS MARCO: A Human Generated MAchine Reading COmprehension Dataset 2016 Payal Bajaj
Daniel Campos
Nick Craswell
Li Deng
Jianfeng Gao
Xiaodong Liu
Rangan Majumder
Andrew McNamara
Bhaskar Mitra
Tri Gia Nguyen
3
+ BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018 Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
3
+ Unified Language Model Pre-training for Natural Language Understanding and Generation 2019 Li Dong
Nan Yang
Wenhui Wang
Furu Wei
Xiaodong Liu
Yu Wang
Jianfeng Gao
Ming Zhou
Hsiao-Wuen Hon
3
+ TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP 2020 John X. Morris
Eli Lifland
Jin Yong Yoo
Jake Grigsby
Di Jin
Yanjun Qi
3
+ PDF Chat Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision 2020 Damien Teney
Ehsan Abbasnedjad
Anton van den Hengel
3
+ Beyond Accuracy: Behavioral Testing of NLP Models with CheckList 2020 Marco TĂșlio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
3
+ BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension 2020 Mike Lewis
Yinhan Liu
Naman Goyal
Marjan Ghazvininejad
Abdelrahman Mohamed
Omer Levy
Veselin Stoyanov
Luke Zettlemoyer
3
+ Linguistically-Informed Transformations (LIT): A Method for Automatically Generating Contrast Sets 2020 Chuanrong Li
Lin Shengshuo
Zeyu Liu
Xinyi Wu
Xuhui Zhou
Shane Steinert‐Threlkeld
3
+ Plug and Play Language Models: A Simple Approach to Controlled Text Generation 2019 Sumanth Dathathri
Andrea Madotto
Janice Lan
Jane Hung
Eric Frank
Piero Molino
Jason Yosinski
Rosanne Liu
3
+ CTRL: A Conditional Transformer Language Model for Controllable Generation 2019 Nitish Shirish Keskar
Bryan McCann
Lav R. Varshney
Caiming Xiong
Richard Socher
3
+ PDF Chat "Why is 'Chicago' deceptive?" Towards Building Model-Driven Tutorials for Humans 2020 Vivian Lai
Han Liu
Chenhao Tan
3
+ A Joint Model for Question Answering and Question Generation 2017 Tong Wang
Xingdi Yuan
Adam Trischler
3
+ Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets 2019 Mor Geva
Yoav Goldberg
Jonathan Berant
3
+ PDF Chat The NarrativeQA Reading Comprehension Challenge 2018 TomĂĄĆĄ KočiskĂœ
Jonathan Schwarz
Phil Blunsom
Chris Dyer
Karl Moritz Hermann
GĂĄbor Melis
Edward Grefenstette
3
+ Question Answering and Question Generation as Dual Tasks 2017 Duyu Tang
Nan Duan
Tao Qin
Zhao Yan
Ming Zhou
3
+ PDF Chat ParaNMT-50M: Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations 2018 John Wieting
Kevin Gimpel
3
+ Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks 2019 Nils Reimers
Iryna Gurevych
3
+ Learning the Difference that Makes a Difference with Counterfactually-Augmented Data 2019 Divyansh Kaushik
Eduard Hovy
Zachary C. Lipton
3
+ Universal Adversarial Triggers for Attacking and Analyzing NLP 2019 Eric Wallace
Shi Feng
Nikhil Kandpal
Matt Gardner
Sameer Singh
3
+ PDF Chat WinoGrande: An Adversarial Winograd Schema Challenge at Scale 2020 Keisuke Sakaguchi
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
3
+ Politeness Transfer: A Tag and Generate Approach 2020 Aman Madaan
Amrith Setlur
Tanmay Parekh
BarnabĂĄs PĂłczos
Graham Neubig
Yiming Yang
Ruslan Salakhutdinov
Alan W. Black
Shrimai Prabhumoye
3
+ Training language models to follow instructions with human feedback 2022 Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
3
+ Correlation-based Intrinsic Evaluation of Word Vector Representations 2016 Yulia Tsvetkov
Manaal Faruqui
Chris Dyer
2
+ Natural Language Generation enhances human decision-making with uncertain information 2016 Dimitra Gkatzia
Oliver Lemon
Verena Rieser
2
+ PDF Chat Incentivizing High Quality Crowdwork 2015 Chien-Ju Ho
Aleksandrs Slivkins
Siddharth Suri
Jennifer Wortman Vaughan
2
+ Know What You Don’t Know: Unanswerable Questions for SQuAD 2018 Pranav Rajpurkar
Robin Jia
Percy Liang
2
+ PDF Chat Analyzing Compositionality-Sensitivity of NLI Models 2019 Yixin Nie
Yicheng Wang
Mohit Bansal
2
+ Counterfactual Fairness in Text Classification through Robustness 2019 Sahaj Garg
Vincent Perot
Nicole Limtiaco
Ankur Taly
Ed H.
Alex Beutel
2
+ PDF Chat The mythos of model interpretability 2018 Zachary C. Lipton
2
+ A Unified Approach to Interpreting Model Predictions 2017 Scott Lundberg
Su‐In Lee
2
+ PDF Chat Learning to Ask: Neural Question Generation for Reading Comprehension 2017 Xinya Du
Junru Shao
Claire Cardie
2
+ BERT Rediscovers the Classical NLP Pipeline. 2019 Ian Tenney
Dipanjan Das
Ellie Pavlick
2