+
|
Improving Automated Feedback Systems for Tutor Training in Low-Resource Scenarios through Data Augmentation
|
2025
|
Chentianye Xu
Jionghao Lin
Tongshuang Wu
Vincent Aleven
Kenneth R. Koedinger
|
+
|
Tool Learning with Foundation Models
|
2024
|
Yujia Qin
Shengding Hu
Yankai Lin
Weize Chen
Ning Ding
Ganqu Cui
Zheni Zeng
Xuanhe Zhou
Y.-Y. Huang
Chaojun Xiao
|
+
|
Orbit: A Framework for Designing and Evaluating Multi-objective Rankers
|
2024
|
Chenyang Yang
Tesi Xiao
Michael Shavlovsky
Christian Kästner
Tongshuang Wu
|
+
|
HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
|
2024
|
Zirui Wang
Xinlei Zhao
Simon Stepputtis
Woojun Kim
Tongshuang Wu
Katia Sycara
Yaqi Xie
|
+
|
What You Say = What You Want? Teaching Humans to Articulate Requirements for LLMs
|
2024
|
Qianou Ma
Weirui Peng
Hua Shen
Kenneth R. Koedinger
Tongshuang Wu
|
+
|
What Is Wrong with My Model? Identifying Systematic Problems with Semantic Data Slicing
|
2024
|
Chenyang Yang
Yao Hong
Grace A. Lewis
Tongshuang Wu
Christian Kästner
|
+
|
A large-scale audit of dataset licensing and attribution in AI
|
2024
|
Shayne Longpre
Robert Mahari
Anthony Chen
Naana Obeng-Marnu
Damien Sileo
William Brannon
Niklas Muennighoff
Nathan Khazam
Jad Kabbara
Kartik Perisetla
|
+
|
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning
|
2024
|
Chenyang Zhao
Xueying Jia
Vijay Viswanathan
Tongshuang Wu
Graham Neubig
|
+
|
Synthetic Multimodal Question Generation
|
2024
|
Ian Wu
Sravan Jayanthi
Vijay Viswanathan
Simon Rosenberg
Sina Pakazad
Tongshuang Wu
Graham Neubig
|
+
|
WebCanvas: Benchmarking Web Agents in Online Environments
|
2024
|
Yichen Pan
Dehan Kong
Sida Zhou
Cheng Cui
Yifei Leng
Bing Jiang
Hangyu Liu
Yanyi Shang
Shuyan Zhou
Tongshuang Wu
|
+
|
Wikibench: Community-Driven Data Curation for AI Evaluation on Wikipedia
|
2024
|
Tzu-Sheng Kuo
Aaron Halfaker
Zirui Cheng
Jiwoo Kim
Meng-Hsin Wu
Tongshuang Wu
Kenneth Holstein
Haiyi Zhu
|
+
|
Selenite: Scaffolding Online Sensemaking with Comprehensive Overviews Elicited from Large Language Models
|
2024
|
Michael Xieyang Liu
Tongshuang Wu
Tianying Chen
Franklin Mingzhe Li
Aniket Kittur
Brad A. Myers
|
+
|
Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness
|
2024
|
Xinran Zhao
Tong Chen
Sihao Chen
Hongming Zhang
Tongshuang Wu
|
+
|
Better Synthetic Data by Retrieving and Transforming Existing Datasets
|
2024
|
Saumya Gandhi
Ritu Gala
Vijay Viswanathan
Tongshuang Wu
Graham Neubig
|
+
|
Evaluating Mathematical Reasoning Beyond Accuracy
|
2024
|
Shijie Xia
Xuefeng Li
Yixin Liu
Tongshuang Wu
Pengfei Liu
|
+
|
Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models
|
2024
|
Xinran Zhao
Hongming Zhang
Xiaoman Pan
Wenlin Yao
Dong Yu
Tongshuang Wu
Jianshu Chen
|
+
|
Large Language Models Enable Few-Shot Clustering
|
2024
|
Vijay Viswanathan
Kiril Gashteovski
Carolin Lawrence
Tongshuang Wu
Graham Neubig
|
+
|
Do LLMs Exhibit Human-like Response Biases? A Case Study in Survey Design
|
2024
|
Lindia Tjuatja
Valerie Chen
Tongshuang Wu
Ameet Talwalkar
Graham Neubig
|
+
|
Synergi: A Mixed-Initiative System for Scholarly Synthesis and Sensemaking
|
2023
|
Hyeonsu B Kang
Tongshuang Wu
Joseph Chee Chang
Aniket Kittur
|
+
|
ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing
|
2023
|
Hua Shen
Chieh-Yang Huang
Tongshuang Wu
Ting-Hao Huang
|
+
|
Parachute: Evaluating Interactive Human-LM Co-writing Systems
|
2023
|
Hua Shen
Tongshuang Wu
|
+
|
Tool Learning with Foundation Models
|
2023
|
Yujia Qin
Shengding Hu
Yankai Lin
Weize Chen
Ning Ding
Ganqu Cui
Zheni Zeng
Yufei Huang
Chaojun Xiao
Chi Han
|
+
|
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
|
2023
|
Patrick Fernandes
Aman Madaan
Emmy Liu
António Farinhas
Pedro Henrique Martins
Amanda Bertsch
José G. C. de Souza
Shuyan Zhou
Tongshuang Wu
Graham Neubig
|
+
|
BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases
|
2023
|
Yiming Zhang
Sravani Nanduri
Liwei Jiang
Tongshuang Wu
Maarten Sap
|
+
|
DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions
|
2023
|
Vijay Viswanathan
Luyu Gao
Tongshuang Wu
Pengfei Liu
Graham Neubig
|
+
|
Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses
|
2023
|
Logan Stapleton
Jordan Taylor
Sarah Jane Fox
Tongshuang Wu
Haiyi Zhu
|
+
|
Is AI the better programming partner? Human-Human Pair Programming vs. Human-AI pAIr Programming
|
2023
|
Qianou Ma
Tongshuang Wu
Kenneth R. Koedinger
|
+
|
Large Language Models Enable Few-Shot Clustering
|
2023
|
Vijay Viswanathan
Kiril Gashteovski
Carolin Lawrence
Tongshuang Wu
Graham Neubig
|
+
|
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
|
2023
|
Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
Wenxing Deng
Ziqi Ding
Bill Guo
Sireesh Gururaja
Tzu-Sheng Kuo
|
+
|
Prompt2Model: Generating Deployable Models from Natural Language Instructions
|
2023
|
Vijay Viswanathan
Chenyang Zhao
Amanda Bertsch
Tongshuang Wu
Graham Neubig
|
+
|
Selenite: Scaffolding Online Sensemaking with Comprehensive Overviews Elicited from Large Language Models
|
2023
|
Michael Xieyang Liu
Tongshuang Wu
Tianying Chen
Franklin Mingzhe Li
Aniket Kittur
Brad A. Myers
|
+
|
HypoCompass: Large-Language-Model-based Tutor for Hypothesis Construction in Debugging for Novices
|
2023
|
Qianou Ma
Hua Shen
Kenneth R. Koedinger
Tongshuang Wu
|
+
|
From Nuisance to News Sense: Augmenting the News with Cross-Document Evidence and Context
|
2023
|
Jeremiah Milbauer
Ziqi Ding
Zhijin Wu
Tongshuang Wu
|
+
|
Beyond Testers' Biases: Guiding Model Testing with Knowledge Bases using LLMs
|
2023
|
Chenyang Yang
Rishabh Rustogi
Rachel Brower-Sinning
Grace A. Lewis
Christian Kästner
Tongshuang Wu
|
+
|
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI
|
2023
|
Shayne Longpre
Robert Mahari
Anthony Chen
Naana Obeng-Marnu
Damien Sileo
William Brannon
Niklas Muennighoff
Nathan Khazam
Jad Kabbara
Kartik Perisetla
|
+
|
Measuring Adversarial Datasets
|
2023
|
Yuanchen Bai
Raoyi Huang
Vijay Viswanathan
Tzu-Sheng Kuo
Tongshuang Wu
|
+
|
AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts
|
2022
|
Tongshuang Wu
Michael Terry
Carrie J. Cai
|
+
|
StoryBuddy: A Human-AI Collaborative Chatbot for Parent-Child Interactive Storytelling with Flexible Parental Involvement
|
2022
|
Zheng Zhang
Ying Xu
Yanhao Wang
Bingsheng Yao
Daniel Ritchie
Tongshuang Wu
Mo Yu
Dakuo Wang
Toby Jia-Jun Li
|
+
|
Pretty Princess vs. Successful Leader: Gender Roles in Greeting Card Messages
|
2022
|
Jiao Sun
Tongshuang Wu
Yue Jiang
Ronil Awalegaonkar
Xi Lin
Diyi Yang
|
+
|
PromptChainer: Chaining Large Language Model Prompts through Visual Programming
|
2022
|
Tongshuang Wu
Ellen Jiang
Aaron Donsbach
Jeff Gray
Alejandra Molina
Michael Terry
Carrie J. Cai
|
+
|
Tailor: Generating and Perturbing Text with Semantic Controls
|
2022
|
Alexis Ross
Tongshuang Wu
Hao Peng
Matthew E. Peters
Matt Gardner
|
+
|
Are Shortest Rationales the Best Explanations for Human Understanding?
|
2022
|
Hua Shen
Tongshuang Wu
Wenbo Guo
Ting-Hao Huang
|
+
|
Fantastic Questions and Where to Find Them: FairytaleQA -- An Authentic Dataset for Narrative Comprehension
|
2022
|
Ying Xu
Dakuo Wang
Mo Yu
Daniel Ritchie
Bingsheng Yao
Tongshuang Wu
Zheng Zhang
Toby Jia-Jun Li
Nora Bradford
Branda Sun
|
+
|
It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story Books
|
2022
|
Bingsheng Yao
Dakuo Wang
Tongshuang Wu
Zheng Zhang
Toby Li
Mo Yu
Ying Xu
|
+
|
Towards Natural Language-Based Visualization Authoring
|
2022
|
Yun Wang
Zhitao Hou
Leixian Shen
Tongshuang Wu
Jiaqi Wang
He Huang
Haidong Zhang
Dongmei Zhang
|
+
|
Capabilities for Better ML Engineering
|
2022
|
Chenyang Yang
Rachel Brower-Sinning
Grace A. Lewis
Christian Kästner
Tongshuang Wu
|
+
|
Decisions that Explain Themselves: A User-Centric Deep Reinforcement Learning Explanation System
|
2022
|
Xiaoran Wu
Zihan Yan
Chongjie Zhang
Tongshuang Wu
|
+
|
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
|
2021
|
Kaustubh Dhole
Varun Gangal
Sebastian Gehrmann
Aadesh Gupta
Zhenhao Li
Saad Mahamood
Abinaya Mahendiran
Simon Mille
Ashish Shrivastava
Samson Tan
|
+
|
It is AI's Turn to Ask Human a Question: Question and Answer Pair Generation for Children Storybooks in FairytaleQA Dataset.
|
2021
|
Bingsheng Yao
Dakuo Wang
Tongshuang Wu
Tran Manh Hoang
Branda Sun
Toby Jia-Jun Li
Mo Yu
Ying Xu
|
+
|
DeHumor: Visual Analytics for Decomposing Humor
|
2021
|
Xingbo Wang
Ming Yao
Tongshuang Wu
Haipeng Zeng
Yong Wang
Huamin Qu
|
+
|
Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance
|
2021
|
Gagan Bansal
Tongshuang Wu
Joyce Zhou
Raymond Fok
Besmira Nushi
Ece Kamar
Marco Túlio Ribeiro
Daniel S. Weld
|
+
|
Polyjuice: Automated, General-purpose Counterfactual Generation.
|
2021
|
Tongshuang Wu
Marco Túlio Ribeiro
Jeffrey Heer
Daniel S. Weld
|
+
|
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models
|
2021
|
Tongshuang Wu
Marco Túlio Ribeiro
Jeffrey Heer
Daniel S. Weld
|
+
|
AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts
|
2021
|
Tongshuang Wu
Michael Terry
Carrie J. Cai
|
+
|
Tailor: Generating and Perturbing Text with Semantic Controls
|
2021
|
Alexis Ross
Tongshuang Wu
Hao Peng
Matthew E. Peters
Matt Gardner
|
+
|
It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story Books
|
2021
|
Bingsheng Yao
Dakuo Wang
Tongshuang Wu
Zheng Zhang
Toby Jia-Jun Li
Mo Yu
Ying Xu
|
+
|
Pretty Princess vs. Successful Leader: Gender Roles in Greeting Card Messages
|
2021
|
Jiao Sun
Tongshuang Wu
Yue Jiang
Ronil Awalegaonkar
Xi Victoria Lin
Diyi Yang
|
+
|
Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance
|
2020
|
Gagan Bansal
Tongshuang Wu
Joyce Zhou
Raymond Fok
Besmira Nushi
Ece Kamar
Marco Túlio Ribeiro
Daniel S. Weld
|
+
|
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
|
2020
|
Marco Túlio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
|
+
|
Technology-Enabled Disinformation: Summary, Lessons, and Recommendations
|
2018
|
John Garland Akers
Gagan Bansal
Gabriel Cadamuro
Christine Chen
Quanze Chen
Lucy Lin
Phoebe Mulcaire
Rajalakshmi Nandakumar
Matthew S. Rockett
Lucy Simko
|