Yi-Hao Peng

Generating author description...

All published works

Action	Title	Year	Authors
+ PDF Chat	AutoPresent: Designing Structured Visuals from Scratch	2025	Jiaxin Ge Zora Zhiruo Wang Xuhui Zhou Yi-Hao Peng Sanjay Subramanian Qinyue Tan Maarten Sap Alane Suhr Daniel Fried Graham Neubig
+ PDF Chat	UIClip: A Data-driven Model for Assessing User Interface Design	2024	Jason Wu Yi-Hao Peng Amanda Li Amanda Swearngin Jeffrey P. Bigham Jeffrey Nichols
+ PDF Chat	Long-Form Answers to Visual Questions from Blind and Low Vision People	2024	Mina Huh Fangyuan Xu Yi-Hao Peng Chongyan Chen Hansika Murugu Danna Gurari Eunsol Soul Choi Amy Pavel
+ PDF Chat	"This really lets us see the entire world:" Designing a conversational telepresence robot for homebound older adults	2024	Yaxin Hu Laura Stegner Yasmine Kotturi Caroline Zhang Yi-Hao Peng Faria Huq Yuhang Zhao Jeffrey P. Bigham Bilge Mutlu
+ PDF Chat	Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions	2024	Hua Shen Tiffany Knearem Reshmi Ghosh Kenan Alkiek K. Siva Krishna Yachuan Liu Ziqiao Ma Savvas Petridis Yi-Hao Peng Li Qiwei
+ PDF Chat	UIClip: A Data-driven Model for Assessing User Interface Design	2024	Jason Wu Yi-Hao Peng Amanda Li Amanda Swearngin Jeffrey P. Bigham Jeffrey Nichols
+	GenAssist: Making Image Generation Accessible	2023	Mina Huh Yi-Hao Peng Amy Pavel
+	AVscript: Accessible Video Editing with Audio-Visual Scripts	2023	Mina Huh Saelyne Yang Yi-Hao Peng Xiang Chen Young‐Ho Kim Amy Pavel
+	WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics	2023	Jason Wu Siyan Wang Siman Shen Yi-Hao Peng Jeffrey Nichols Jeffrey P. Bigham
+	GenAssist: Making Image Generation Accessible	2023	Mina Huh Yi-Hao Peng Amy Pavel
+	Say It All: Feedback for Improving Non-Visual Presentation Accessibility	2021	Yi-Hao Peng JiWoong Jang Jeffrey P. Bigham Amy Pavel
+	Say It All: Feedback for Improving Non-Visual Presentation Accessibility	2021	Yi-Hao Peng JiWoong Jang Jeffrey P. Bigham Amy Pavel

Common Coauthors

Coauthor	Papers Together
Jeffrey P. Bigham	7
Amy Pavel	6
Mina Huh	4
Jason Wu	3
Jeffrey Nichols	3
JiWoong Jang	2
Amanda Li	2
Sanjay Subramanian	1
Yasmine Kotturi	1
Xiang Chen	1
Saelyne Yang	1
Trevor Darrell	1
Zachary C. Lipton	1
Siman Shen	1
Faria Huq	1
Frank Bentley	1
Jiaxin Ge	1
Zora Zhiruo Wang	1
Savvas Petridis	1
David Jurgens	1
Qiaozhu Mei	1
Paul Resnick	1
Yuhang Zhao	1
Caroline Zhang	1
Meredith Ringel Morris	1
Young‐Ho Kim	1
Yaxin Hu	1
Graham Neubig	1
Michael Terry	1
Danna Gurari	1
Rada Mihalcea	1
Fangyuan Xu	1
Amanda Swearngin	1
Siyan Wang	1
Diyi Yang	1
Tiffany Knearem	1
Amanda Swearngin	1
Hua Shen	1
Li Qiwei	1
Sushrita Rakshit	1
Chenglei Si	1
Yutong Xie	1
Joyce Chai	1
Reshmi Ghosh	1
Xuhui Zhou	1
Daniel Fried	1
K. Siva Krishna	1
Yachuan Liu	1
Hansika Murugu	1
Ziqiao Ma	1

Commonly Cited References

Action	Title	Year	Authors	# of times referenced
+	Say It All: Feedback for Improving Non-Visual Presentation Accessibility	2021	Yi-Hao Peng JiWoong Jang Jeffrey P. Bigham Amy Pavel	2
+ PDF Chat	Mudslide	2015	Elena L. Glassman Juho Kim Andrés Monroy‐Hernández Meredith Ringel Morris	2
+ PDF Chat	Modeling Mobile Interface Tappability Using Crowdsourcing and Deep Learning	2019	Amanda Swearngin Yang Li	2
+ PDF Chat	Detecting Twenty-Thousand Classes Using Image-Level Supervision	2022	Xingyi Zhou Rohit Girdhar Armand Joulin Philipp Krähenbühl Ishan Misra	2
+	Screen2Words: Automatic Mobile UI Summarization with Multimodal Learning	2021	Bryan Wang Gang Li Xin Zhou Zhourong Chen Tovi Grossman Yang Li	2
+ PDF Chat	DenseCap: Fully Convolutional Localization Networks for Dense Captioning	2016	Justin Johnson Andrej Karpathy Li Fei-Fei	2
+ PDF Chat	VoiceCoach: Interactive Evidence-based Training for Voice Modulation Skills in Public Speaking	2020	Xingbo Wang Haipeng Zeng Yong Wang Aoyu Wu Zhida Sun Xiaojuan Ma Huamin Qu	2
+ PDF Chat	VINS: Visual Search for Mobile User Interface Design	2021	Sara Bunian Kai Li Chaima Jemmali Casper Harteveld Yun Fu Magy Seif El‐Nasr	2
+ PDF Chat	Learning to Describe Differences Between Pairs of Similar Images	2018	Harsh Jhamtani Taylor Berg-Kirkpatrick	1
+ PDF Chat	FCOS: Fully Convolutional One-Stage Object Detection	2019	Zhi Tian Chunhua Shen Hao Chen Tong He	1
+ PDF Chat	Humanoid: A Deep Learning-Based Approach to Automated Black-box Android App Testing	2019	Yuanchun Li Ziyue Yang Yao Guo Xiangqun Chen	1
+	Interweaving Multimodal Interaction With Flexible Unit Visualizations for Data Exploration	2020	Arjun Srinivasan Bongshin Lee John Stasko	1
+ PDF Chat	InChorus: Designing Consistent Multimodal Interactions for Data Visualization on Tablet Devices	2020	Arjun Srinivasan Bongshin Lee Nathalie Henry Riche Steven M. Drucker Ken Hinckley	1
+ PDF Chat	GUIComp: A GUI Design Assistant with Real-Time, Multi-Faceted Feedback	2020	Chunggi Lee Sang-Hoon Kim Dongyun Han Hongjun Yang Young‐Woo Park Bum Chul Kwon Sungahn Ko	1
+ PDF Chat	Self-Training With Noisy Student Improves ImageNet Classification	2020	Qizhe Xie Minh-Thang Luong Eduard Hovy Quoc V. Le	1
+ PDF Chat	Generative adversarial networks	2020	Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville Yoshua Bengio	1
+ PDF Chat	Scout: Rapid Exploration of Interface Layout Alternatives through High-Level Design Constraints	2020	Amanda Swearngin Chenglong Wang Alannah Oleson James Fogarty Amy J. Ko	1
+	SSD: Single Shot MultiBox Detector	2016	Wei Liu Dragomir Anguelov Dumitru Erhan Christian Szegedy Scott Reed Cheng-Yang Fu Alexander C. Berg	1
+ PDF Chat	ActionBert: Leveraging User Actions for Semantic Understanding of User Interfaces	2021	Zecheng He Srinivas Sunkara Xiaoxue Zang Ying Xu Lijuan Liu Nevan Wichers Gabriel Schubiner Ruby Lee Jindong Chen	1
+ PDF Chat	It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports	2021	Nathan Cooper Carlos Bernal-Cárdenas Oscar Chaparro Kevin Moran Denys Poshyvanyk	1
+ PDF Chat	Data@Hand: Fostering Visual Exploration of Personal Data on Smartphones Leveraging Speech and Touch Interaction	2021	Young‐Ho Kim Bongshin Lee Arjun Srinivasan Eun Kyoung Choe	1
+	Screen Recognition: Creating Accessibility Metadata for Mobile Applications from Pixels	2021	Xiaoyi Zhang Lilian de Greef Amanda Swearngin Samuel White Kyle I. Murray Lisa Yu Qi Shan Jeffrey Nichols Jason Wu Chris Fleizach	1
+	Revamp: Enhancing Accessible Information Seeking Experience of Online Shopping for Blind or Low Vision Users	2021	Ruolin Wang Zixuan Chen Mingrui Ray Zhang Zhaoheng Li Zhixiu Liu Zihan Dang Chun Yu Xiang Chen	1
+ PDF Chat	Scaling Vision Transformers	2022	Xiaohua Zhai Alexander Kolesnikov Neil Houlsby Lucas Beyer	1
+ PDF Chat	Design Guidelines for Prompt Engineering Text-to-Image Generative Models	2022	Vivian Liu Lydia B. Chilton	1
+ PDF Chat	Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning	2022	Ali Furkan Biten Lluís Gómez Dìmosthenis Karatzas	1
+	Screen Parsing: Towards Reverse Engineering of UI Models from Screenshots	2021	Jason Wu Xiaoyi Zhang Jeffrey A. Nichols Jeffrey P. Bigham	1
+	Non-Visual Cooking: Exploring Practices and Challenges of Meal Preparation by People with Visual Impairments	2021	Franklin Mingzhe Li Jamie Dorst Peter Cederberg Patrick Carrington	1
+ PDF Chat	“It Feels Like Taking a Gamble”: Exploring Perceptions, Practices, and Challenges of Using Makeup and Cosmetics for People with Visual Impairments	2022	Franklin Mingzhe Li Franchesca Spektor Meng Xia Mina Huh Peter Cederberg Yuqi Gong Kristen Shinohara Patrick Carrington	1
+	Hierarchical Text-Conditional Image Generation with CLIP Latents	2022	Aditya Ramesh Prafulla Dhariwal Alex Nichol Casey Chu Mark Chen	1
+	PaLM: Scaling Language Modeling with Pathways	2022	Aakanksha Chowdhery Sharan Narang Jacob Devlin Maarten Bosma Gaurav Mishra Adam Roberts Paul Barham Hyung Won Chung Charles Sutton Sebastian Gehrmann	1
+ PDF Chat	Predicting and Explaining Mobile UI Tappability with Vision Modeling and Saliency Analysis	2022	Eldon Schoop Xin Zhou Gang Li Zhourong Chen Bjoern Hartmann Yang Li	1
+ PDF Chat	Learning to Denoise Raw Mobile UI Layouts for Improving Datasets at Scale	2022	Gang Li Gilles Baechler Manuel Tragut Yang Li	1
+ PDF Chat	Friendscope: Exploring In-the-Moment Experience Sharing on Camera Glasses via a Shared Camera	2022	Molly Jane Nicholas Brian A. Smith Rajan Vaish	1
+ PDF Chat	Artificial Intelligence for Human Computer Interaction: A Modern Approach	2021	Forrest Huang Eldon Schoop David Ha Jeffrey Nichols John Canny	1
+	GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models	2021	Alex Nichol Prafulla Dhariwal Aditya Ramesh Pranav Shyam Pamela Mishkin Bob McGrew Ilya Sutskever Mark Chen	1
+	BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation	2022	Junnan Li Dongxu Li Caiming Xiong Steven C. H. Hoi	1
+ PDF Chat	The Unboxing Experience: Exploration and Design of Initial Interactions Between Children and Social Robots	2022	Christine P. Lee Bengisu Çağıltay Bilge Mutlu	1
+	Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding	2022	Chitwan Saharia William Chan Saurabh Saxena Lala Li Jay Whang Emily Denton Seyed Kamyar Seyed Ghasemipour Burcu Karagol Ayan S. Sara Mahdavi Rapha Gontijo Lopes	1
+	CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers	2022	Wenyi Hong Ming Ding Wendi Zheng Xinghan Liu Jie Tang	1
+ PDF Chat	Translating Video Recordings of Complex Mobile App UI Gestures into Replayable Scenarios	2022	Carlos Bernal-Cárdenas Nathan Cooper Madeleine Havranek Kevin Moran Oscar Chaparro Denys Poshyvanyk Andrian Marcus	1
+	Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer	2019	Colin Raffel Noam Shazeer Adam Roberts Katherine Lee Sharan Narang Michael Matena Yanqi Zhou Wei Li Peter J. Liu	1
+	Language Models are Few-Shot Learners	2020	T. B. Brown Benjamin F. Mann Nick Ryder Melanie Subbiah Jared Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell	1
+ PDF Chat	You Watch, You Give, and You Engage	2018	Zhicong Lu Haijun Xia Seongkook Heo Daniel Wigdor	1
+ PDF Chat	Opal: Multimodal Image Generation for News Illustration	2022	Vivian Liu Han Qiao Lydia B. Chilton	1
+ PDF Chat	"I was Confused by It; It was Confused by Me:" Exploring the Experiences of People with Visual Impairments around Mobile Service Robots	2022	Prajna Bhat Yuhang Zhao	1
+	BLOOM: A 176B-Parameter Open-Access Multilingual Language Model	2022	Teven Le Scao Angela Fan Christopher Akiki Ellie Pavlick Suzana Ilić Daniel Hesslow Roman Castagné Alexandra Sasha Luccioni François Yvon Matthias Gallé	1
+ PDF Chat	Text2LIVE: Text-Driven Layered Image and Video Editing	2022	Omer Bar-Tal Dolev Ofri-Amar Rafail Fridman Yoni Kasten Tali Dekel	1
+ PDF Chat	High-Resolution Image Synthesis with Latent Diffusion Models	2022	Robin Rombach Andreas Blattmann Dominik Lorenz Patrick Esser Björn Ommer	1
+	Large-scale Text-to-Image Generation Models for Visual Artists’ Creative Works	2023	Hyung-Kwon Ko Gwanmo Park Hyeon Jeon Jaemin Jo Juho Kim Jinwook Seo	1