Yi-Hao Peng

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat AutoPresent: Designing Structured Visuals from Scratch 2025 Jiaxin Ge
Zora Zhiruo Wang
Xuhui Zhou
Yi-Hao Peng
Sanjay Subramanian
Qinyue Tan
Maarten Sap
Alane Suhr
Daniel Fried
Graham Neubig
+ PDF Chat UIClip: A Data-driven Model for Assessing User Interface Design 2024 Jason Wu
Yi-Hao Peng
Amanda Li
Amanda Swearngin
Jeffrey P. Bigham
Jeffrey Nichols
+ PDF Chat Long-Form Answers to Visual Questions from Blind and Low Vision People 2024 Mina Huh
Fangyuan Xu
Yi-Hao Peng
Chongyan Chen
Hansika Murugu
Danna Gurari
Eunsol Soul Choi
Amy Pavel
+ PDF Chat "This really lets us see the entire world:" Designing a conversational telepresence robot for homebound older adults 2024 Yaxin Hu
Laura Stegner
Yasmine Kotturi
Caroline Zhang
Yi-Hao Peng
Faria Huq
Yuhang Zhao
Jeffrey P. Bigham
Bilge Mutlu
+ PDF Chat Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions 2024 Hua Shen
Tiffany Knearem
Reshmi Ghosh
Kenan Alkiek
K. Siva Krishna
Yachuan Liu
Ziqiao Ma
Savvas Petridis
Yi-Hao Peng
Li Qiwei
+ PDF Chat UIClip: A Data-driven Model for Assessing User Interface Design 2024 Jason Wu
Yi-Hao Peng
Amanda Li
Amanda Swearngin
Jeffrey P. Bigham
Jeffrey Nichols
+ GenAssist: Making Image Generation Accessible 2023 Mina Huh
Yi-Hao Peng
Amy Pavel
+ AVscript: Accessible Video Editing with Audio-Visual Scripts 2023 Mina Huh
Saelyne Yang
Yi-Hao Peng
Xiang Chen
Young‐Ho Kim
Amy Pavel
+ WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics 2023 Jason Wu
Siyan Wang
Siman Shen
Yi-Hao Peng
Jeffrey Nichols
Jeffrey P. Bigham
+ GenAssist: Making Image Generation Accessible 2023 Mina Huh
Yi-Hao Peng
Amy Pavel
+ Say It All: Feedback for Improving Non-Visual Presentation Accessibility 2021 Yi-Hao Peng
JiWoong Jang
Jeffrey P. Bigham
Amy Pavel
+ Say It All: Feedback for Improving Non-Visual Presentation Accessibility 2021 Yi-Hao Peng
JiWoong Jang
Jeffrey P. Bigham
Amy Pavel
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ Say It All: Feedback for Improving Non-Visual Presentation Accessibility 2021 Yi-Hao Peng
JiWoong Jang
Jeffrey P. Bigham
Amy Pavel
2
+ PDF Chat Mudslide 2015 Elena L. Glassman
Juho Kim
AndrĂ©s Monroy‐HernĂĄndez
Meredith Ringel Morris
2
+ PDF Chat Modeling Mobile Interface Tappability Using Crowdsourcing and Deep Learning 2019 Amanda Swearngin
Yang Li
2
+ PDF Chat Detecting Twenty-Thousand Classes Using Image-Level Supervision 2022 Xingyi Zhou
Rohit Girdhar
Armand Joulin
Philipp KrĂ€henbĂŒhl
Ishan Misra
2
+ Screen2Words: Automatic Mobile UI Summarization with Multimodal Learning 2021 Bryan Wang
Gang Li
Xin Zhou
Zhourong Chen
Tovi Grossman
Yang Li
2
+ PDF Chat DenseCap: Fully Convolutional Localization Networks for Dense Captioning 2016 Justin Johnson
Andrej Karpathy
Li Fei-Fei
2
+ PDF Chat VoiceCoach: Interactive Evidence-based Training for Voice Modulation Skills in Public Speaking 2020 Xingbo Wang
Haipeng Zeng
Yong Wang
Aoyu Wu
Zhida Sun
Xiaojuan Ma
Huamin Qu
2
+ PDF Chat VINS: Visual Search for Mobile User Interface Design 2021 Sara Bunian
Kai Li
Chaima Jemmali
Casper Harteveld
Yun Fu
Magy Seif El‐Nasr
2
+ PDF Chat Learning to Describe Differences Between Pairs of Similar Images 2018 Harsh Jhamtani
Taylor Berg-Kirkpatrick
1
+ PDF Chat FCOS: Fully Convolutional One-Stage Object Detection 2019 Zhi Tian
Chunhua Shen
Hao Chen
Tong He
1
+ PDF Chat Humanoid: A Deep Learning-Based Approach to Automated Black-box Android App Testing 2019 Yuanchun Li
Ziyue Yang
Yao Guo
Xiangqun Chen
1
+ Interweaving Multimodal Interaction With Flexible Unit Visualizations for Data Exploration 2020 Arjun Srinivasan
Bongshin Lee
John Stasko
1
+ PDF Chat InChorus: Designing Consistent Multimodal Interactions for Data Visualization on Tablet Devices 2020 Arjun Srinivasan
Bongshin Lee
Nathalie Henry Riche
Steven M. Drucker
Ken Hinckley
1
+ PDF Chat GUIComp: A GUI Design Assistant with Real-Time, Multi-Faceted Feedback 2020 Chunggi Lee
Sang-Hoon Kim
Dongyun Han
Hongjun Yang
Young‐Woo Park
Bum Chul Kwon
Sungahn Ko
1
+ PDF Chat Self-Training With Noisy Student Improves ImageNet Classification 2020 Qizhe Xie
Minh-Thang Luong
Eduard Hovy
Quoc V. Le
1
+ PDF Chat Generative adversarial networks 2020 Ian Goodfellow
Jean Pouget-Abadie
Mehdi Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
1
+ PDF Chat Scout: Rapid Exploration of Interface Layout Alternatives through High-Level Design Constraints 2020 Amanda Swearngin
Chenglong Wang
Alannah Oleson
James Fogarty
Amy J. Ko
1
+ SSD: Single Shot MultiBox Detector 2016 Wei Liu
Dragomir Anguelov
Dumitru Erhan
Christian Szegedy
Scott Reed
Cheng-Yang Fu
Alexander C. Berg
1
+ PDF Chat ActionBert: Leveraging User Actions for Semantic Understanding of User Interfaces 2021 Zecheng He
Srinivas Sunkara
Xiaoxue Zang
Ying Xu
Lijuan Liu
Nevan Wichers
Gabriel Schubiner
Ruby Lee
Jindong Chen
1
+ PDF Chat It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports 2021 Nathan Cooper
Carlos Bernal-CĂĄrdenas
Oscar Chaparro
Kevin Moran
Denys Poshyvanyk
1
+ PDF Chat Data@Hand: Fostering Visual Exploration of Personal Data on Smartphones Leveraging Speech and Touch Interaction 2021 Young‐Ho Kim
Bongshin Lee
Arjun Srinivasan
Eun Kyoung Choe
1
+ Screen Recognition: Creating Accessibility Metadata for Mobile Applications from Pixels 2021 Xiaoyi Zhang
Lilian de Greef
Amanda Swearngin
Samuel White
Kyle I. Murray
Lisa Yu
Qi Shan
Jeffrey Nichols
Jason Wu
Chris Fleizach
1
+ Revamp: Enhancing Accessible Information Seeking Experience of Online Shopping for Blind or Low Vision Users 2021 Ruolin Wang
Zixuan Chen
Mingrui Ray Zhang
Zhaoheng Li
Zhixiu Liu
Zihan Dang
Chun Yu
Xiang Chen
1
+ PDF Chat Scaling Vision Transformers 2022 Xiaohua Zhai
Alexander Kolesnikov
Neil Houlsby
Lucas Beyer
1
+ PDF Chat Design Guidelines for Prompt Engineering Text-to-Image Generative Models 2022 Vivian Liu
Lydia B. Chilton
1
+ PDF Chat Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning 2022 Ali Furkan Biten
LluĂ­s GĂłmez
DĂŹmosthenis Karatzas
1
+ Screen Parsing: Towards Reverse Engineering of UI Models from Screenshots 2021 Jason Wu
Xiaoyi Zhang
Jeffrey A. Nichols
Jeffrey P. Bigham
1
+ Non-Visual Cooking: Exploring Practices and Challenges of Meal Preparation by People with Visual Impairments 2021 Franklin Mingzhe Li
Jamie Dorst
Peter Cederberg
Patrick Carrington
1
+ PDF Chat “It Feels Like Taking a Gamble”: Exploring Perceptions, Practices, and Challenges of Using Makeup and Cosmetics for People with Visual Impairments 2022 Franklin Mingzhe Li
Franchesca Spektor
Meng Xia
Mina Huh
Peter Cederberg
Yuqi Gong
Kristen Shinohara
Patrick Carrington
1
+ Hierarchical Text-Conditional Image Generation with CLIP Latents 2022 Aditya Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
1
+ PaLM: Scaling Language Modeling with Pathways 2022 Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
Paul Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
1
+ PDF Chat Predicting and Explaining Mobile UI Tappability with Vision Modeling and Saliency Analysis 2022 Eldon Schoop
Xin Zhou
Gang Li
Zhourong Chen
Bjoern Hartmann
Yang Li
1
+ PDF Chat Learning to Denoise Raw Mobile UI Layouts for Improving Datasets at Scale 2022 Gang Li
Gilles Baechler
Manuel Tragut
Yang Li
1
+ PDF Chat Friendscope: Exploring In-the-Moment Experience Sharing on Camera Glasses via a Shared Camera 2022 Molly Jane Nicholas
Brian A. Smith
Rajan Vaish
1
+ PDF Chat Artificial Intelligence for Human Computer Interaction: A Modern Approach 2021 Forrest Huang
Eldon Schoop
David Ha
Jeffrey Nichols
John Canny
1
+ GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models 2021 Alex Nichol
Prafulla Dhariwal
Aditya Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
1
+ BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation 2022 Junnan Li
Dongxu Li
Caiming Xiong
Steven C. H. Hoi
1
+ PDF Chat The Unboxing Experience: Exploration and Design of Initial Interactions Between Children and Social Robots 2022 Christine P. Lee
Bengisu Çağıltay
Bilge Mutlu
1
+ Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding 2022 Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. Sara Mahdavi
Rapha Gontijo Lopes
1
+ CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers 2022 Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
1
+ PDF Chat Translating Video Recordings of Complex Mobile App UI Gestures into Replayable Scenarios 2022 Carlos Bernal-CĂĄrdenas
Nathan Cooper
Madeleine Havranek
Kevin Moran
Oscar Chaparro
Denys Poshyvanyk
Andrian Marcus
1
+ Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer 2019 Colin Raffel
Noam Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
1
+ Language Models are Few-Shot Learners 2020 T. B. Brown
Benjamin F. Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
1
+ PDF Chat You Watch, You Give, and You Engage 2018 Zhicong Lu
Haijun Xia
Seongkook Heo
Daniel Wigdor
1
+ PDF Chat Opal: Multimodal Image Generation for News Illustration 2022 Vivian Liu
Han Qiao
Lydia B. Chilton
1
+ PDF Chat "I was Confused by It; It was Confused by Me:" Exploring the Experiences of People with Visual Impairments around Mobile Service Robots 2022 Prajna Bhat
Yuhang Zhao
1
+ BLOOM: A 176B-Parameter Open-Access Multilingual Language Model 2022 Teven Le Scao
Angela Fan
Christopher Akiki
Ellie Pavlick
Suzana Ilić
Daniel Hesslow
Roman Castagné
Alexandra Sasha Luccioni
François Yvon
Matthias Gallé
1
+ PDF Chat Text2LIVE: Text-Driven Layered Image and Video Editing 2022 Omer Bar-Tal
Dolev Ofri-Amar
Rafail Fridman
Yoni Kasten
Tali Dekel
1
+ PDF Chat High-Resolution Image Synthesis with Latent Diffusion Models 2022 Robin Rombach
Andreas Blattmann
Dominik Lorenz
Patrick Esser
Björn Ommer
1
+ Large-scale Text-to-Image Generation Models for Visual Artists’ Creative Works 2023 Hyung-Kwon Ko
Gwanmo Park
Hyeon Jeon
Jaemin Jo
Juho Kim
Jinwook Seo
1