Besmira Nushi

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation 2025 Siddharth Joshi
Besmira Nushi
Vidhisha Balachandran
Varun Chandrasekaran
Vibhav Vineet
Neel Joshi
Baharan Mirzasoleiman
+ Improving Instruction-Following in Language Models through Activation Steering 2024 Alessandro Stolfo
Vidhisha Balachandran
Safoora Yousefi
Eric Horvitz
Besmira Nushi
+ PDF Chat Attention Speaks Volumes: Localizing and Mitigating Bias in Language Models 2024 Rishabh Adiga
Besmira Nushi
Varun Chandrasekaran
+ PDF Chat BENCHAGENTS: Automated Benchmark Creation with Agent Interaction 2024 Nigar Azhar Butt
Varun Chandrasekaran
Neel Joshi
Besmira Nushi
Vidhisha Balachandran
+ PDF Chat Unearthing Skill-Level Insights for Understanding Trade-Offs of Foundation Models 2024 Mazda Moayeri
Vidhisha Balachandran
Varun Chandrasekaran
Safoora Yousefi
Thomas Fel
Soheil Feizi
Besmira Nushi
Neel Joshi
Vibhav Vineet
+ PDF Chat Improving Instruction-Following in Language Models through Activation Steering 2024 Alessandro Stolfo
Vidhisha Balachandran
Safoora Yousefi
Eric Horvitz
Besmira Nushi
+ PDF Chat Eureka: Evaluating and Understanding Large Foundation Models 2024 Vidhisha Balachandran
J. C. Chen
Neel Joshi
Besmira Nushi
Hamid Palangi
Eduardo Salinas
Vibhav Vineet
James Woffinden-Luey
Safoora Yousefi
+ PDF Chat Understanding Information Storage and Transfer in Multi-modal Large Language Models 2024 Samyadeep Basu
Martin Grayson
Cecily Morrison
Besmira Nushi
Soheil Feizi
Daniela Massiceti
+ PDF Chat Introducing v0.5 of the AI Safety Benchmark from MLCommons 2024 Bertie Vidgen
Adarsh Agrawal
Ahmed Mohamed Ahmed
Victor Akinwande
Namir Al-Nuaimi
Najla Alfaraj
Elie Alhajjar
Lora Aroyo
Trupti Bavalatti
Borhane Blili-Hamelin
+ PDF Chat Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models 2024 Sebastian Bordt
Harsha Nori
Vanessa Mello Rodrigues
Besmira Nushi
Rich Caruana
+ Social Biases through the Text-to-Image Generation Lens 2023 Ranjita Naik
Besmira Nushi
+ Advancing Human-AI Complementarity: The Impact of User Expertise and Algorithmic Tuning on Joint Decision Making 2023 Kori Inkpen
Shreya Chappidi
Keri Mallari
Besmira Nushi
Divya Ramesh
Pietro Michelucci
Vani Mandava
Libuše Hannah Vepřek
Gabrielle Quinn
+ Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning 2023 Yu Yang
Besmira Nushi
Hamid Palangi
Baharan Mirzasoleiman
+ Social Biases through the Text-to-Image Generation Lens 2023 Ranjita Naik
Besmira Nushi
+ Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models 2023 Mert Yüksekgönül
Varun Chandrasekaran
Erik Jones
Suriya Gunasekar
Ranjita Naik
Hamid Palangi
Ece Kamar
Besmira Nushi
+ Diversity of Thought Improves Reasoning Abilities of Large Language Models 2023 Ranjita Naik
Varun Chandrasekaran
Mert Yüksekgönül
Hamid Palangi
Besmira Nushi
+ KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval 2023 Marah Abdin
Suriya Gunasekar
Varun Chandrasekaran
Jerry Li
Mert Yüksekgönül
Rahee Ghosh Peshawaria
Ranjita Naik
Besmira Nushi
+ Investigations of Performance and Bias in Human-AI Teamwork in Hiring 2022 Andi Peng
Besmira Nushi
Emre Kıcıman
Kori Inkpen
Ece Kamar
+ PDF Chat Who Goes First? Influences of Human-AI Workflow on Decision Making in Clinical Imaging 2022 Riccardo Fogliato
Shreya Chappidi
Matthew P. Lungren
Paul Fisher
Diane Wilson
Michael Fitzke
Mark Parkinson
Eric Horvitz
Kori Inkpen
Besmira Nushi
+ Who Goes First? Influences of Human-AI Workflow on Decision Making in Clinical Imaging 2022 Riccardo Fogliato
Shreya Chappidi
Matthew P. Lungren
Michael Fitzke
Mark Parkinson
Diane Wilson
Paul B. Fisher
Eric Horvitz
Kori Inkpen
Besmira Nushi
+ Advancing Human-AI Complementarity: The Impact of User Expertise and Algorithmic Tuning on Joint Decision Making 2022 Kori Inkpen
Shreya Chappidi
Keri Mallari
Besmira Nushi
Divya Ramesh
Pietro Michelucci
Vani Mandava
Libuše Hannah Vepřek
Gabrielle Quinn
+ Investigations of Performance and Bias in Human-AI Teamwork in Hiring 2022 Andi Peng
Besmira Nushi
Emre Kıcıman
Kori Inkpen
Ece Kamar
+ Benchmarking Spatial Relationships in Text-to-Image Generation 2022 Tejas Gokhale
Hamid Palangi
Besmira Nushi
Vibhav Vineet
Eric Horvitz
Ece Kamar
Chitta Baral
Yezhou Yang
+ Hierarchical Analysis of Visual COVID-19 Features from Chest Radiographs 2021 Shruthi Bannur
Ozan Oktay
Mélanie Bernhardt
Anton Schwaighofer
R. Jena
Besmira Nushi
Sharan Wadhwani
Aditya V. Nori
K. Natarajan
Shazad Q. Ashraf
+ PDF Chat Understanding Failures of Deep Networks via Robust Feature Extraction 2021 Sahil Singla
Besmira Nushi
Shital Shah
Ece Kamar
Eric Horvitz
+ PDF Chat Is the Most Accurate AI the Best Teammate? Optimizing AI for Teamwork 2021 Gagan Bansal
Besmira Nushi
Ece Kamar
Eric Horvitz
Daniel S. Weld
+ PDF Chat Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance 2021 Gagan Bansal
Tongshuang Wu
Joyce Zhou
Raymond Fok
Besmira Nushi
Ece Kamar
Marco Túlio Ribeiro
Daniel S. Weld
+ Hierarchical Analysis of Visual COVID-19 Features from Chest Radiographs 2021 Shruthi Bannur
Ozan Oktay
Mélanie Bernhardt
Anton Schwaighofer
R. Jena
Besmira Nushi
Sharan Wadhwani
Aditya Nori
K. Natarajan
Shazad Q. Ashraf
+ Understanding Failures of Deep Networks via Robust Feature Extraction 2020 Sahil Singla
Besmira Nushi
Shital Shah
Ece Kamar
Eric Horvitz
+ PDF Chat An Empirical Analysis of Backward Compatibility in Machine Learning Systems 2020 Megha Srivastava
Besmira Nushi
Ece Kamar
Shital Shah
Eric Horvitz
+ An Empirical Analysis of Backward Compatibility in Machine Learning Systems 2020 Megha Srivastava
Besmira Nushi
Ece Kamar
Shital Shah
Eric Horvitz
+ Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance 2020 Gagan Bansal
Tongshuang Wu
Joyce Zhou
Raymond Fok
Besmira Nushi
Ece Kamar
Marco Túlio Ribeiro
Daniel S. Weld
+ PDF Chat SQuINTing at VQA Models: Introspecting VQA Models With Sub-Questions 2020 Ramprasaath R. Selvaraju
Purva Tendulkar
Devi Parikh
Eric Horvitz
Marco Túlio Ribeiro
Besmira Nushi
Ece Kamar
+ Optimizing AI for Teamwork. 2020 Gagan Bansal
Besmira Nushi
Ece Kamar
Eric Horvitz
Daniel S. Weld
+ PDF Chat Metareasoning in Modular Software Systems: On-the-Fly Configuration Using Reinforcement Learning with Rich Contextual Representations 2020 Aditya Modi
Debadeepta Dey
Alekh Agarwal
Adith Swaminathan
Besmira Nushi
Sean Andrist
Eric Horvitz
+ SQuINTing at VQA Models: Introspecting VQA Models with Sub-Questions 2020 Ramprasaath R. Selvaraju
Purva Tendulkar
Devi Parikh
Eric Horvitz
Marco Ribeiro
Besmira Nushi
Ece Kamar
+ An Empirical Analysis of Backward Compatibility in Machine Learning Systems 2020 Megha Srivastava
Besmira Nushi
Ece Kamar
Shital Shah
Eric Horvitz
+ Understanding Failures of Deep Networks via Robust Feature Extraction 2020 Sahil Singla
Besmira Nushi
Shital Shah
Ece Kamar
Eric Horvitz
+ Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance 2020 Gagan Bansal
Tongshuang Wu
Joyce Zhou
Raymond Fok
Besmira Nushi
Ece Kamar
Marco Túlio Ribeiro
Daniel S. Weld
+ SQuINTing at VQA Models: Introspecting VQA Models with Sub-Questions 2020 Ramprasaath R. Selvaraju
Purva Tendulkar
Devi Parikh
Eric Horvitz
Marco Ribeiro
Besmira Nushi
Ece Kamar
+ Is the Most Accurate AI the Best Teammate? Optimizing AI for Teamwork 2020 Gagan Bansal
Besmira Nushi
Ece Kamar
Eric Horvitz
Daniel S. Weld
+ PDF Chat What You See Is What You Get? The Impact of Representation Criteria on Human Bias in Hiring 2019 Andi Peng
Besmira Nushi
Emre Kıcıman
Kori Inkpen
Siddharth Suri
Ece Kamar
+ What You See Is What You Get? The Impact of Representation Criteria on Human Bias in Hiring 2019 Andi Peng
Besmira Nushi
Emre Kıcıman
Kori Inkpen
Siddharth Suri
Ece Kamar
+ Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations 2019 Aditya Modi
Debadeepta Dey
Alekh Agarwal
Adith Swaminathan
Besmira Nushi
Sean Andrist
Eric Horvitz
+ A Case for Backward Compatibility for Human-AI Teams 2019 Gagan Bansal
Besmira Nushi
Ece Kamar
Daniel S. Weld
Walter S. Lasecki
Eric Horvitz
+ What You See Is What You Get? The Impact of Representation Criteria on Human Bias in Hiring 2019 Andi Peng
Besmira Nushi
Emre Kıcıman
Kori Inkpen
Siddharth Suri
Ece Kamar
+ Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations 2019 Aditya Modi
Debadeepta Dey
Alekh Agarwal
Adith Swaminathan
Besmira Nushi
Sean Andrist
Eric Horvitz
+ PDF Chat Characterizing the Internet Research Agency’s Social Media Operations During the 2016 U.S. Presidential Election using Linguistic Analyses 2018 Ryan L. Boyd
Alexander Spangher
Adam Fourney
Besmira Nushi
Gireeja Ranade
James W. Pennebaker
Eric Horvitz
+ PDF Chat Towards Accountable AI: Hybrid Human-Machine Analyses for Characterizing System Failure 2018 Besmira Nushi
Ece Kamar
Eric Horvitz
+ Analysis of Strategy and Spread of Russia-sponsored Content in the US in 2017 2018 Alexander Spangher
Gireeja Ranade
Besmira Nushi
Adam Fourney
Eric Horvitz
+ Towards Accountable AI: Hybrid Human-Machine Analyses for Characterizing System Failure 2018 Besmira Nushi
Ece Kamar
Eric Horvitz
+ PDF Chat On Human Intellect and Machine Failures: Troubleshooting Integrative Machine Learning Systems 2017 Besmira Nushi
Ece Kamar
Eric Horvitz
Donald Kossmann
+ On Human Intellect and Machine Failures: Troubleshooting Integrative Machine Learning Systems 2016 Besmira Nushi
Ece Kamar
Eric Horvitz
Donald Kossmann
+ Fault-Tolerant Entity Resolution with the Crowd. 2015 Anja Gruenheid
Besmira Nushi
Tim Kraska
Wolfgang Gatterbauer
Donald Kossmann
+ PDF Chat Crowd Access Path Optimization: Diversity Matters 2015 Besmira Nushi
Adish Singla
Anja Gruenheid
Erfan Zamanian
Andreas Krause
Donald Kossmann
+ Fault-Tolerant Entity Resolution with the Crowd 2015 Anja Gruenheid
Besmira Nushi
Tim Kraska
Wolfgang Gatterbauer
Donald Kossmann
+ Crowd Access Path Optimization: Diversity Matters 2015 Besmira Nushi
Adish Singla
Anja Gruenheid
Erfan Zamanian
Andreas Krause
Donald Kossmann
+ CrowdSTAR: A Social Task Routing Framework for Online Communities 2014 Besmira Nushi
Omar Alonso
Martin Hentschel
Vasileios Kandylas
+ CrowdSTAR: A Social Task Routing Framework for Online Communities 2014 Besmira Nushi
Ómar Alonso
Martin Hentschel
Vasileios Kandylas
+ Uncertain Time-Series Similarity: Return to the Basics 2012 Michele Dallachiesa
Besmira Nushi
Кацярына Мирыленка
Themis Palpanas
+ Uncertain Time-Series Similarity: Return to the Basics 2012 Michele Dallachiesa
Besmira Nushi
Кацярына Мирыленка
Themis Palpanas
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
5
+ PDF Chat Effect of confidence and explanation on accuracy and trust calibration in AI-assisted decision making 2020 Yunfeng Zhang
Q. Vera Liao
Rachel Bellamy
5
+ PDF Chat Manifold: A Model-Agnostic Framework for Interpretation and Diagnosis of Machine Learning Models 2018 Jiawei Zhang
Yang Wang
Piero Molino
Lezhi Li
David S. Ebert
4
+ Learning to Complement Humans 2020 Bryan Wilder
Eric Horvitz
Ece Kamar
4
+ Towards Deep Learning Models Resistant to Adversarial Attacks. 2018 Aleksander Mądry
Aleksandar Makelov
Ludwig Schmidt
Dimitris Tsipras
Adrian Vladu
4
+ PDF Chat On Human Intellect and Machine Failures: Troubleshooting Integrative Machine Learning Systems 2017 Besmira Nushi
Ece Kamar
Eric Horvitz
Donald Kossmann
4
+ PDF Chat The challenge of crafting intelligible intelligence 2019 Daniel S. Weld
Gagan Bansal
4
+ PDF Chat Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance 2021 Gagan Bansal
Tongshuang Wu
Joyce Zhou
Raymond Fok
Besmira Nushi
Ece Kamar
Marco Túlio Ribeiro
Daniel S. Weld
3
+ PDF Chat Proxy tasks and subjective measures can be misleading in evaluating explainable AI systems 2020 Zana Buçinca
Phoebe Lin
Krzysztof Z. Gajos
Elena L. Glassman
3
+ Manipulating and Measuring Model Interpretability 2018 Forough Poursabzi-Sangdeh
Daniel G. Goldstein
Jake M. Hofman
Jennifer Wortman Vaughan
Hanna Wallach
3
+ Interpretable and Explorable Approximations of Black Box Models 2017 Himabindu Lakkaraju
Ece Kamar
Rich Caruana
Jure Leskovec
3
+ A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks 2016 Dan Hendrycks
Kevin Gimpel
3
+ PDF Chat Manipulating and Measuring Model Interpretability 2021 Forough Poursabzi-Sangdeh
Daniel G. Goldstein
Jake M. Hofman
Jennifer Wortman Vaughan
Hanna Wallach
3
+ PDF Chat Discrimination in the Age of Algorithms 2019 Jon Kleinberg
Jens Ludwig
Sendhil Mullainathan
Cass R. Sunstein
3
+ PDF Chat Towards Accountable AI: Hybrid Human-Machine Analyses for Characterizing System Failure 2018 Besmira Nushi
Ece Kamar
Eric Horvitz
3
+ PDF Chat The mythos of model interpretability 2018 Zachary C. Lipton
2
+ Explaining Classifiers with Causal Concept Effect (CaCE) 2019 Yash Goyal
Amir Feder
Uri Shalit
Been Kim
2
+ Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps 2013 Karen Simonyan
Andrea Vedaldi
Andrew Zisserman
2
+ Quantifying Interpretability and Trust in Machine Learning Systems 2019 Philipp Schmidt
Felix Bießmann
2
+ PDF Chat From captions to visual concepts and back 2015 Hao Fang
Saurabh Gupta
Forrest Iandola
Rupesh K. Srivastava
Li Deng
Piotr Dollár
Jianfeng Gao
Xiaodong He
Margaret Mitchell
John Platt
2
+ PDF Chat On Human Predictions with Explanations and Predictions of Machine Learning Models 2019 Vivian Lai
Chenhao Tan
2
+ PDF Chat Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization 2017 Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
2
+ PDF Chat Do explanations make VQA models more predictable to a human? 2018 Arjun Chandrasekaran
Viraj Prabhu
Deshraj Jain
Prithvijit Chattopadhyay
Devi Parikh
2
+ Deep Anomaly Detection with Outlier Exposure 2018 Dan Hendrycks
Mantas Mazeika
Thomas G. Dietterich
2
+ Pythia v0.1: the Winning Entry to the VQA Challenge 2018 2018 Yu Jiang
Vivek Natarajan
Xinlei Chen
Marcus Rohrbach
Dhruv Batra
Devi Parikh
2
+ Eliciting and Enforcing Subjective Individual Fairness. 2019 Christopher Jung
Michael Kearns
Seth Neel
Aaron Roth
Logan Stapleton
Zhiwei Steven Wu
2
+ PDF Chat VQA: Visual Question Answering 2015 Stanislaw Antol
Aishwarya Agrawal
Jiasen Lu
Margaret Mitchell
Dhruv Batra
C. Lawrence Zitnick
Devi Parikh
2
+ A Human-Grounded Evaluation of SHAP for Alert Processing. 2019 Hilde Weerts
Werner van Ipenburg
Mykola Pechenizkiy
2
+ PDF Chat CIDEr: Consensus-based image description evaluation 2015 Ramakrishna Vedantam
C. Lawrence Zitnick
Devi Parikh
2
+ PDF Chat Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints 2017 Jieyu Zhao
Tianlu Wang
Mark Yatskar
Vicente Ordóñez
Kai-Wei Chang
2
+ A Contextual Bandit Bake-off 2018 Alberto Bietti
Alekh Agarwal
John Langford
2
+ PDF Chat Interpreting Recurrent and Attention-Based Neural Models: a Case Study on Natural Language Inference 2018 Reza Ghaeini
Xiaoli Z. Fern
Prasad Tadepalli
2
+ Fairness and Accountability Design Needs for Algorithmic Support in High-Stakes Public Sector Decision-Making 2018 Michael Veale
Max Van Kleek
Reuben Binns
2
+ Understanding Neural Networks Through Deep Visualization 2015 Jason Yosinski
Jeff Clune
Anh Mai Nguyen
Thomas J. Fuchs
Hod Lipson
2
+ PDF Chat 'It's Reducing a Human Being to a Percentage': Perceptions of Justice in Algorithmic Decisions 2018 Reuben Binns
Max Van Kleek
Michael Veale
Ulrik Lyngs
Jun Zhao
Nigel Shadbolt
2
+ PDF Chat Visualizing Deep Convolutional Neural Networks Using Natural Pre-images 2016 Aravindh Mahendran
Andrea Vedaldi
2
+ PDF Chat Incentivizing High Quality Crowdwork 2015 Chien-Ju Ho
Aleksandrs Slivkins
Siddharth Suri
Jennifer Wortman Vaughan
2
+ An Efficient Bandit Algorithm for Realtime Multivariate Optimization 2017 Daniel Hill
Houssam Nassif
Yi Liu
Anand Iyer
S. V. N. Vishwanathan
2
+ On Calibration of Modern Neural Networks 2017 Chuan Guo
Geoff Pleiss
Yu Sun
Kilian Q. Weinberger
2
+ PDF Chat Learning Attitudes and Attributes from Multi-aspect Reviews 2012 Julian McAuley
Jure Leskovec
Dan Jurafsky
2
+ PDF Chat Using meta-heuristics and machine learning for software optimization of parallel computing systems: a systematic literature review 2018 Suejb Memeti
Sabri Pllana
Alécio Pedro Delazari Binotto
Joanna Kołodziej
Ivona Brandić
2
+ Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks 2016 Anh Mai Nguyen
Jason Yosinski
Jeff Clune
2
+ Exploring models and data for image question answering 2015 Mengye Ren
Ryan Kiros
Richard S. Zemel
2
+ Women also Snowboard: Overcoming Bias in Captioning Models 2018 Kaylee Burns
Lisa Anne Hendricks
Kate Saenko
Trevor Darrell
Anna Rohrbach
2
+ Understanding disentangling in $β$-VAE 2018 Christopher Burgess
Irina Higgins
Arka Pal
Löıc Matthey
Nick Watters
Guillaume Desjardins
Alexander Lerchner
2
+ Why Interpretability in Machine Learning? An Answer Using Distributed Detection and Data Fusion Theory. 2018 Kush R. Varshney
Prashant Khanduri
Pranay Sharma
Shan Zhang
Pramod K. Varshney
2
+ PDF Chat Measuring Catastrophic Forgetting in Neural Networks 2018 Ronald Kemker
Marc McClure
Angelina Abitino
Tyler L. Hayes
Christopher Kanan
2
+ Axiomatic Attribution for Deep Networks 2017 Mukund Sundararajan
Ankur Taly
Qiqi Yan
2
+ Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations 2018 Francesco Locatello
Stefan Bauer
Mario Lučić
Gunnar Rätsch
Sylvain Gelly
Bernhard Schölkopf
Olivier Bachem
2
+ The Isotonic Regression Problem and its Dual 1972 Richard E. Barlow
H. D. Brunk
2