Projects
Reading
People
Chat
SU\G
(𝔸)
/K·U
Projects
Reading
People
Chat
Sign Up
Light
Dark
System
Sameep Mehta
Follow
Share
Generating author description...
All published works
Action
Title
Year
Authors
+
PDF
Chat
LLMGuard: Guarding against Unsafe LLM Behavior
2024
Shubh Goyal
Medha Hira
Shubham Kumar Mishra
Sukriti Goyal
Arnav Goel
Niharika Dadu
Kirushikesh DB
Sameep Mehta
Nishtha Madaan
+
PDF
Chat
xLP: Explainable Link Prediction for Master Data Management
2024
Balaji Ganesan
Matheen Ahmed Pasha
Srinivasa Parkala
Neeraj R Singh
Gayatri Mishra
Sumit Bhatia
Hima Patel
Somashekar Naganna
Sameep Mehta
+
PDF
Chat
LLMGuard: Guarding Against Unsafe LLM Behavior
2024
Shubh Goyal
Medha Hira
Shubham Kumar Mishra
Sukriti Goyal
Arnav Goel
Niharika Dadu
Kirushikesh DB
Sameep Mehta
Nishtha Madaan
+
CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation
2023
Rahul Madhavan
Rishabh Garg
Kahini Wadhawan
Sameep Mehta
+
PDF
Chat
CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation
2023
Rahul Madhavan
Rishabh Garg
Kahini Wadhawan
Sameep Mehta
+
"Beware of deception": Detecting Half-Truth and Debunking it through Controlled Claim Editing
2023
Sandeep Singamsetty
Nishtha Madaan
Sameep Mehta
Varad Bhatnagar
Pushpak Bhattacharyya
+
Data Quality Toolkit: Automatic assessment of data quality and remediation for machine learning datasets
2021
Nitin Gupta
Hima Patel
Shazia Afzal
Naveen Panwar
Ruhi Sharma Mittal
Shanmukha Guttula
Abhinav Jain
Lokesh Nagalapatti
Sameep Mehta
Sandeep Hans
+
Explainable Link Prediction for Privacy-Preserving Contact Tracing.
2020
Balaji Ganesan
Hima Patel
Sameep Mehta
+
Data Readiness Report.
2020
Shazia Afzal
C Rajmohan
Manish Kesarwani
Sameep Mehta
Hima Patel
+
Fair Transfer of Multiple Style Attributes in Text
2020
Karan Dabas
Nishtha Madan
Vijay Arya
Sameep Mehta
Gautam B. Singh
Tanmoy Chakraborty
+
Explainable Link Prediction for Privacy-Preserving Contact Tracing
2020
Balaji Ganesan
Hima Patel
Sameep Mehta
+
Fair Transfer of Multiple Style Attributes in Text
2020
Karan Dabas
Nishtha Madan
Vijay Arya
Sameep Mehta
Gautam B. Singh
Tanmoy Chakraborty
+
Data Readiness Report
2020
Shazia Afzal
C Rajmohan
Manish Kesarwani
Sameep Mehta
Hima Patel
+
PDF
Chat
Fair Transfer of Multiple Style Attributes in Text
2019
Karan Dabas
Nishtha Madaan
Vijay Arya
Sameep Mehta
Tanmoy Chakraborty
Gautam B. Singh
+
PDF
Chat
FactSheets: Increasing trust in AI services through supplier's declarations of conformity
2019
Matthew Arnold
Rachel Bellamy
Michael Hind
Stephanie Houde
Sameep Mehta
Aleksandra Mojsilović
Ravi Nair
Karthikeyan Natesan Ramamurthy
Adriana Olteanu
David Piorkowski
+
PDF
Chat
Hardening Deep Neural Networks via Adversarial Model Cascades
2019
Deepak Vijaykeerthy
Anshuman Suri
Sameep Mehta
Ponnurangam Kumaraguru
+
PDF
Chat
On Efficiently Processing Workflow Provenance Queries in Spark
2019
C Rajmohan
Pranay Lohia
Himanshu Gupta
Siddhartha Brahma
Mauricio A. Hernández
Sameep Mehta
+
PDF
Chat
Ownership Preserving AI Market Places Using Blockchain
2019
Nishant Baranwal Somy
Kalapriya Kannan
Vijay Arya
Sandeep Hans
Abhishek Singh
Pranay Lohia
Sameep Mehta
+
PDF
Chat
Model Extraction Warning in MLaaS Paradigm
2018
Manish Kesarwani
Bhaskar Mukhoty
Vijay Arya
Sameep Mehta
+
What is my data worth? From data properties to data value.
2018
Kalapriya Kannan
Rema Ananthanarayanan
Sameep Mehta
+
AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias
2018
Rachel Bellamy
Kuntal Dey
Michael Hind
Samuel C. Hoffman
Stephanie Houde
Kalapriya Kannan
Pranay Lohia
Jacquelyn Martino
Sameep Mehta
Aleksandra Mojsilović
+
Increasing Trust in AI Services through Supplier's Declarations of Conformity
2018
Michael Hind
Sameep Mehta
Aleksandra Mojsilović
Ravi Nair
Karthikeyan Natesan Ramamurthy
Alexandra Olteanu
Kush R. Varshney
+
FactSheets: Increasing Trust in AI Services through Supplier's Declarations of Conformity
2018
Matthew Arnold
Rachel Bellamy
Michael Hind
Stephanie Houde
Sameep Mehta
Aleksandra Mojsilović
Ravi Nair
Karthikeyan Natesan Ramamurthy
Darrell Reimer
Alexandra Olteanu
+
PDF
Chat
Secure k-NN as a Service over Encrypted Data in Multi-User Setting
2018
Gagandeep Singh
Akshar Kaul
Sameep Mehta
+
Hardening Deep Neural Networks via Adversarial Model Cascades
2018
Deepak Vijaykeerthy
Anshuman Suri
Sameep Mehta
Ponnurangam Kumaraguru
+
Secure k-NN as a Service Over Encrypted Data in Multi-User Setting
2018
Gagandeep Singh
Akshar Kaul
Sameep Mehta
+
Generating Clues for Gender based Occupation De-biasing in Text
2018
Nishtha Madaan
Gautam B. Singh
Sameep Mehta
Aditya Chetan
Brihi Joshi
+
Judging a Book by its Description : Analyzing Gender Stereotypes in the Man Bookers Prize Winning Fiction
2018
Nishtha Madaan
Sameep Mehta
Shravika Mittal
Ashima Suvarna
+
Efficiently Processing Workflow Provenance Queries on SPARK
2018
C Rajmohan
Pranay Lohia
Himanshu Gupta
Siddhartha Brahma
Mauricio A. Hernández
Sameep Mehta
+
Extracting Fairness Policies from Legal Documents
2018
Rashmi Nagpal
Chetna Wadhwa
Mallika Gupta
Samiulla Shaikh
Sameep Mehta
Vikram Goyal
+
What is my data worth? From data properties to data value
2018
Kalapriya Kannan
Rema Ananthanarayanan
Sameep Mehta
+
AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias
2018
Rachel Bellamy
Kuntal Dey
Michael Hind
Samuel C. Hoffman
Stephanie Houde
Kalapriya Kannan
Pranay Lohia
Jacquelyn Martino
Sameep Mehta
Aleksandra Mojsilovic
+
FactSheets: Increasing Trust in AI Services through Supplier's Declarations of Conformity
2018
Matthew Arnold
Rachel Bellamy
Michael Hind
Stephanie Houde
Sameep Mehta
Aleksandra Mojsilovic
Ravi Nair
Karthikeyan Natesan Ramamurthy
Darrell Reimer
Alexandra Olteanu
+
Secure k-NN as a Service Over Encrypted Data in Multi-User Setting
2018
Gagandeep Singh
Akshar Kaul
Sameep Mehta
+
Hardening Deep Neural Networks via Adversarial Model Cascades
2018
Deepak Vijaykeerthy
Anshuman Suri
Sameep Mehta
Ponnurangam Kumaraguru
+
Model Extraction Warning in MLaaS Paradigm
2017
Manish Kesarwani
Bhaskar Mukhoty
Vijay Arya
Sameep Mehta
+
Towards Crafting Text Adversarial Samples
2017
Suranjana Samanta
Sameep Mehta
+
A Survey on Resilient Machine Learning
2017
Atul Kumar
Sameep Mehta
+
Bollywood Movie Corpus for Text, Images and Videos
2017
Nishtha Madaan
Sameep Mehta
Mayank Saxena
Aditi Aggarwal
Taneea S Agrawaal
Vrinda Malhotra
+
Analyzing Gender Stereotyping in Bollywood Movies
2017
Nishtha Madaan
Sameep Mehta
Taneea S Agrawaal
Vrinda Malhotra
Aditi Aggarwal
Mayank Saxena
+
An End-To-End Machine Learning Pipeline That Ensures Fairness Policies
2017
Samiulla Shaikh
Harit Vishwakarma
Sameep Mehta
Kush R. Varshney
Karthikeyan Natesan Ramamurthy
Dennis Wei
+
Model Extraction Warning in MLaaS Paradigm
2017
Manish Kesarwani
Bhaskar Mukhoty
Vijay Arya
Sameep Mehta
+
From Tweets to Events: Exploring a Scalable Solution for Twitter Streams.
2014
Shamanth Kumar
Huan Liu
Sameep Mehta
L. Venkata Subramaniam
+
From Tweets to Events: Exploring a Scalable Solution for Twitter Streams
2014
Shamanth Kumar
Huan Liu
Sameep Mehta
L. Venkata Subramaniam
Common Coauthors
Coauthor
Papers Together
Nishtha Madaan
8
Vijay Arya
7
Karthikeyan Natesan Ramamurthy
7
Kush R. Varshney
7
Michael Hind
6
Pranay Lohia
6
Kalapriya Kannan
5
Manish Kesarwani
5
Rachel Bellamy
5
Stephanie Houde
5
Hima Patel
5
C Rajmohan
4
Gautam B. Singh
4
Aleksandra Mojsilović
4
Shazia Afzal
3
Deepak Vijaykeerthy
3
Anshuman Suri
3
Akshar Kaul
3
Karan Dabas
3
David Piorkowski
3
Ravi Nair
3
Alexandra Olteanu
3
Bhaskar Mukhoty
3
Diptikalyan Saha
3
Jason Tsay
3
Balaji Ganesan
3
Tanmoy Chakraborty
3
Sandeep Hans
2
Matthew Arnold
2
Yunfeng Zhang
2
Rema Ananthanarayanan
2
Aditi Aggarwal
2
Niharika Dadu
2
Kuntal Dey
2
Kirushikesh DB
2
Huan Liu
2
Arnav Goel
2
Mayank Saxena
2
Rahul Madhavan
2
John T. Richards
2
Aleksandra Mojsilovic
2
Vrinda Malhotra
2
Taneea S Agrawaal
2
Ponnurangam Kumaraguru
2
Darrell Reimer
2
Kahini Wadhawan
2
Samiulla Shaikh
2
Prasanna Sattigeri
2
Siddhartha Brahma
2
Nishtha Madan
2
Commonly Cited References
Action
Title
Year
Authors
# of times referenced
+
PDF
Chat
Certifying and Removing Disparate Impact
2015
Michael Feldman
Sorelle A. Friedler
John Moeller
Carlos Scheidegger
Suresh Venkatasubramanian
4
+
Efficient Estimation of Word Representations in Vector Space
2013
Tomáš Mikolov
Kai Chen
Greg S. Corrado
Jay B. Dean
3
+
PDF
Chat
Red Teaming Language Models with Language Models
2022
Ethan Perez
Saffron Huang
Francis Song
Trevor Cai
Roman Ring
John Aslanides
Amelia Glaese
Nat McAleese
Geoffrey Irving
3
+
Evasion Attacks against Machine Learning at Test Time
2013
Battista Biggio
Igino Corona
Davide Maiorca
Blaine Nelson
Nedim Šrndić
Pavel Laskov
Giorgio Giacinto
Fabio Roli
3
+
Certified Defenses against Adversarial Examples
2018
Aditi Raghunathan
Jacob Steinhardt
Percy Liang
3
+
Explaining and Harnessing Adversarial Examples
2014
Ian Goodfellow
Jonathon Shlens
Christian Szegedy
3
+
PDF
Chat
FactSheets: Increasing trust in AI services through supplier's declarations of conformity
2019
Matthew Arnold
Rachel Bellamy
Michael Hind
Stephanie Houde
Sameep Mehta
Aleksandra Mojsilović
Ravi Nair
Karthikeyan Natesan Ramamurthy
Adriana Olteanu
David Piorkowski
3
+
PDF
Chat
Hacking smart machines with smarter ones: How to extract meaningful data from machine learning classifiers
2015
Giuseppe Ateniese
Luigi V. Mancini
Angelo Spognardi
Antonio Villani
Domenico Vitali
Giovanni Felici
3
+
PaLM: Scaling Language Modeling with Pathways
2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
Paul Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
3
+
PDF
Chat
Practical Black-Box Attacks against Machine Learning
2017
Nicolas Papernot
Patrick McDaniel
Ian Goodfellow
Somesh Jha
Z. Berkay Celik
Ananthram Swami
3
+
PDF
Chat
The Limitations of Deep Learning in Adversarial Settings
2016
Nicolas Papernot
Patrick McDaniel
Somesh Jha
Matt Fredrikson
Z. Berkay Celik
Ananthram Swami
3
+
HateBERT: Retraining BERT for Abusive Language Detection in English
2020
Tommaso Caselli
Valerio Basile
Jelena Mitrović
Michael Granitzer
2
+
PDF
Chat
Exploring Controllable Text Generation Techniques
2020
Shrimai Prabhumoye
Alan W. Black
Ruslan Salakhutdinov
2
+
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models
2020
Samuel Gehman
Suchin Gururangan
Maarten Sap
Yejin Choi
Noah A. Smith
2
+
Social Biases in NLP Models as Barriers for Persons with Disabilities
2020
Ben Hutchinson
Vinodkumar Prabhakaran
Emily Denton
Kellie Webster
Yu Zhong
Stephen Denuyl
2
+
The State and Fate of Linguistic Diversity and Inclusion in the NLP World
2020
Pratik Joshi
Sebastin Santy
Amar Budhiraja
Kalika Bali
Monojit Choudhury
2
+
GeDi: Generative Discriminator Guided Sequence Generation
2021
Ben Krause
Akhilesh Gotmare
Bryan McCann
Nitish Shirish Keskar
Shafiq Joty
Richard Socher
Nazneen Fatema Rajani
2
+
Measurement and Fairness
2021
Abigail Z. Jacobs
Hanna Wallach
2
+
Energy and Policy Considerations for Deep Learning in NLP
2019
Emma Strubell
Ananya Ganesh
Andrew McCallum
2
+
Racial Bias in Hate Speech and Abusive Language Detection Datasets
2019
Thomas Davidson
Debasmita Bhattacharya
Ingmar Weber
2
+
PDF
Chat
Controlling Output Length in Neural Encoder-Decoders
2016
Yuta Kikuchi
Graham Neubig
Ryohei Sasano
Hiroya Takamura
Manabu Okumura
2
+
The Woman Worked as a Babysitter: On Biases in Language Generation
2019
Emily Sheng
Kai-Wei Chang
Prem Natarajan
Nanyun Peng
2
+
Plug and Play Language Models: A Simple Approach to Controlled Text Generation
2019
Sumanth Dathathri
Andrea Madotto
Janice Lan
Jane Hung
Eric Frank
Piero Molino
Jason Yosinski
Rosanne Liu
2
+
Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks
2020
Suchin Gururangan
Ana Marasović
Swabha Swayamdipta
Kyle Lo
Iz Beltagy
Doug Downey
Noah A. Smith
2
+
Stealing Machine Learning Models via Prediction APIs
2016
Florian Tramèr
Fan Zhang
Ari Juels
Michael K. Reiter
Thomas Ristenpart
2
+
Semantics derived automatically from language corpora necessarily contain human biases.
2016
Aylin Caliskan Islam
Joanna J. Bryson
Arvind Narayanan
2
+
PDF
Chat
Secure k-nearest neighbor query over encrypted data in outsourced environments
2014
Yousef Elmehdwi
Bharath K. Samanthula
Wei Jiang
2
+
The Radicalization Risks of GPT-3 and Advanced Neural Language Models
2020
Kris McGuffie
Alex Newhouse
2
+
A Provably Secure Additive and Multiplicative Privacy Homomorphism*
2002
Josep Domingo‐Ferrer
2
+
PDF
Chat
Model Cards for Model Reporting
2019
Margaret Mitchell
Simone Wu
Andrew Zaldivar
Parker Barnes
Lucy Vasserman
Ben Hutchinson
Elena Spitzer
Inioluwa Deborah Raji
Timnit Gebru
2
+
Controlling Linguistic Style Aspects in Neural Language Generation
2017
Jessica Ficler
Yoav Goldberg
2
+
Delete, Retrieve, Generate: a Simple Approach to Sentiment and Style Transfer
2018
Juncen Li
Robin Jia
He He
Percy Liang
2
+
CTRL: A Conditional Transformer Language Model for Controllable Generation
2019
Nitish Shirish Keskar
Bryan McCann
Lav R. Varshney
Caiming Xiong
Richard Socher
2
+
Poisoning Attacks against Support Vector Machines
2012
Battista Biggio
Blaine Nelson
Pavel Laskov
2
+
AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias
2018
Rachel Bellamy
Kuntal Dey
Michael Hind
Samuel C. Hoffman
Stephanie Houde
Kalapriya Kannan
Pranay Lohia
Jacquelyn Martino
Sameep Mehta
Aleksandra Mojsilović
2
+
PDF
Chat
Adversarial Feature Selection Against Evasion Attacks
2015
Fei Zhang
Patrick P. K. Chan
Battista Biggio
Daniel Yeung
Fabio Roli
2
+
PDF
Chat
SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval)
2019
Marcos Zampieri
Shervin Malmasi
Preslav Nakov
Sara Rosenthal
Noura Farra
Ritesh Kumar
2
+
The Curious Case of Neural Text Degeneration
2019
Ari Holtzman
Jan Buys
Li Du
Maxwell Forbes
Yejin Choi
2
+
PDF
Chat
Who is trustworthy? Predicting trustworthy intentions and behavior.
2018
Emma Levine
T. Bradford Bitterly
Taya R. Cohen
Maurice E. Schweitzer
2
+
PDF
Chat
On the Safety of Machine Learning: Cyber-Physical Systems, Decision Sciences, and Data Products
2017
Kush R. Varshney
Homa Alemzadeh
2
+
Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints
2017
Jieyu Zhao
Tianlu Wang
Mark Yatskar
Vicente Ordóñez
Kai-Wei Chang
2
+
PDF
Chat
Causal inference in statistics: An overview
2009
Judea Pearl
2
+
Differential Privacy: A Survey of Results
2008
Cynthia Dwork
2
+
PDF
Chat
A Stable and Effective Learning Strategy for Trainable Greedy Decoding
2018
Yun Chen
Victor O. K. Li
Kyunghyun Cho
Samuel Bowman
2
+
PDF
Chat
Towards Evaluating the Robustness of Neural Networks
2017
Nicholas Carlini
David Wagner
2
+
Dear Sir or Madam, May I Introduce the GYAFC Dataset: Corpus, Benchmarks and Metrics for Formality Style Transfer
2018
Sudha Rao
Joel Tetreault
2
+
Towards A Rigorous Science of Interpretable Machine Learning
2017
Finale Doshi‐Velez
Been Kim
2
+
The Dataset Nutrition Label: A Framework To Drive Higher Data Quality Standards.
2018
Sarah Holland
Ahmed Hosny
Sarah Newman
Joshua Joseph
Kasia Chmielinski
2
+
Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias
2020
Jesse Vig
Sebastian Gehrmann
Yonatan Belinkov
Sharon Qian
Daniel Nevo
Yaron Singer
Stuart M. Shieber
2
+
PDF
Chat
Towards Composable Bias Rating of AI Services
2018
Biplav Srivastava
Francesca Rossi
2