Type: Preprint
Publication Date: 2025-01-16
Citations: 0
DOI: https://doi.org/10.48550/arxiv.2501.09833
Concept erasure techniques have recently gained significant attention for their potential to remove unwanted concepts from text-to-image models. While these methods often succeed in controlled scenarios, their robustness in real-world applications and their readiness for deployment remain uncertain. In this work, we identify a critical gap in how sanitized models are evaluated, particularly with respect to their performance across various concept dimensions. We systematically investigate the failure modes of current concept erasure techniques, focusing on visually similar, binomial, and semantically related concepts. We propose that these interconnected relationships give rise to concept entanglement, which produces ripple effects and degrades image quality. To facilitate more comprehensive evaluation, we introduce EraseBENCH, a multi-dimensional benchmark designed to assess concept erasure methods in greater depth. Our dataset includes over 100 diverse concepts and more than 1,000 tailored prompts, paired with a comprehensive suite of metrics that together offer a holistic view of erasure efficacy. Our findings reveal that even state-of-the-art techniques struggle to maintain image quality after erasure, indicating that these approaches are not yet reliable enough for real-world deployment.