Ask a Question

Prefer a chat interface with context about you and your work?

Trick Me If You Can: Human-in-the-Loop Generation of Adversarial Examples for Question Answering

Trick Me If You Can: Human-in-the-Loop Generation of Adversarial Examples for Question Answering

Adversarial evaluation stress-tests a model’s understanding of natural language. Because past approaches expose superficial patterns, the resulting adversarial examples are limited in complexity and diversity. We propose human- in-the-loop adversarial generation, where human authors are guided to break models. We aid the authors with interpretations of model predictions through an …