Ask a Question

Prefer a chat interface with context about you and your work?

Investigating the Robustness of LLMs on Math Word Problems

Investigating the Robustness of LLMs on Math Word Problems

Large Language Models (LLMs) excel at various tasks, including solving math word problems (MWPs), but struggle with real-world problems containing irrelevant information. To address this, we propose a prompting framework that generates adversarial variants of MWPs by adding irrelevant variables. We introduce a dataset, ProbleMATHIC, containing both adversarial and non-adversarial …