Ask a Question

Prefer a chat interface with context about you and your work?

Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?

Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?

Solving grid puzzles involves a significant amount of logical reasoning. Hence, it is a good domain to evaluate the reasoning capability of a model which can then guide us to improve the reasoning ability of models. However, most existing works evaluate only the final predicted answer of a puzzle, without …