MathChat: Benchmarking Mathematical Reasoning and Instruction Following
in Multi-Turn Interactions
MathChat: Benchmarking Mathematical Reasoning and Instruction Following
in Multi-Turn Interactions
Large language models (LLMs) have demonstrated impressive capabilities in mathematical problem solving, particularly in single turn question answering formats. However, real world scenarios often involve mathematical question answering that requires multi turn or interactive information exchanges, and the performance of LLMs on these tasks is still underexplored. This paper introduces …