JustLogic: A Comprehensive Benchmark for Evaluating Deductive Reasoning
in Large Language Models
Logical reasoning is a critical component of Large Language Models (LLMs), and substantial research efforts in recent years have aimed to enhance their deductive reasoning capabilities. However, existing deductive reasoning benchmarks, which are crucial for evaluating and advancing LLMs, are inadequate due to their lack of task complexity, presence of …