RealCritic: Towards Effectiveness-Driven Evaluation of Language Model
Critiques
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model
Critiques
Critiques are important for enhancing the performance of Large Language Models (LLMs), enabling both self-improvement and constructive feedback for others by identifying flaws and suggesting improvements. However, evaluating the critique capabilities of LLMs presents a significant challenge due to the open-ended nature of the task. In this work, we introduce …