Ask a Question

Prefer a chat interface with context about you and your work?

LLM-based relevance assessment still can't replace human relevance assessment

LLM-based relevance assessment still can't replace human relevance assessment

The use of large language models (LLMs) for relevance assessment in information retrieval has gained significant attention, with recent studies suggesting that LLM-based judgments provide comparable evaluations to human judgments. Notably, based on TREC 2024 data, Upadhyay et al. make a bold claim that LLM-based relevance assessments, such as those …