Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations

Language model users often issue queries that lack specification, where the context under which a query was issued -- such as the user's identity, the query's intent, and the criteria for a response to be useful -- is not explicit. For instance, a good response to a subjective query like …