Contextualized Evaluations: Taking the Guesswork Out of Language Model
Evaluations
Language model users often issue underspecified queries, where the context under which the query was issued (such as the user's identity, the query's intent, and the criteria for a response to be useful) is not explicit. For instance, a good response to a subjective query like …