As an AI Language Model, "Yes I Would Recommend Calling the Police'':
Norm Inconsistency in LLM Decision-Making
As an AI Language Model, "Yes I Would Recommend Calling the Police'':
Norm Inconsistency in LLM Decision-Making
We investigate the phenomenon of norm inconsistency: where LLMs apply different norms in similar situations. Specifically, we focus on the high-risk application of deciding whether to call the police in Amazon Ring home surveillance videos. We evaluate the decisions of three state-of-the-art LLMs -- GPT-4, Gemini 1.0, and Claude 3 …