What Goes Into a LM Acceptability Judgment? Rethinking the Impact of
Frequency and Length
When the linguistic capabilities of language models (LMs) are compared with those of humans using LM probabilities, factors such as sequence length and the unigram frequency of lexical items have a significant effect on those probabilities in ways that humans are largely robust to. Prior work comparing LM and …