Training Language Models with Memory Augmentation

Recent work has improved language models (LMs) remarkably by equipping them with a non-parametric memory component. However, most existing approaches only introduce memories at testing time or represent them using a separately trained encoder, resulting in suboptimal training of the language model. In this work, we present TRIME, a novel …