Ask a Question

Prefer a chat interface with context about you and your work?

RNR: Teaching Large Language Models to Follow Roles and Rules

RNR: Teaching Large Language Models to Follow Roles and Rules

Instruction fine-tuning (IFT) elicits instruction following capabilities and steers the behavior of large language models (LLMs) via supervised learning. However, existing models trained on open-source IFT datasets only have the ability to follow instructions from users, and often fail to follow complex role and rules specified by developers, a.k.a. system …