Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought
We propose a novel framework, Meta Chain-of-Thought (Meta-CoT), which extends traditional Chain-of-Thought (CoT) by explicitly modeling the underlying reasoning required to arrive at a particular CoT. We present empirical evidence from state-of-the-art models exhibiting behaviors consistent with in-context search, and explore methods for producing Meta-CoT via process supervision, synthetic data …