Accelerated Diffusion Models via Speculative Sampling

Speculative sampling is a popular technique for accelerating inference in Large Language Models by generating candidate tokens using a fast draft model and accepting or rejecting them based on the target model's distribution. While speculative sampling was previously limited to discrete sequences, we extend it to diffusion models, which generate …
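
To make the draft-then-verify idea concrete, here is a minimal sketch of the standard accept/reject step for a single discrete token, as used with language models. It is not the diffusion-model extension this abstract announces. The function name `speculative_sample` and the toy distributions `p` (target) and `q` (draft) are hypothetical and chosen only for illustration.

```python
import random


def speculative_sample(p, q, rng):
    """Draw one token distributed according to target p, using a proposal from draft q.

    p, q: lists of probabilities over the same vocabulary (each sums to 1).
    Returns (token_index, accepted_flag).
    Illustrative sketch only; names and interface are assumptions.
    """
    # Draft step: propose a token from the cheap draft distribution q.
    x = rng.choices(range(len(q)), weights=q)[0]
    # Verification step: accept the proposal with probability min(1, p[x] / q[x]).
    if rng.random() < min(1.0, p[x] / q[x]):
        return x, True
    # On rejection, resample from the residual max(0, p - q);
    # rng.choices normalizes the weights internally.
    residual = [max(0.0, pi - qi) for pi, qi in zip(p, q)]
    x = rng.choices(range(len(p)), weights=residual)[0]
    return x, False


if __name__ == "__main__":
    rng = random.Random(0)
    p = [0.6, 0.3, 0.1]  # toy target distribution (assumed)
    q = [0.2, 0.5, 0.3]  # toy draft distribution (assumed)
    print(speculative_sample(p, q, rng))
```

The residual resampling on rejection is what keeps the output distributed exactly according to `p`, whatever the draft `q` is, provided `q` puts nonzero mass wherever `p` does. A closer draft only raises the acceptance rate.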