Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Text-to-video models have advanced remarkably through optimization on high-quality text-video pairs, where the textual prompts play a pivotal role in determining the quality of the output videos. However, achieving the desired output often requires multiple rounds of prompt revision and repeated inference to refine user-provided prompts. Current automatic methods for refining prompts encounter challenges …