Ask a Question

Prefer a chat interface with context about you and your work?

Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving

Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving

In light of the dynamic nature of autonomous driving environments and stringent safety requirements, general MLLMs combined with CLIP alone often struggle to represent driving-specific scenarios accurately, particularly in complex interactions and long-tail cases. To address this, we propose the Hints of Prompt (HoP) framework, which introduces three key enhancements: …