CUE-M: Contextual Understanding and Enhanced Search with Multimodal
Large Language Model
CUE-M: Contextual Understanding and Enhanced Search with Multimodal
Large Language Model
The integration of Retrieval-Augmented Generation (RAG) with Multimodal Large Language Models (MLLMs) has expanded the scope of multimodal query resolution. However, current systems struggle with intent understanding, information retrieval, and safety filtering, limiting their effectiveness. This paper introduces Contextual Understanding and Enhanced Search with MLLM (CUE-M), a novel multimodal search …