Multi-Modal Retrieval For Large Language Model Based Speech Recognition

Retrieval is a widely adopted approach for improving language models by leveraging external information. As the field moves towards multi-modal large language models, it is important to extend purely text-based methods so that retrieval incorporates other modalities as well, for applications across the wide spectrum of machine learning tasks …