Multi-Modal Retrieval For Large Language Model Based Speech Recognition
Multi-Modal Retrieval For Large Language Model Based Speech Recognition
Retrieval is a widely adopted approach for improving language models leveraging external information. As the field moves towards multi-modal large language models, it is important to extend the pure text based methods to incorporate other modalities in retrieval as well for applications across the wide spectrum of machine learning tasks …