Align-SLM: Textless Spoken Language Models with Reinforcement Learning
from AI Feedback
Align-SLM: Textless Spoken Language Models with Reinforcement Learning
from AI Feedback
While textless Spoken Language Models (SLMs) have shown potential in end-to-end speech-to-speech modeling, they still lag behind text-based Large Language Models (LLMs) in terms of semantic coherence and relevance. This work introduces the Align-SLM framework, which leverages preference optimization inspired by Reinforcement Learning with AI Feedback (RLAIF) to enhance the …