Ask a Question

Prefer a chat interface with context about you and your work?

Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

While textless Spoken Language Models (SLMs) have shown potential in end-to-end speech-to-speech modeling, they still lag behind text-based Large Language Models (LLMs) in terms of semantic coherence and relevance. This work introduces the Align-SLM framework, which leverages preference optimization inspired by Reinforcement Learning with AI Feedback (RLAIF) to enhance the …