Efficient Keyword Spotting by Capturing Long-Range Interactions with Temporal Lambda Networks
Efficient Keyword Spotting by Capturing Long-Range Interactions with Temporal Lambda Networks
Models based on attention mechanisms have shown unprecedented speech recognition performance. However, they are computationally expensive and unnecessarily complex for keyword spotting, a task targeted to small-footprint devices. This work explores the application of Lambda networks, an alternative framework for capturing long-range interactions without attention, for the keyword spotting task. …