Ask a Question

Prefer a chat interface with context about you and your work?

Frequency and Temporal Convolutional Attention for Text-Independent Speaker Recognition

Frequency and Temporal Convolutional Attention for Text-Independent Speaker Recognition

Majority of the recent approaches for text-independent speaker recognition apply attention or similar techniques for aggregation of frame-level feature descriptors generated by a deep neural network (DNN) front-end. In this paper, we propose methods of convolutional attention for independently modelling temporal and frequency information in a convolutional neural network (CNN) …