Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery
Unsupervised spoken term discovery consists of two tasks: finding the acoustic segment boundaries and labeling acoustically similar segments with the same labels.We perform segmentation based on the assumption that the frame feature vectors are more similar within a segment than across the segments.Therefore, for strong segmentation performance, it is crucial …