HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines

Type: Article

Publication Date: 2020-04-09

Citations: 52

DOI: https://doi.org/10.1109/icassp40776.2020.9054423

Abstract

This paper presents a far-field text-dependent speaker verification database named HI-MIA. We aim to meet the data requirement for far-field microphone array based speaker verification since most of the publicly available databases are single channel close-talking and text-independent. The database contains recordings of 340 people in rooms designed for the far-field scenario. Recordings are captured by multiple microphone arrays located in different directions and distance to the speaker and a high-fidelity close-talking microphone. Besides, we propose a set of end-to-end neural network based baseline systems that adopt single-channel data for training. Moreover, we propose a testing background aware enrollment augmentation strategy to further enhance the performance. Results show that the fusion systems could achieve 3.29% EER in the far-field enrollment far field testing task and 4.02% EER in the close-talking enrollment and far-field testing task.

Locations

  • arXiv (Cornell University) - View - PDF
  • ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - View

Similar Works

Action Title Year Authors
+ HI-MIA : A Far-field Text-Dependent Speaker Verification Database and the Baselines 2019 Xiaoyi Qin
Hui Bu
Ming Li
+ The INTERSPEECH 2020 Far-Field Speaker Verification Challenge 2020 Xiaoyi Qin
Ming Li
Hui Bu
Wei Rao
Rohan Kumar Das
Shrikanth Narayanan
Haizhou Li
+ PDF Chat The INTERSPEECH 2020 Far-Field Speaker Verification Challenge 2020 Xiaoyi Qin
Ming Li
Hui Bu
Wei Rao
Rohan Kumar Das
Shrikanth Narayanan
Haizhou Li
+ NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge 2020 Li Zhang
Jian Wu
Lei Xie
+ PDF Chat NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge 2020 Li Zhang
Jian Wu
Lei Xie
+ An Empirical Study on Text-Independent Speaker Verification based on the GE2E Method 2020 Soroosh Tayebi Arasteh
+ Parameterized Channel Normalization for Far-field Deep Speaker Verification 2021 Xuechen Liu
Md Sahidullah
Tomi Kinnunen
+ PDF Chat Parameterized Channel Normalization for Far-Field Deep Speaker Verification 2021 Xuechen Liu
Md Sahidullah
Tomi Kinnunen
+ PDF Chat The 2022 Far-field Speaker Verification Challenge: Exploring domain mismatch and semi-supervised learning under the far-field scenario 2022 Xiaoyi Qin
Ming Li
Hui Bu
Shrikanth Narayanan
Haizhou Li
+ The 2022 Far-field Speaker Verification Challenge: Exploring domain mismatch and semi-supervised learning under the far-field scenario 2022 Xiaoyi Qin
Ming Li
Hui Bu
Shrikanth Narayanan
Haizhou Li
+ Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge 2022 Fan Yu
Shiliang Zhang
Pengcheng Guo
Yihui Fu
Zhihao Du
Siqi Zheng
Weilong Huang
Lei Xie
Zheng‐Hua Tan
Deliang Wang
+ PDF Chat Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge 2022 Fan Yu
Shiliang Zhang
Pengcheng Guo
Yihui Fu
Zhihao Du
Siqi Zheng
Weilong Huang
Lei Xie
Zheng‐Hua Tan
DeLiang Wang
+ End-to-End Attention based Text-Dependent Speaker Verification 2017 Shi-Xiong Zhang
Zhuo Chen
Yong Zhao
Jinyu Li
Yifan Gong
+ PDF Chat The NIST 2010 speaker recognition evaluation 2010 Alvin F. Martín
Craig S. Greenberg
+ Generalized LSTM-based End-to-End Text-Independent Speaker Verification. 2020 Soroosh Tayebi Arasteh
+ Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge 2022 Jingguang Tian
Xinhui Hu
Xinkang Xu
+ PDF Chat The 2018 NIST Speaker Recognition Evaluation 2019 Seyed Omid Sadjadi
Craig S. Greenberg
Elliot Singer
Douglas A. Reynolds
Lisa Mason
Jaime Hernández-Cordero
+ PDF Chat The 2016 NIST Speaker Recognition Evaluation 2017 Seyed Omid Sadjadi
Timothée Kheyrkhah
Audrey Tong
Craig S. Greenberg
Douglas A. Reynolds
Elliot Singer
Lisa Mason
Jaime Hernández-Cordero
+ The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation 2019 Danwei Cai
Weicheng Cai
Ming Li
+ PDF Chat The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation 2019 Danwei Cai
Weicheng Cai
Ming Li

Works That Cite This (21)

Action Title Year Authors
+ The INTERSPEECH 2020 Far-Field Speaker Verification Challenge 2020 Xiaoyi Qin
Ming Li
Hui Bu
Wei Rao
Rohan Kumar Das
Shrikanth Narayanan
Haizhou Li
+ PDF Chat The INTERSPEECH 2020 Far-Field Speaker Verification Challenge 2020 Xiaoyi Qin
Ming Li
Hui Bu
Wei Rao
Rohan Kumar Das
Shrikanth Narayanan
Haizhou Li
+ The HCCL Speaker Verification System for Far-Field Speaker Verification Challenge 2021 Zhuo Li
Ce Fang
Runqiu Xiao
Zhigao Chen
Wenchao Wang
Yonghong Yan
+ PDF Chat Deep Representation Decomposition for Rate-Invariant Speaker Verification 2022 Fuchuan Tong
Siqi Zheng
Haodong Zhou
Xingjia Xie
Qingyang Hong
Lin Li
+ The FFSVC 2020 Evaluation Plan 2020 Xiaoyi Qin
Ming Li
Hui Bu
Rohan Kumar Das
Wei Rao
Shrikanth Narayanan
Haizhou Li
+ Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays 2021 Shanzheng Guan
Shupei Liu
Junqi Chen
Wenbo Zhu
Shengqiang Li
Xu Tan
Ziye Yang
Menglong Xu
Yijiang Chen
Jianyu Wang
+ An MAP Estimation for Between-Class Variance 2021 Jiao Han
Yunqi Cai
Lantian Li
Guanyu Li
Dong Wang
+ PDF Chat Multisv: Dataset for Far-Field Multi-Channel Speaker Verification 2022 Ladislav Mošner
Oldřich Plchot
Lukáš Burget
Jaň Černocký
+ PDF Chat A Principle Solution for Enroll-Test Mismatch in Speaker Recognition 2022 Lantian Li
Dong Wang
Jiawen Kang
Renyu Wang
Jing Wu
Zhendong Gao
Xiao Chen
+ Voxblink: A Large Scale Speaker Verification Dataset on Camera 2024 Yuke Lin
Xiaoyi Qin
Guoqing Zhao
Ming Cheng
Ning Jiang
Haiying Wu
Ming Li