3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker
Verification and Diarization
3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker
Verification and Diarization
This paper introduces 3D-Speaker-Toolkit, an open source toolkit for multi-modal speaker verification and diarization. It is designed for the needs of academic researchers and industrial practitioners. The 3D-Speaker-Toolkit adeptly leverages the combined strengths of acoustic, semantic, and visual data, seamlessly fusing these modalities to offer robust speaker recognition capabilities. The …