Tensor-To-Vector Regression for Multi-Channel Speech Enhancement Based on Tensor-Train Network

Type: Article

Publication Date: 2020-04-09

Citations: 16

DOI: https://doi.org/10.1109/icassp40776.2020.9052938

Abstract

We propose a tensor-to-vector regression approach to multi-channel speech enhancement in order to address the issue of input size explosion and hidden-layer size expansion. The key idea is to cast the conventional deep neural network (DNN) based vector-to-vector regression formulation under a tensor-train network (TTN) framework. TTN is a recently emerged solution for compact representation of deep models with fully connected hidden layers. Thus TTN maintains DNN's expressive power yet involves a much smaller amount of trainable parameters. Furthermore, TTN can handle a multi-dimensional tensor input by design, which exactly matches the desired setting in multi-channel speech enhancement. We first provide a theoretical extension from DNN to TTN based regression. Next, we show that TTN can attain speech enhancement quality comparable with that for DNN but with much fewer parameters, e.g., a reduction from 27 million to only 5 million parameters is observed in a single-channel scenario. TTN also improves PESQ over DNN from 2.86 to 2.96 by slightly increasing the number of trainable parameters. Finally, in 8-channel conditions, a PESQ of 3.12 is achieved using 20 million parameters for TTN, whereas a DNN with 68 million parameters can only attain a PESQ of 3.06.

Locations

  • arXiv (Cornell University) - View - PDF
  • Nova Science Publishers (Nova Science Publishers, Inc.) - View - PDF
  • ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - View

Similar Works

Action Title Year Authors
+ Tensor-to-Vector Regression for Multi-channel Speech Enhancement based on Tensor-Train Network 2020 Jun Qi
Hu Hu
Yannan Wang
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin‐Hui Lee
+ PDF Chat Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement 2020 Jun Qi
Hu Hu
Yannan Wang
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin‐Hui Lee
+ Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement 2020 Jun Qi
Hu Hu
Yannan Wang
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin‐Hui Lee
+ Tensor-Train Long Short-Term Memory for Monaural Speech Enhancement 2018 Suman Samui
Indrajit Chakrabarti
Soumya Ghosh
+ Tensor-Train Long Short-Term Memory for Monaural Speech Enhancement. 2018 Suman Samui
Indrajit Chakrabarti
Soumya K. Ghosh
+ TFCN: Temporal-Frequential Convolutional Network for Single-Channel Speech Enhancement 2022 Xupeng Jia
Dongmei Li
+ PDF Chat Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing 2023 Jun Qi
Chao-Han Huck Yang
Pin‐Yu Chen
Javier Tejedor
+ Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing 2022 Jun Qi
Chao-Han Huck Yang
Pin‐Yu Chen
+ Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing 2022 Jun Qi
Chao-Han Huck Yang
Pin‐Yu Chen
Javier Tejedor
+ A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models 2023 Pin-Jui Ku
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin‐Hui Lee
+ PDF Chat A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models 2023 Pin-Jui Ku
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin‐Hui Lee
+ Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions 2021 Wangyou Zhang
Jing Shi
Chenda Li
Shinji Watanabe
Yanmin Qian
+ PDF Chat Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions 2021 Wangyou Zhang
Jing Shi
Chenda Li
Shinji Watanabe
Yanmin Qian
+ Compressing Speech Recognition Networks with MLP via Tensor-Train Decomposition 2019 Dan He
Yubin Zhong
+ Deep Residual-Dense Lattice Network for Speech Enhancement 2020 Mohammad Nikzad
Aaron Nicolson
Yongsheng Gao
Jun Zhou
Kuldip K. Paliwal
Fanhua Shang
+ PDF Chat Deep Residual-Dense Lattice Network for Speech Enhancement 2020 Mohammad Nikzad
Aaron Nicolson
Yongsheng Gao
Jun Zhou
Kuldip K. Paliwal
Fanhua Shang
+ A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training 2022 Yang Xiang
Jesper Lisby Højvang
Morten Højfeldt Rasmussen
Mads GrÌsbøll Christensen
+ Monaural Speech Enhancement Using a Multi-Branch Temporal Convolutional Network 2019 Qiquan Zhang
Aaron Nicolson
Mingjiang Wang
Kuldip K. Paliwal
Chenxu Wang
+ Single-Channel Speech Enhancement with Deep Complex U-Networks and Probabilistic Latent Space Models 2023 Eike J. Nustede
JĂśrn AnemĂźller
+ PDF Chat Heterogeneous Space Fusion and Dual-Dimension Attention: A New Paradigm for Speech Enhancement 2024 Tao Zheng
Liejun Wang
Yingfeng Yu

Works That Cite This (12)

Action Title Year Authors
+ PDF Chat SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points 2021 Yu-Chen Lin
Cheng Yu
Yi‐Te Hsu
Szu‐Wei Fu
Yu Tsao
Tei‐Wei Kuo
+ PDF Chat Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information 2023 Yen-Ju Lu
Chia‐Yu Chang
Cheng Yu
Ching‐Feng Liu
Jeih-weih Hung
Shinji Watanabe
Yu Tsao
+ PDF Chat Time-Domain Multi-Modal Bone/Air Conducted Speech Enhancement 2020 Cheng Yu
Kuo-Hsuan Hung
Syu‐Siang Wang
Yu Tsao
Jeih-weih Hung
+ Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement 2020 Jun Qi
Hu Hu
Yannan Wang
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin‐Hui Lee
+ QTN-VQC: An End-to-End Learning framework for Quantum Neural Networks 2021 Jun Qi
Chao-Han Huck Yang
Pin‐Yu Chen
+ PDF Chat Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network-Based Vector-to-Vector Regression 2020 Jun Qi
Jun Du
Sabato Marco Siniscalchi
Xiaoli Ma
Chin‐Hui Lee
+ PDF Chat On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression 2020 Jun Qi
Jun Du
Sabato Marco Siniscalchi
Xiaoli Ma
Chin‐Hui Lee
+ PDF Chat Boosting Self-Supervised Embeddings for Speech Enhancement 2022 Kuo-Hsuan Hung
Szu‐Wei Fu
H. Eric Tseng
Hsin-Tien Chiang
Yu Tsao
Chii‐Wann Lin
+ PDF Chat Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement 2020 Jun Qi
Hu Hu
Yannan Wang
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin‐Hui Lee
+ Designing Tensor-Train Deep Neural Networks For Time-Varying MIMO Channel Estimation 2021 Jing Zhang
Xiaoli Ma
Jun Qi
Shi Jin