Differentiable Signal Processing With Black-Box Audio Effects

Type: Article

Publication Date: 2021-05-13

Citations: 18

DOI: https://doi.org/10.1109/icassp39728.2021.9415103

Abstract

We present a data-driven approach to automate audio signal processing by incorporating stateful third-party, audio effects as layers within a deep neural network. We then train a deep encoder to analyze input audio and control effect parameters to perform the desired signal manipulation, requiring only input-target paired audio data as supervision. To train our network with non-differentiable black-box effects layers, we use a fast, parallel stochastic gradient approximation scheme within a standard auto differentiation graph, yielding efficient end-to-end backpropagation. We demonstrate the power of our approach with three separate automatic audio production applications: tube amplifier emulation, automatic removal of breaths and pops from voice recordings, and automatic music mastering. We validate our results with a subjective listening test, showing our approach not only can enable new automatic audio effects tasks, but can yield results comparable to a specialized, state-of-the-art commercial solution for music mastering.

Locations

  • arXiv (Cornell University) - View - PDF
  • ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - View

Similar Works

Action Title Year Authors
+ Differentiable Signal Processing With Black-Box Audio Effects 2021 Marco A. MartĂ­nez RamĂ­rez
Oliver Wang
Paris Smaragdis
Nicholas J. Bryan
+ Differentiable Signal Processing With Black-Box Audio Effects 2021 Marco A. MartĂ­nez RamĂ­rez
Oliver Wang
Paris Smaragdis
Nicholas J. Bryan
+ Style Transfer of Audio Effects with Differentiable Signal Processing 2022 Christian J. Steinmetz
Nicholas J. Bryan
Joshua D. Reiss
+ PDF Chat Style Transfer of Audio Effects with Differentiable Signal Processing 2022 Christian J. Steinmetz
Nicholas J. Bryan
Joshua D. Reiss
+ Modelling black-box audio effects with time-varying feature modulation 2022 Marco ComunitĂ 
Christian J. Steinmetz
Huy Phan
Joshua D. Reiss
+ Modelling Black-Box Audio Effects with Time-Varying Feature Modulation 2023 Marco ComunitĂ 
Christian J. Steinmetz
Huy Phan
Joshua D. Reiss
+ Style Transfer for Non-differentiable Audio Effects 2023 K. D. Grant
+ High-Fidelity Noise Reduction with Differentiable Signal Processing 2023 Christian J. Steinmetz
Thomas Walther
Joshua D. Reiss
+ DDSP: Differentiable Digital Signal Processing 2020 Jesse Engel
Lamtharn Hantrakul
Chenjie Gu
Adam P. Roberts
+ PDF Chat Steerable discovery of neural audio effects 2021 Christian J. Steinmetz
Joshua D. Reiss
+ Steerable discovery of neural audio effects 2021 Christian J. Steinmetz
Joshua D. Reiss
+ NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks 2023 AdriĂĄn Barahona-RĂ­os
Tom Collins
+ PDF Chat Modeling Nonlinear Audio Effects with End-to-end Deep Neural Networks 2019 Marco A. MartĂ­nez RamĂ­rez
Joshua D. Reiss
+ SignalTrain: Profiling Audio Compressors with Deep Neural Networks 2019 Scott H. Hawley
Benjamin Colburn
Stylianos Ioannis Mimilakis
+ Exploring Quality and Generalizability in Parameterized Neural Audio Effects 2020 William Mitchell
Scott H. Hawley
+ Exploring Quality and Generalizability in Parameterized Neural Audio Effects 2020 William Mitchell
Scott H. Hawley
+ Efficient Neural Networks for Real-time Analog Audio Effect Modeling. 2021 Christian J. Steinmetz
Joshua D. Reiss
+ DDX7: Differentiable FM Synthesis of Musical Instrument Sounds 2022 Franco Caspe
Andrew McPherson
M. Sandler
+ Modeling of nonlinear audio effects with end-to-end deep neural networks 2018 Marco A. MartĂ­nez RamĂ­rez
Joshua D. Reiss
+ Modeling of nonlinear audio effects with end-to-end deep neural networks 2018 Marco A. MartĂ­nez RamĂ­rez
Joshua D. Reiss