Siddharth Gururani

All published works
Action Title Year Authors
+ PDF Chat Cosmos World Foundation Model Platform for Physical AI 2025 NVIDIA
Niket Agarwal
Adnan Ali
Madhu Bala
Yogesh Balaji
E. N. Barker
Tiffany Cai
Prithvijit Chattopadhyay
Yongxin Chen
+ PDF Chat Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models 2024 NVIDIA
Yuval Atzmon
Madhu Bala
Yogesh Balaji
Tiffany Cai
Yin Cui
Jiaojiao Fan
Yunhao Ge
Siddharth Gururani
+ PDF Chat Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion 2024 Yujia Huang
Adishree Ghatare
Y Liu
Ziniu Hu
Qinsheng Zhang
Chandramouli Shama Sastry
Siddharth Gururani
Sageev Oore
Yisong Yue
+ PDF Chat SPACE: Speech-driven Portrait Animation with Controllable Expression 2023 Siddharth Gururani
Arun Mallya
Ting-Chun Wang
Rafael Valle
Ming-Yu Liu
+ Multilingual Multiaccented Multispeaker TTS with RADTTS 2023 Rohan Badlani
Rafael Valle
Kevin J. Shih
João Felipe Santos
Siddharth Gururani
Bryan Catanzaro
+ Anomalous behaviour in loss-gradient based interpretability methods 2022 Vinod Subramanian
Siddharth Gururani
Emmanouil Benetos
M. Sandler
+ SPACE: Speech-driven Portrait Animation with Controllable Expression 2022 Siddharth Gururani
Arun Mallya
Ting-Chun Wang
Rafael Valle
Ming-Yu Liu
+ PDF Chat Semi-Supervised Audio Classification with Partially Labeled Data 2021 Siddharth Gururani
Alexander Lerch
+ Visual Attention for Musical Instrument Recognition 2020 Karn N. Watcharasupat
Siddharth Gururani
Alexander Lerch
+ dMelodies: A Music Dataset for Disentanglement Learning 2020 Ashis Pati
Siddharth Gururani
Alexander Lerch
+ Score-informed Networks for Music Performance Assessment 2020 Jiawen Huang
Yun-Ning Hung
Ashis Pati
Siddharth Gururani
Alexander Lerch
+ PDF Chat An Interdisciplinary Review of Music Performance Analysis 2020 Alexander Lerch
Claire Arthur
Kumar Ashis Pati
Siddharth Gururani
+ An Attention Mechanism for Musical Instrument Recognition 2019 Siddharth Gururani
Mohit Sharma
Alexander Lerch
+ Music Performance Analysis: A Survey 2019 Alexander Lerch
Claire Arthur
Ashis Pati
Siddharth Gururani
+ Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features 2019 Siddharth Gururani
Kilol Gupta
Dhaval Shah
Zahra Shakeri
Jervis Pinto
Commonly Cited References
Action Title Year Authors # of times referenced
+ Transfer learning for music classification and regression tasks 2017 Keunwoo Choi
György Fazekas
M. Sandler
Kyunghyun Cho
3
+ PDF Chat Audio Set Classification with Attention Model: A Probabilistic Perspective 2018 Qiuqiang Kong
Yong Xu
Wenwu Wang
Mark D. Plumbley
3
+ PDF Chat CNN architectures for large-scale audio classification 2017 Shawn Hershey
Sourish Chaudhuri
Daniel P. W. Ellis
Jort F. Gemmeke
Aren Jansen
Robert C. Moore
Manoj Plakal
Devin Platt
Rif A. Saurous
Bryan Seybold
2
+ PDF Chat Multitask Learning for Frame-level Instrument Recognition 2019 Yun-Ning Hung
Yi‐An Chen
Yi‐Hsuan Yang
2
+ Neural Translation of Musical Style 2017 Iman Malik
Carl Henrik Ek
2
+ PDF Chat Learning a Deep ConvNet for Multi-Label Classification With Partial Labels 2019 Thibaut Durand
Nazanin Mehrasa
Greg Mori
2
+ Music Performance Analysis: A Survey 2019 Alexander Lerch
Claire Arthur
Ashis Pati
Siddharth Gururani
2
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
2
+ PDF Chat This time with feeling: learning expressive musical performance 2018 Sageev Oore
Ian Simon
Sander Dieleman
Douglas Eck
Karen Simonyan
2
+ Frame-level Instrument Recognition by Timbre and Pitch 2018 Yun-Ning Hung
Yi‐Hsuan Yang
2
+ PDF Chat Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music 2016 Yoonchang Han
Jaehun Kim
Kyogu Lee
2
+ Neural Machine Translation by Jointly Learning to Align and Translate 2015 Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
2
+ Deep convolutional networks on the pitch spiral for musical instrument recognition 2016 Vincent Lostanlen
Carmine-Emanuele Cella
1
+ PDF Chat Audio Event Detection using Weakly Labeled Data 2016 Anurag Kumar
Bhiksha Raj
1
+ Recurrent Models of Visual Attention 2014 Volodymyr Mnih
Nicolas Heess
Alex Graves
Koray Kavukcuoglu
1
+ PDF Chat Face2Face: Real-Time Face Capture and Reenactment of RGB Videos 2016 Justus Thies
Michael Zollhöfer
Marc Stamminger
Christian Theobalt
Matthias Nießner
1
+ PDF Chat Face Alignment in Full Pose Range: A 3D Total Solution 2017 Xiangyu Zhu
Xiaoming Liu
Zhen Lei
Stan Z. Li
1
+ Learning with Pseudo-Ensembles 2014 Phil Bachman
Ouais Alsharif
Doina Precup
1
+ PDF Chat Effective Approaches to Attention-based Neural Machine Translation 2015 Thang Luong
Hieu Pham
Christopher D. Manning
1
+ PDF Chat FiLM: Visual Reasoning with a General Conditioning Layer 2018 Ethan Perez
Florian Strub
Harm de Vries
Vincent Dumoulin
Aaron Courville
1
+ PDF Chat Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder 2018 Kei Akuzawa
Yusuke Iwasawa
Yutaka Matsuo
1
+ Automatic Instrument Recognition in Polyphonic Music Using Convolutional Neural Networks 2015 Peter Li
Jiyuan Qian
Tian Wang
1
+ Multi-level Attention Model for Weakly Supervised Audio Classification 2018 Changsong Yu
Karim Said Barsim
Qiuqiang Kong
Bin Yang
1
+ Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron 2018 RJ Skerry-Ryan
Eric Battenberg
Ying Xiao
Yuxuan Wang
Daisy Stanton
Joel Shor
Ron J. Weiss
Rob Clark
Rif A. Saurous
1
+ Adaptive Pooling Operators for Weakly Labeled Sound Event Detection 2018 Brian McFee
Justin Salamon
Juan Pablo Bello
1
+ Understanding disentangling in $β$-VAE 2018 Christopher Burgess
Irina Higgins
Arka Pal
Loïc Matthey
Nick Watters
Guillaume Desjardins
Alexander Lerchner
1
+ PDF Chat VoxCeleb2: Deep Speaker Recognition 2018 Joon Son Chung
Arsha Nagrani
Andrew Zisserman
1
+ Tacotron: Towards End-to-End Speech Synthesis 2017 Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
Navdeep Jaitly
Zongheng Yang
Ying Xiao
Zhifeng Chen
Samy Bengio
1
+ Transfer learning for music classification and regression tasks 2017 Keunwoo Choi
György Fazekas
M. Sandler
Kyunghyun Cho
1
+ PDF Chat PerformanceNet: Score-to-Audio Music Generation with Multi-Band Convolutional Residual Network 2019 Bryan Wang
Yi‐Hsuan Yang
1
+ Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems 2015 Colin Raffel
Daniel P. W. Ellis
1
+ Semi-supervised Learning with Deep Generative Models 2014 Diederik P. Kingma
Shakir Mohamed
Danilo Jimenez Rezende
Max Welling
1
+ PDF Chat Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection 2017 Emre Çakır
Giambattista Parascandolo
Toni Heittola
Heikki Huttunen
Tuomas Virtanen
1
+ Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift 2015 Sergey Ioffe
Christian Szegedy
1
+ On the Transfer of Inductive Bias from Simulation to the Real World: a New Disentanglement Dataset 2019 Muhammad Waleed Gondal
Manuel Wüthrich
Đorđe Miladinović
Francesco Locatello
Martin Breidt
Valentin Volchkov
Joel Akpo
Olivier Bachem
Bernhard Schölkopf
Stefan Bauer
1
+ FMA: A Dataset For Music Analysis 2016 Michaël Defferrard
Kirell Benzi
Pierre Vandergheynst
Xavier Bresson
1
+ Semi-Supervised Learning with Deep Generative Models 2014 Diederik P. Kingma
Danilo Jimenez Rezende
Shakir Mohamed
Max Welling
1
+ Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset 2018 Curtis Hawthorne
Andriy Stasyuk
Adam P. Roberts
Ian Simon
Cheng-Zhi Anna Huang
Sander Dieleman
Erich Elsen
Jesse Engel
Douglas Eck
1
+ CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network 2019 Vincent Wan
Chun-an Chan
Tom Kenter
Jakub Vít
Rob Clark
1
+ Deep Sets 2017 Manzil Zaheer
Satwik Kottur
Siamak Ravanbakhsh
Barnabás Póczos
Ruslan Salakhutdinov
Alexander J. Smola
1
+ Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results 2017 Antti Tarvainen
Harri Valpola
1
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
1
+ Learning to Groove with Inverse Sequence Transformations 2019 Jon Gillick
Adam P. Roberts
Jesse Engel
Douglas Eck
David Bamman
1
+ Multi-Level Variational Autoencoder: Learning Disentangled Representations from Grouped Observations 2017 Diane Bouchacourt
Ryota Tomioka
Sebastian Nowozin
1
+ Very Deep Convolutional Networks for Large-Scale Image Recognition 2014 Karen Simonyan
Andrew Zisserman
1
+ PDF Chat Wav2Pix: Speech-conditioned Face Generation Using Generative Adversarial Networks 2019 Amanda Duarte
Francisco Roldan
Miquel Tubau
Janna Escur
Santiago Pascual
Amaia Salvador
Eva Mohedano
Kevin McGuinness
Jordi Torres
Xavier Giró-i-Nieto
1
+ PDF Chat Sound event detection using spatial features and convolutional recurrent neural network 2017 Sharath Adavanne
Pasi Pertilä
Tuomas Virtanen
1
+ PDF Chat GLSR-VAE: Geodesic latent space regularization for variational autoencoder architectures 2017 Gaëtan Hadjeres
Frank Nielsen
François Pachet
1
+ PDF Chat Talking Face Generation by Adversarially Disentangled Audio-Visual Representation 2019 Hang Zhou
Yu Liu
Ziwei Liu
Ping Luo
Xiaogang Wang
1
+ Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations 2018 Francesco Locatello
Stefan Bauer
Mario Lučić
Gunnar Rätsch
Sylvain Gelly
Bernhard Schölkopf
Olivier Bachem
1