Björn Schüller

All published works
Action Title Year Authors
+ PDF Chat Parameterised Quantum Circuits for Novel Representation Learning in Speech Emotion Recognition 2025 Thejan Rajapakshe
Rajib Rana
Farhan Riaz
Sara Khalifa
Björn W. Schuller
+ PDF Chat DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching Dataset 2025 Yupei Li
Wei Zhang
Heng Yu
Huichi Zhou
Björn W. Schuller
+ PDF Chat DFingerNet: Noise-Adaptive Speech Enhancement for Hearing Aids 2025 Iosif Tsangko
Andreas Triantafyllopoulos
Michael G. Müller
Hendrik Schröter
Björn W. Schuller
+ PDF Chat MAD-UV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge 2025 Zijiang Yang
Meishu Song
Jing Xin
Haojie Zhang
Kun Qian
Bin Hu
Kota Tamada
Toru Takumi
Björn W. Schuller
Yoshiharu Yamamoto
+ Automating Airborne Pollen Classification: Identifying and Interpreting Hard Samples for Classifiers 2025 Manuel Milling
Simon Rampp
Andreas Triantafyllopoulos
Maria Pilar Plaza
Jens O. Brunner
Claudia Traidl‐Hoffmann
Björn W. Schuller
Athanasios Damialis
+ PDF Chat Gender Bias in Text-to-Video Generation Models: A case study of Sora 2024 Mohammad Nadeem
Shahab Saquib Sohail
Erik Cambria
Björn W. Schuller
Amir Hussain
+ Explainable Artificial Intelligence for Medical Applications: A Review 2024 Qiyang Sun
Alican Akman
Björn Schüller
+ PDF Chat Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment 2024 Qiyang Sun
Yupei Li
Emran Alturki
S Murthy
Björn Schüller
+ PDF Chat Detecting Document-level Paraphrased Machine Generated Content: Mimicking Human Writing Style and Involving Discourse Features 2024 Yupei Li
Manuel Milling
Lucia Specia
Björn Schüller
+ PDF Chat Detecting Machine-Generated Music with Explainability -- A Challenge and Early Benchmarks 2024 Yupei Li
Qiyang Sun
Hui Li
Lucia Specia
Björn Schüller
+ PDF Chat ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis 2024 Xiangheng He
Junjie Chen
Zixing Zhang
Björn Schüller
+ PDF Chat autrainer: A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks 2024 Simon Rampp
Andreas Triantafyllopoulos
Manuel Milling
Björn Schüller
+ PDF Chat M6: Multi-generator, Multi-domain, Multi-lingual and cultural, Multi-genres, Multi-instrument Machine-Generated Music Detection Databases 2024 Yupei Li
Hui Li
Lucia Specia
Björn Schüller
+ PDF Chat From Audio Deepfake Detection to AI-Generated Music Detection -- A Pathway and Overview 2024 Yupei Li
Manuel Milling
Lucia Specia
Björn Schüller
+ PDF Chat Raw Audio Classification with Cosine Convolutional Neural Network (CosCovNN) 2024 Kazi Nazmul Haque
Rajib Rana
T. Jarin
Björn Schüller
+ PDF Chat Using voice analysis as an early indicator of risk for depression in young adults 2024 Klaus R. Scherer
Felix Burkhardt
Uwe D. Reichel
Florian Eyben
Björn Schüller
+ PDF Chat Explainable Artificial Intelligence for Medical Applications: A Review 2024 Qiyang Sun
Alican Akman
Björn Schüller
+ PDF Chat Non-Invasive Suicide Risk Prediction Through Speech Analysis 2024 Shahin Amiriparian
Maurice Gerczuk
Justina Lutz
Wolfgang Strube
Irina Papazova
Alkomiet Hasan
Alexander Kathan
Björn Schüller
+ PDF Chat Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning 2024 Simon Rampp
Manuel Milling
Andreas Triantafyllopoulos
Björn Schüller
+ PDF Chat Audio-based Kinship Verification Using Age Domain Conversion 2024 Qiyang Sun
Alican Akman
Xin Jing
Manuel Milling
Björn Schüller
+ PDF Chat Audio Explanation Synthesis with Generative Foundation Models 2024 Alican Akman
Qiyang Sun
Björn Schüller
+ PDF Chat PerCo (SD): Open Perceptual Compression 2024 Nikolai Körber
Eduard Kromer
Andreas Siebert
Sascha Hauke
Daniel Mueller-Gritschneder
Björn Schüller
+ PDF Chat Trading through Earnings Seasons using Self-Supervised Contrastive Representation Learning 2024 Zhengxin Joseph Ye
Björn Schüller
+ PDF Chat Affective Computing Has Changed: The Foundation Model Disruption 2024 Björn Schüller
Adria Mallol-Ragolta
Alejandro Peña Almansa
Iosif Tsangko
Mostafa M. Amin
Anastasia Semertzidou
Lukas Christ
Shahin Amiriparian
+ PDF Chat Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models 2024 Xin Jing
Kun Zhou
Andreas Triantafyllopoulos
Björn Schüller
+ PDF Chat ParaCLAP – Towards a general language-audio model for computational paralinguistic tasks 2024 Xin Jing
Andreas Triantafyllopoulos
Björn Schüller
+ PDF Chat Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition 2024 Oliver Schrüfer
Manuel Milling
Felix Burkhardt
Florian Eyben
Björn Schüller
+ PDF Chat Sustained Vowels for Pre- vs Post-Treatment COPD Classification 2024 Andreas Triantafyllopoulos
Anton Batliner
Wolfgang Mayr
Markus Fendler
Florian B. Pokorny
Maurice Gerczuk
Shahin Amiriparian
Thomas M. Berghaus
Björn Schüller
+ PDF Chat Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment 2024 Maurice Gerczuk
Shahin Amiriparian
Justina Lutz
Wolfgang Strube
Irina Papazova
Alkomiet Hasan
Björn Schüller
+ PDF Chat Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition 2024 Andreas Triantafyllopoulos
Björn Schüller
+ PDF Chat ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets 2024 Shahin Amiriparian
Filip Packań
Maurice Gerczuk
Björn Schüller
+ PDF Chat This Paper Had the Smartest Reviewers - Flattery Detection Utilising an Audio-Textual Transformer-Based Approach 2024 Lukas Christ
Shahin Amiriparian
Friederike Hawighorst
Ann-Kathrin Schill
Angelo Boutalikakis
Lorenz Graf‐Vlachy
Andreas König
Björn Schüller
+ PDF Chat Negation Blindness in Large Language Models: Unveiling the NO Syndrome in Image Generation 2024 Mohammad Nadeem
Shahab Saquib Sohail
Erik Cambria
Björn Schüller
Amir Hussain
+ PDF Chat Audio-Based Step-Count Estimation for Running - Windowing and Neural Network Baselines 2024 Philipp Wagner
Andreas Triantafyllopoulos
Alexander Gebhard
Björn Schüller
+ PDF Chat Wav2Small: Distilling Wav2Vec2 to 72K parameters for Low-Resource Speech emotion recognition 2024 Dionyssos Kounadis-Bastian
Oliver Schrüfer
Anna Derington
Hagen Wierstorf
Florian Eyben
Felix Burkhardt
Björn Schüller
+ PDF Chat Computer Audition: From Task-Specific Machine Learning to Foundation Models 2024 Andreas Triantafyllopoulos
Iosif Tsangko
Alexander Gebhard
Annamaria Mesaros
Tuomas Virtanen
Björn Schüller
+ PDF Chat A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era 2024 Zhao Ren
Yi Chang
Thành Tâm Nguyên
Yang Tan
Kun Qian
Björn Schüller
+ PDF Chat Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset 2024 Rui Liu
Haolin Zuo
Zheng Lian
Xiaofen Xing
Björn Schüller
Haizhou Li
+ PDF Chat Audio Enhancement for Computer Audition—An Iterative Training Paradigm Using Sample Importance 2024 Manuel Milling
Shuo Liu
Andreas Triantafyllopoulos
Ilhan Aslan
Björn Schüller
+ PDF Chat A Wide Evaluation of ChatGPT on Affective Computing Tasks 2024 Mostafa M. Amin
Rui Mao
Erik Cambria
Björn Schüller
+ PDF Chat This Paper Had the Smartest Reviewers -- Flattery Detection Utilising an Audio-Textual Transformer-Based Approach 2024 Lukas Christ
Shahin Amiriparian
Friederike Hawighorst
Ann-Kathrin Schill
Angelo Boutalikakis
Lorenz Graf‐Vlachy
Andreas König
Björn Schüller
+ PDF Chat Speech Emotion Recognition under Resource Constraints with Data Distillation 2024 Yi Chang
Zhao Ren
Zhonghao Zhao
Thành Tâm Nguyên
Kun Qian
Tanja Schultz
Björn Schüller
+ PDF Chat ParaCLAP -- Towards a general language-audio model for computational paralinguistic tasks 2024 Xin Jing
Andreas Triantafyllopoulos
Björn Schüller
+ PDF Chat The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition 2024 Shahin Amiriparian
Lukas Christ
Alexander Kathan
Maurice Gerczuk
Niklas Müller
Steffen Klug
Lukas Stappen
Andreas König
Erik Cambria
Björn Schüller
+ PDF Chat DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition 2024 Xin Jing
Luyang Zhang
Jiangjian Xie
Alexander Gebhard
Alice Baird
Björn Schüller
+ PDF Chat An automatic analysis of ultrasound vocalisations for the prediction of interaction context in captive Egyptian fruit bats 2024 Andreas Triantafyllopoulos
Alexander Gebhard
Manuel Milling
Simon Rampp
Björn Schüller
+ PDF Chat Audio-based Step-count Estimation for Running -- Windowing and Neural Network Baselines 2024 Philipp Wagner
Andreas Triantafyllopoulos
Alexander Gebhard
Björn Schüller
+ PDF Chat INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition 2024 Andreas Triantafyllopoulos
Anton Batliner
Simon Rampp
Manuel Milling
Björn Schüller
+ PDF Chat Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning 2024 Lukas Christ
Shahin Amiriparian
Manuel Milling
Ilhan Aslan
Björn Schüller
+ PDF Chat Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models 2024 Zixing Zhang
Liyizhe Peng
Tao Pang
Jing Han
Huan Zhao
Björn Schüller
+ PDF Chat Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding 2024 Rong Gao
Xin Liu
Bohao Xing
Zitong Yu
Björn Schüller
Heikki Kälviäinen
+ PDF Chat HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech 2024 Zhongren Dong
Zixing Zhang
Weixiang Xu
Jing Han
Jianjun Ou
Björn Schüller
+ PDF Chat Intelligent Cardiac Auscultation for Murmur Detection via Parallel-Attentive Models with Uncertainty Estimation 2024 Zixing Zhang
Tao Pang
Jing Han
Björn Schüller
+ PDF Chat Expressivity and Speech Synthesis 2024 Andreas Triantafyllopoulos
Björn Schüller
+ PDF Chat MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition 2024 Zheng Lian
Haiyang Sun
Licai Sun
Zhuofan Wen
Siyuan Zhang
Shun Chen
Hao Gu
Jinming Zhao
Ziyang Ma
Xie Chen
+ PDF Chat Enhancing Suicide Risk Assessment: A Speech-Based Automated Approach in Emergency Medicine 2024 Shahin Amiriparian
Maurice Gerczuk
Justina Lutz
Wolfgang Strube
Irina Papazova
Alkomiet Hasan
Alexander Kathan
Björn Schüller
+ Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model 2024 Yuezhou Zhang
Amos Folarin
Judith Dineley
Pauline Conde
Valeria de Angel
Shaoxiong Sun
Yatharth Ranjan
Zulqarnain Rashid
Callum Stewart
Petroula Laiou
+ PDF Chat On Prompt Sensitivity of ChatGPT in Affective Computing 2024 Mostafa M. Amin
Björn Schüller
+ PDF Chat emoDARTS: Joint Optimisation of CNN & Sequential Neural Network Architectures for Superior Speech Emotion Recognition 2024 Thejan Rajapakshe
Rajib Rana
Sara Khalifa
Berrak Şişman
Björn Schüller
Carlos Busso
+ Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition 2024 Yong Wang
Cheng Lu
Hailun Lian
Yan Zhao
Björn Schüller
Yuan Zong
Wenming Zheng
+ Intelligent Cardiac Auscultation for Murmur Detection via Parallel-Attentive Models with Uncertainty Estimation 2024 Zixing Zhang
Tao Pang
Jing Han
Björn Schüller
+ Customising General Large Language Models for Specialised Emotion Recognition Tasks 2024 Liyizhe Peng
Zixing Zhang
Tao Pang
Jing Han
Huan Zhao
Hao Chen
Björn Schüller
+ Improving Speaker-Independent Speech Emotion Recognition using Dynamic Joint Distribution Adaptation 2024 Cheng Lu
Yuan Zong
Hailun Lian
Yan Zhao
Björn Schüller
Wenming Zheng
+ HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer’s Disease Detection From Spontaneous Speech 2024 Zhongren Dong
Zixing Zhang
Weixiang Xu
Jing Han
Jianjun Ou
Björn Schüller
+ Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion Recognition 2024 Yan Zhao
Jincen Wang
Cheng Lu
Sunan Li
Björn Schüller
Yuan Zong
Wenming Zheng
+ Bringing the Discussion of Minima Sharpness to the Audio Domain: A Filter-Normalised Evaluation for Acoustic Scene Classification 2024 Manuel Milling
Andreas Triantafyllopoulos
Iosif Tsangko
Simon Rampp
Björn Schüller
+ Synthia’s Melody: A Benchmark Framework for Unsupervised Domain Adaptation in Audio 2024 Chiahsin Lin
Charles Jones
Björn Schüller
Harry Coppock
Alican Akman
+ Task Selection and Assignment for Multi-Modal Multi-Task Dialogue Act Classification with Non-Stationary Multi-Armed Bandits 2024 Xiangheng He
Junjie Chen
Björn Schüller
+ PDF Chat Propagating variational model uncertainty for bioacoustic call label smoothing 2024 Georgios Rizos
Jenna Lawson
Simon F. Mitchell
Pranay Shah
Xin Wen
Cristina Banks‐Leite
Robert M. Ewers
Björn Schüller
+ PDF Chat STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition 2024 Yi Chang
Zhao Ren
Zixing Zhang
Xin Jing
Kun Qian
Xi Shao
Bin Hu
Tanja Schultz
Björn Schüller
+ Improving Speaker-independent Speech Emotion Recognition Using Dynamic Joint Distribution Adaptation 2024 Cheng Lu
Yuan Zong
Hailun Lian
Yan Zhao
Björn Schüller
Wenming Zheng
+ Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition 2024 Yong Wang
Cheng Lu
Hailun Lian
Yan Zhao
Björn Schüller
Yuan Zong
Wenming Zheng
+ Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion Recognition 2024 Yan Zhao
Jincen Wang
Cheng Lu
Sunan Li
Björn Schüller
Yuan Zong
Wenming Zheng
+ emoDARTS: Joint Optimization of CNN and Sequential Neural Network Architectures for Superior Speech Emotion Recognition 2024 Thejan Rajapakshe
Rajib Rana
Sara Khalifa
Berrak Şişman
Björn Schüller
Carlos Busso
+ PDF Chat Computational charisma—A brick by brick blueprint for building charismatic artificial intelligence 2023 Björn Schüller
Shahin Amiriparian
Anton Batliner
Alexander Gebhard
Maurice Gerczuk
Vincent Karas
Alexander Kathan
Lennart Seizer
Johanna Löchner
+ The UK COVID-19 Vocal Audio Dataset 2023 Harry Coppock
The Alan Turing Institute
UK Health Security Agency
Jobie Budd
Emma Karoune
Chris Holmes
Kieran Baker
Davide Pigoli
George Nicholson
Richard Payne
+ MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning 2023 Zheng Lian
Haiyang Sun
Licai Sun
Kang Chen
Mingyu Xu
Kexin Wang
Ke Xu
Yu He
Ying Li
Jinming Zhao
+ PDF Chat The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions, Cross-Cultural Humour, and Personalisation 2023 Lukas Christ
Shahin Amiriparian
Alice Baird
Alexander Kathan
Niklas Müller
Steffen Klug
Chris Gagne
Panagiotis Tzirakis
Lukas Stappen
Eva-Maria Meßner
+ PDF Chat COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition 2023 Mani Kumar Tellamekala
Shahin Amiriparian
Björn Schüller
Elisabeth André
Timo Giesbrecht
Michel Valstar
+ PDF Chat Integrating Generative Artificial Intelligence in Intelligent Vehicle Systems 2023 Lukas Stappen
Jeremy Dillmann
Serena Striegel
Hans J. Vogel
Nicolas Flores-Herr
Björn Schüller
+ PDF Chat HEAR4Health: a blueprint for making computer audition a staple of modern healthcare 2023 Andreas Triantafyllopoulos
Alexander Kathan
Alice Baird
Lukas Christ
Alexander Gebhard
Maurice Gerczuk
Vincent Karas
Tobias Hübner
Xin Jing
Shuo Liu
+ PDF Chat Can ChatGPT’s Responses Boost Traditional Natural Language Processing? 2023 Mostafa M. Amin
Erik Cambria
Björn Schüller
+ PDF Chat A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model 2023 Mohammad Ibrahim Malik
Siddique Latif
Raja Jurdak
Björn Schüller
+ PDF Chat Abusive Speech Detection in Indic Languages Using Acoustic Features 2023 Anika A. Spiesberger
Andreas Triantafyllopoulos
Iosif Tsangko
Björn Schüller
+ PDF Chat Executive Voiced Laughter and Social Approval: An Explorative Machine Learning Study 2023 Niklas Mueller
Steffen Klug
Alexander Kathan
Lukas Christ
Björn Schüller
Shahin Amiriparian
+ PDF Chat Are 3D Face Shapes Expressive Enough for Recognising Continuous Emotions and Action Unit Intensities? 2023 Mani Kumar Tellamekala
Ömer Sümer
Björn Schüller
Elisabeth André
Timo Giesbrecht
Michel Valstar
+ Hierarchical Network with Decoupled Knowledge Distillation for Speech Emotion Recognition 2023 Ziping Zhao
Huan Wang
Haishuai Wang
Björn Schüller
+ Audio Barlow Twins: Self-Supervised Audio Representation Learning 2023 Jonah Anton
Harry Coppock
Pancham Shukla
Björn Schüller
+ Knowledge Transfer for on-Device Speech Emotion Recognition With Neural Structured Learning 2023 Yi Chang
Zhao Ren
Thành Tâm Nguyên
Kun Qian
Björn Schüller
+ Fast Yet Effective Speech Emotion Recognition with Self-Distillation 2023 Zhao Ren
Thành Tâm Nguyên
Yi Hua Chang
Björn Schüller
+ PDF Chat Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap 2023 Johannes Wagner
Andreas Triantafyllopoulos
Hagen Wierstorf
Maximilian Schmitt
Felix Burkhardt
Florian Eyben
Björn Schüller
+ PDF Chat An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era 2023 Andreas Triantafyllopoulos
Björn Schüller
Gökçe İymen
Tevfik Metin Sezgin
Xiangheng He
Zijiang Yang
Panagiotis Tzirakis
Shuo Liu
Silvan Mertes
Elisabeth André
+ PDF Chat A summary of the ComParE COVID-19 challenges 2023 Harry Coppock
Alican Akman
Christian Bergler
Maurice Gerczuk
Chloë Brown
Jagmohan Chauhan
Andreas Grammenos
Apinan Hasthanasombat
Dimitris Spathis
Xia Tong
+ PDF Chat Multistage linguistic conditioning of convolutional layers for speech emotion recognition 2023 Andreas Triantafyllopoulos
Uwe D. Reichel
Shuo Liu
Stephan Huber
Florian Eyben
Björn Schüller
+ Computational Charisma -- A Brick by Brick Blueprint for Building Charismatic Artificial Intelligence 2023 Björn Schüller
Shahin Amiriparian
Anton Batliner
Alexander Gebhard
Maurice Gerczuk
Vincent Karas
Alexander Kathan
Lennart Seizer
Johanna Löchner
+ A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era 2023 Zhao Ren
Yi Chang
Thanh Tam Nguyen
Yang Tan
Kun Qian
Björn Schüller
+ HEAR4Health: A blueprint for making computer audition a staple of modern healthcare 2023 Andreas Triantafyllopoulos
Alexander Kathan
Alice Baird
Lukas Christ
Alexander Gebhard
Maurice Gerczuk
Vincent Karas
Tobias Hübner
Xin Jing
Shuo Liu
+ audb -- Sharing and Versioning of Audio and Annotation Data in Python 2023 Hagen Wierstorf
Johannes Wagner
Florian Eyben
Felix Burkhardt
Björn Schüller
+ Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT 2023 Mostafa M. Amin
Erik Cambria
Björn Schüller
+ Hierarchical Network with Decoupled Knowledge Distillation for Speech Emotion Recognition 2023 Ziping Zhao
Huan Wang
Haishuai Wang
Björn Schüller
+ MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning 2023 Zheng Lian
Haiyang Sun
Licai Sun
Jinming Zhao
Ye Liu
Bin Liu
Jiangyan Yi
Meng Wang
Erik Cambria
Guoying Zhao
+ The ACM Multimedia 2023 Computational Paralinguistics Challenge: Emotion Share & Requests 2023 Björn Schüller
Anton Batliner
Shahin Amiriparian
Alexander Barnhill
Maurice Gerczuk
Andreas Triantafyllopoulos
Alice Baird
Panagiotis Tzirakis
Chris Gagne
Alan Cowen
+ The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions, Cross-Cultural Humour, and Personalisation 2023 Lukas Christ
Shahin Amiriparian
Alice Baird
Alexander Kathan
Niklas Müller
Steffen Klug
Chris Gagne
Panagiotis Tzirakis
Eva-Maria Meßner
Andreas König
+ Executive Voiced Laughter and Social Approval: An Explorative Machine Learning Study 2023 Niklas Mueller
Steffen Klug
Andreas Koenig
Alexander Kathan
Lukas Christ
Björn Schüller
Shahin Amiriparian
+ A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model 2023 Ibrahim Malik
Siddique Latif
Raja Jurdak
Björn Schüller
+ U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech 2023 Xin Jing
Yi Chang
Zijiang Yang
Jiangjian Xie
Andreas Triantafyllopoulos
Björn Schüller
+ Enhancing Speech Emotion Recognition Through Differentiable Architecture Search 2023 Thejan Rajapakshe
Rajib Rana
Sara Khalifa
Berrak Şişman
Björn Schüller
+ Happy or Evil Laughter? Analysing a Database of Natural Audio Samples 2023 Aljoscha Düsterhöft
Felix Burkhardt
Björn Schüller
+ Integrating Generative Artificial Intelligence in Intelligent Vehicle Systems 2023 Lukas Stappen
Jeremy Dillmann
Serena Striegel
Hans J. Vogel
Nicolas Flores-Herr
Björn Schüller
+ Speech-based Age and Gender Prediction with Transformers 2023 Felix Burkhardt
Johannes Wagner
Hagen Wierstorf
Florian Eyben
Björn Schüller
+ Going Retro: Astonishingly Simple Yet Effective Rule-based Prosody Modelling for Speech Synthesis Simulating Emotion Dimensions 2023 Felix Burkhardt
Uwe D. Reichel
Florian Eyben
Björn Schüller
+ Can ChatGPT's Responses Boost Traditional Natural Language Processing? 2023 Mostafa M. Amin
Erik Cambria
Björn Schüller
+ Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers 2023 Siddique Latif
Muhammad Usama
Mohammad Ibrahim Malik
Björn Schüller
+ Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models 2023 Zixing Zhang
Liyizhe Peng
Tao Pang
Jing Han
Huan Zhao
Björn Schüller
+ Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model 2023 Yuezhou Zhang
Amos Folarin
Judith Dineley
Pauline Conde
Valeria de Angel
Shaoxiong Sun
Yatharth Ranjan
Zulqarnain Rashid
Callum Stewart
Petroula Laiou
+ Sparks of Large Audio Models: A Survey and Outlook 2023 Siddique Latif
Moazzam Shoukat
Fahad Shamshad
Muhammad Usama
Heriberto Cuayáhuitl
Björn Schüller
+ A Wide Evaluation of ChatGPT on Affective Computing Tasks 2023 Mostafa M. Amin
Rui Mao
Erik Cambria
Björn Schüller
+ Exploring Meta Information for Audio-based Zero-shot Bird Classification 2023 Alexander Gebhard
Andreas Triantafyllopoulos
Teresa Bez
Lukas Christ
Alexander Kathan
Björn Schüller
+ Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits 2023 Xiangheng He
Junjie Chen
Björn Schüller
+ Synthia's Melody: A Benchmark Framework for Unsupervised Domain Adaptation in Audio 2023 Chiahsin Lin
Charles Jones
Björn Schüller
Harry Coppock
+ Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification 2023 Manuel Milling
Andreas Triantafyllopoulos
Iosif Tsangko
Simon Rampp
Björn Schüller
+ Customising General Large Language Models for Specialised Emotion Recognition Tasks 2023 Liyizhe Peng
Zixing Zhang
Tao Pang
Jing Han
Huan Zhao
Hao Chen
Björn Schüller
+ Testing Speech Emotion Recognition Machine Learning Models 2023 Anna Derington
Hagen Wierstorf
Ali Gürcan Özkil
Florian Eyben
Felix Burkhardt
Björn Schüller
+ PDF Chat Speech Synthesis With Mixed Emotions 2022 Kun Zhou
Berrak Şişman
Rajib Rana
Björn Schüller
Haizhou Li
+ PDF Chat Audio self-supervised learning: A survey 2022 Shuo Liu
Adria Mallol-Ragolta
Emilia Parada‐Cabaleiro
Kun Qian
Xin Jing
Alexander Kathan
Bin Hu
Björn Schüller
+ PDF Chat Multitask Learning From Augmented Auxiliary Data for Improving Speech Emotion Recognition 2022 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
+ PDF Chat The MuSe 2022 Multimodal Sentiment Analysis Challenge 2022 Lukas Christ
Shahin Amiriparian
Alice Baird
Panagiotis Tzirakis
Alexander Kathan
Niklas Müller
Lukas Stappen
Eva-Maria Meßner
Andreas König
Alan Cowen
+ PDF Chat A Temporal-oriented Broadcast ResNet for COVID-19 Detection 2022 Xin Jing
Shuo Liu
Emilia Parada‐Cabaleiro
Andreas Triantafyllopoulos
Meishu Song
Zijiang Yang
Björn Schüller
+ PDF Chat Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning 2022 Rui Liu
Berrak Şişman
Björn Schüller
Guanglai Gao
Haizhou Li
+ PDF Chat Data Augmentation for Dementia Detection in Spoken Language 2022 Dominika Woszczyk
Anna Hlédiková
Alican Akman
Soteris Demetriou
Björn Schüller
+ PDF Chat SVTS: Scalable Video-to-Speech Synthesis 2022 Rodrigo Mira
Alexandros Haliassos
Stavros Petridis
Björn Schüller
Maja Pantić
+ PDF Chat Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis 2022 Yi Chang
Zhao Ren
Thành Tâm Nguyên
Wolfgang Nejdl
Björn Schüller
+ PDF Chat Distinguishing between pre- and post-treatment in the speech of patients with chronic obstructive pulmonary disease 2022 Andreas Triantafyllopoulos
Markus Fendler
Anton Batliner
Maurice Gerczuk
Shahin Amiriparian
Thomas M. Berghaus
Björn Schüller
+ PDF Chat Probing speech emotion recognition transformers for linguistic knowledge 2022 Andreas Triantafyllopoulos
Johannes Wagner
Hagen Wierstorf
Maximilian Schmitt
Uwe D. Reichel
Florian Eyben
Felix Burkhardt
Björn Schüller
+ Introducing the COVID-19 YouTube (COVYT) speech dataset featuring the same speakers with and without infection 2022 Andreas Triantafyllopoulos
Anastasia Semertzidou
Meishu Song
Florian B. Pokorny
Björn Schüller
+ PDF Chat Depression Diagnosis and Forecast based on Mobile Phone Sensor Data 2022 Xiangheng He
Andreas Triantafyllopoulos
Alexander Kathan
Manuel Milling
Tian‐Hao Yan
Srividya Tirunellai Rajamani
Ludwig Küster
Mathias Harrer
Elena Heber
Inga Grossmann
+ PDF Chat Journaling Data for Daily PHQ-2 Depression Prediction and Forecasting 2022 Alexander Kathan
Andreas Triantafyllopoulos
Xiangheng He
Manuel Milling
Tianhao Yan
Srividya Tirunellai Rajamani
Ludwig Küster
Mathias Harrer
Elena Heber
Inga Grossmann
+ PDF Chat Insights on Modelling Physiological, Appraisal, and Affective Indicators of Stress using Audio Features 2022 Andreas Triantafyllopoulos
Sandra Zänkert
Alice Baird
Julian Konzok
Brigitte M. Kudielka
Björn Schüller
+ PDF Chat Fatigue Prediction in Outdoor Running Conditions using Audio Data 2022 Andreas Triantafyllopoulos
Sandra Ottl
Alexander Gebhard
Esther Rituerto-González
Mirko Jaumann
Steffen Huttner
Valerie Dieter
Patrick Schneeweiß
Inga Krauß
Maurice Gerczuk
+ PDF Chat Evaluating the COVID-19 Identification ResNet (CIdeR) on the INTERSPEECH COVID-19 From Audio Challenges 2022 Alican Akman
Harry Coppock
Alexander Gaskell
Panagiotis Tzirakis
Lyn Jones
Björn Schüller
+ PDF Chat Emotion Intensity and its Control for Emotional Voice Conversion 2022 Kun Zhou
Berrak Şişman
Rajib Rana
Björn Schüller
Haizhou Li
+ PDF Chat End-to-End Video-to-Speech Synthesis Using Generative Adversarial Networks 2022 Rodrigo Mira
Konstantinos Vougioukas
Pingchuan Ma
Stavros Petridis
Björn Schüller
Maja Pantić
+ PDF Chat Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition 2022 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
+ Fairness and underspecification in acoustic scene classification: The case for disaggregated evaluations 2022 Andreas Triantafyllopoulos
Manuel Milling
Konstantinos Drossos
Björn Schüller
+ PDF Chat A Novel Policy for Pre-trained Deep Reinforcement Learning for Speech Emotion Recognition 2022 Thejan Rajapakshe
Rajib Rana
Sara Khalifa
Jiajun Liu
Björn Schüller
+ PDF Chat MEDAS: an open-source platform as a service to help break the walls between medicine and informatics 2022 Liang Zhang
Johann Li
Ping Li
Xiaoyuan Lu
Maoguo Gong
Peiyi Shen
Guangming Zhu
Syed Afaq Ali Shah
Mohammed Bennamoun
Kun Qian
+ Continuous-Time Audiovisual Fusion with Recurrence vs. Attention for In-The-Wild Affect Recognition 2022 Vincent Karas
Mani Kumar Tellamekala
Adria Mallol-Ragolta
Michel Valstar
Björn Schüller
+ Audiovisual Affect Assessment and Autonomous Automobiles: Applications 2022 Björn Schüller
Dagmar Schuller
+ Climate Change & Computer Audition: A Call to Action and Overview on Audio Intelligence to Help Save the Planet 2022 Björn Schüller
Alican Akman
Yi Chang
Harry Coppock
Alexander Gebhard
Alexander Kathan
Esther Rituerto-González
Andreas Triantafyllopoulos
Florian B. Pokorny
+ Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis 2022 Yi Chang
Zhao Ren
Thanh Tam Nguyen
Wolfgang Nejdl
Björn Schüller
+ HEAR: Holistic Evaluation of Audio Representations 2022 Joseph Turian
Jordie Shier
Humair Raj Khan
Bhiksha Raj
Björn Schüller
Christian J. Steinmetz
Colin Malloy
George Tzanetakis
Gissel Velarde
Kirk McNally
+ An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion 2022 Zijiang Yang
Xin Jing
Andreas Triantafyllopoulos
Meishu Song
Ilhan Aslan
Björn Schüller
+ Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition 2022 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
+ Normalise for Fairness: A Simple Normalisation Technique for Fairness in Regression Machine Learning Problems 2022 Mostafa M. Mohamed
Björn Schüller
+ Audio Self-supervised Learning: A Survey 2022 Shuo Liu
Adria Mallol-Ragolta
Emilia Parada‐Cabaleiro
Kun Qian
Xin Jing
Alexander Kathan
Bin Hu
Björn Schüller
+ A Temporal-oriented Broadcast ResNet for COVID-19 Detection 2022 Xin Jing
Shuo Liu
Emilia Parada‐Cabaleiro
Andreas Triantafyllopoulos
Meishu Song
Zijiang Yang
Björn Schüller
+ Robust Federated Learning Against Adversarial Attacks for Speech Emotion Recognition 2022 Yi Chang
Sofiane Laridi
Zhao Ren
Gregory M. Palmer
Björn Schüller
Marco Fisichella
+ Evaluating Deep Music Generation Methods Using Data Augmentation 2022 Toby Godwin
Georgios Rizos
Alice Baird
Najla D. Al Futaisi
Vincent Brisse
Björn Schüller
+ A Summary of the ComParE COVID-19 Challenges 2022 Harry Coppock
Alican Akman
Christian Bergler
Maurice Gerczuk
Chloë Brown
Jagmohan Chauhan
Andreas Grammenos
Apinan Hasthanasombat
Dimitris Spathis
Xia Tong
+ Probing Speech Emotion Recognition Transformers for Linguistic Knowledge 2022 Andreas Triantafyllopoulos
Johannes Wagner
Hagen Wierstorf
Maximilian Schmitt
Uwe D. Reichel
Florian Eyben
Felix Burkhardt
Björn Schüller
+ Dawn of the transformer era in speech emotion recognition: closing the valence gap 2022 Johannes Wagner
Andreas Triantafyllopoulos
Hagen Wierstorf
Maximilian Schmitt
Felix Burkhardt
Florian Eyben
Björn Schüller
+ Predicting Sex and Stroke Success -- Computer-aided Player Grunt Analysis in Tennis Matches 2022 Lukas Stappen
Manuel Milling
Valentin Munst
Korakot Hoffmann
Björn Schüller
+ SVTS: Scalable Video-to-Speech Synthesis 2022 Rodrigo Mira
Alexandros Haliassos
Stavros Petridis
Björn Schüller
Maja Pantić
+ The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts 2022 Alice Baird
Panagiotis Tzirakis
Gauthier Gidel
Marco Jiralerspong
Eilif Müller
Kory W. Mathewson
Björn Schüller
Erik Cambria
Dacher Keltner
Alan Cowen
+ Journaling Data for Daily PHQ-2 Depression Prediction and Forecasting 2022 Alexander Kathan
Andreas Triantafyllopoulos
Xiangheng He
Manuel Milling
Tian‐Hao Yan
Srividya Tirunellai Rajamani
Ludwig Küster
Mathias Harrer
Elena Heber
Inga Großmann
+ The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes 2022 Björn Schüller
Anton Batliner
Shahin Amiriparian
Christian Bergler
Maurice Gerczuk
Natalie Holz
Pauline Larrouy-Maestri
Sebastian P. Bayerl
Korbinian Riedhammer
Adria Mallol-Ragolta
+ COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition 2022 Mani Kumar Tellamekala
Shahin Amiriparian
Björn Schüller
Elisabeth André
Timo Giesbrecht
Michel Valstar
+ Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction 2022 Andreas Triantafyllopoulos
Meishu Song
Zijiang Yang
Xin Jing
Björn Schüller
+ Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression 2022 Xin Jing
Meishu Song
Andreas Triantafyllopoulos
Zijiang Yang
Björn Schüller
+ Dynamic Restrained Uncertainty Weighting Loss for Multitask Learning of Vocal Expression 2022 Meishu Song
Zijiang Yang
Andreas Triantafyllopoulos
Xin Jing
Vincent Karas
Jiangjian Xie
Zixing Zhang
Yoshiharu Yamamoto
Björn Schüller
+ COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset featuring the same speakers with and without infection 2022 Andreas Triantafyllopoulos
Anastasia Semertzidou
Meishu Song
Florian B. Pokorny
Björn Schüller
+ Data Augmentation for Dementia Detection in Spoken Language 2022 Anna Hlédiková
Dominika Woszczyk
Alican Akman
Soteris Demetriou
Björn Schüller
+ Are 3D Face Shapes Expressive Enough for Recognising Continuous Emotions and Action Unit Intensities? 2022 Mani Kumar Tellamekala
Ömer Sümer
Björn Schüller
Elisabeth André
Timo Giesbrecht
Michel Valstar
+ The ACII 2022 Affective Vocal Bursts Workshop & Competition: Understanding a critically understudied modality of emotional expression 2022 Alice Baird
Panagiotis Tzirakis
Jeffrey A. Brooks
Christopher B. Gregory
Björn Schüller
Anton Batliner
Dacher Keltner
Alan Cowen
+ Multitask Learning from Augmented Auxiliary Data for Improving Speech Emotion Recognition 2022 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
+ The MuSe 2022 Multimodal Sentiment Analysis Challenge: Humor, Emotional Reactions, and Stress 2022 Lukas Christ
Shahin Amiriparian
Alice Baird
Panagiotis Tzirakis
Alexander Kathan
Niklas Müller
Lukas Stappen
Eva-Maria Meßner
Andreas König
Alan Cowen
+ Proceedings of the ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts 2022 Alice Baird
Panagiotis Tzirakis
Gauthier Gidel
Marco Jiralerspong
Eilif Müller
Kory W. Mathewson
Björn Schüller
Erik Cambria
Dacher Keltner
Alan Cowen
+ Distinguishing between pre- and post-treatment in the speech of patients with chronic obstructive pulmonary disease 2022 Andreas Triantafyllopoulos
Markus Fendler
Anton Batliner
Maurice Gerczuk
Shahin Amiriparian
Thomas M. Berghaus
Björn Schüller
+ Self-Supervised Attention Networks and Uncertainty Loss Weighting for Multi-Task Emotion Recognition on Vocal Bursts 2022 Vincent Karas
Andreas Triantafyllopoulos
Meishu Song
Björn Schüller
+ Audio Barlow Twins: Self-Supervised Audio Representation Learning 2022 Jonah Anton
Harry Coppock
Pancham Shukla
Björn Schüller
+ An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era 2022 Andreas Triantafyllopoulos
Björn Schüller
Gökçe İymen
Tevfik Metin Sezgin
Xiangheng He
Zijiang Yang
Panagiotis Tzirakis
Shuo Liu
Silvan Mertes
Elisabeth André
+ Propagating Variational Model Uncertainty for Bioacoustic Call Label Smoothing 2022 Georgios Rizos
Jenna Lawson
Simon F. Mitchell
Pranay Shah
Xin Wen
Cristina Banks‐Leite
Robert M. Ewers
Björn Schüller
+ Fast Yet Effective Speech Emotion Recognition with Self-distillation 2022 Zhao Ren
Thành Tâm Nguyên
Yi Chang
Björn Schüller
+ Knowledge Transfer For On-Device Speech Emotion Recognition with Neural Structured Learning 2022 Yi Chang
Zhao Ren
Thành Tâm Nguyên
Kun Qian
Björn Schüller
+ Depression Diagnosis and Forecast based on Mobile Phone Sensor Data 2022 Xiangheng He
Andreas Triantafyllopoulos
Alexander Kathan
Manuel Milling
Tian‐Hao Yan
Srividya Tirunellai Rajamani
Ludwig Küster
Mathias Harrer
Elena Heber
Inga Großmann
+ Proceedings of the ACII Affective Vocal Bursts Workshop and Competition 2022 (A-VB): Understanding a critically understudied modality of emotional expression 2022 Alice Baird
Panagiotis Tzirakis
Jeffrey A. Brooks
Christopher B. Gregory
Björn Schüller
Anton Batliner
Dacher Keltner
Alan Cowen
+ AI-Based Emotion Recognition: Promise, Peril, and Prescriptions for Prosocial Path 2022 Siddique Latif
Hafiz Shehbaz Ali
Muhammad Usama
Rajib Rana
Björn Schüller
Junaid Qadir
+ Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning 2022 Rui Liu
Berrak Şişman
Björn Schüller
Guanglai Gao
Haizhou Li
+ A large-scale and PCR-referenced vocal audio dataset for COVID-19 2022 Jobie Budd
Kieran Baker
Emma Karoune
Harry Coppock
Selina Patel
Ana Tendero Cañadas
Alexander Titcomb
Richard Payne
David J. Hurley
Sabrina Egglestone
+ Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19 2022 Davide Pigoli
Kieran Baker
Jobie Budd
Lorraine Butler
Harry Coppock
Sabrina Egglestone
Steven G. Gilmour
Chris Holmes
David J. Hurley
Radka Jersakova
+ Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers 2022 Harry Coppock
George Nicholson
Ivan Kiskin
Vasiliki Koutra
Kieran Baker
Jobie Budd
Richard Payne
Emma Karoune
David J. Hurley
Alexander Titcomb
+ Automatic Emotion Modelling in Written Stories 2022 Lukas Christ
Shahin Amiriparian
Manuel Milling
Ilhan Aslan
Björn Schüller
+ Fatigue Prediction in Outdoor Running Conditions using Audio Data 2022 Andreas Triantafyllopoulos
Sandra Ottl
Alexander Gebhard
Esther Rituerto-González
Mirko Jaumann
Steffen Hüttner
Valerie Dieter
Patrick Schneeweiß
Inga Krauß
Maurice Gerczuk
+ Insights on Modelling Physiological, Appraisal, and Affective Indicators of Stress using Audio Features 2022 Andreas Triantafyllopoulos
Sandra Zänkert
Alice Baird
Julian Konzok
Brigitte M. Kudielka
Björn Schüller
+ Towards Multimodal Prediction of Spontaneous Humour: A Novel Dataset and First Results 2022 Lukas Christ
Shahin Amiriparian
Alexander Kathan
Niklas Müller
Andreas König
Björn Schüller
+ PDF Chat EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition 2021 Maurice Gerczuk
Shahin Amiriparian
Sandra Ottl
Björn Schüller
+ PDF Chat Bias and privacy in AI's cough-based COVID-19 recognition – Authors' reply 2021 Harry Coppock
Lyn Jones
Ivan Kiskin
Björn Schüller
+ Facial Emotion Recognition using Deep Residual Networks in Real-World Environments 2021 Panagiotis Tzirakis
Dénes Boros
Elnar Hajiyev
Björn Schüller
+ PDF Chat GraphTMT: Unsupervised Graph-based Topic Modeling from Video Transcripts 2021 Jason Thies
Lukas Stappen
Gerhard Hagerer
Björn Schüller
Georg Groh
+ Fitbeat: COVID-19 estimation based on wristband heart rate using a contrastive convolutional auto-encoder 2021 Shuo Liu
Jing Han
Estela Laporta
Spyridon Kontaxis
Shaoxiong Sun
Patrick Locatelli
Judith Dineley
Florian B. Pokorny
Gloria Dalla Costa
Letizia Leocani
+ PDF Chat A Physiologically-Adapted Gold Standard for Arousal during Stress 2021 Alice Baird
Lukas Stappen
Lukas Christ
Lea Schumann
Eva-Maria Meßner
Björn Schüller
+ PDF Chat Evaluating Deep Music Generation Methods Using Data Augmentation 2021 Toby Godwin
Georgios Rizos
Alice Baird
Najla D. Al Futaisi
Vincent Brisse
Björn Schüller
+ PDF Chat Fairness and underspecification in acoustic scene classification: The case for disaggregated evaluations 2021 Andreas Triantafyllopoulos
Manuel Milling
Konstantinos Drossos
Björn Schüller
+ A Machine Learning Framework for Automatic Prediction of Human Semen Motility 2021 Sandra Ottl
Shahin Amiriparian
Maurice Gerczuk
Björn Schüller
+ PDF Chat Remote Smartphone-Based Speech Collection: Acceptance and Barriers in Individuals with Major Depressive Disorder 2021 Judith Dineley
Grace Lavelle
Daniel Leightley
Faith Matcham
Sara Siddi
Maria Teresa Peñarrubia‐María
Katie M White
Alina Ivan
Carolin Oetzmann
Sara Simblett
+ PDF Chat An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation 2021 Xiangheng He
Junjie Chen
Georgios Rizos
Björn Schüller
+ PDF Chat LiRA: Learning Visual Speech Representations from Audio Through Self-Supervision 2021 Pingchuan Ma
Rodrigo Mira
Stavros Petridis
Björn Schüller
Maja Pantić
+ PDF Chat The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements 2021 Lukas Stappen
Alice Baird
Lea Schumann
Björn Schüller
+ A Physiologically-adapted Gold Standard for Arousal During a Stress Induced Scenario 2021 Alice Baird
Lukas Stappen
Lukas Christ
Lea Schumann
Eva-Maria Meßner
Björn Schüller
+ A Physiologically-Adapted Gold Standard for Arousal during Stress 2021 Alice Baird
Lukas Stappen
Lukas Christ
Lea Schumann
Eva-Maria Meßner
Björn Schüller
+ MuSe-Toolbox: The Multimodal Sentiment Analysis Continuous Annotation Fusion and Discrete Class Transformation Toolbox 2021 Lukas Stappen
Lea Schumann
Benjamin Sertolli
Alice Baird
Benjamin Weigel
Erik Cambria
Björn Schüller
+ PDF Chat An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation 2021 Xiangheng He
Junjie Chen
Georgios Rizos
Björn Schüller
+ PDF Chat Affective Image Content Analysis: Two Decades Review and New Perspectives 2021 Sicheng Zhao
Xingxu Yao
Jufeng Yang
Guoli Jia
Guiguang Ding
Tat‐Seng Chua
Björn Schüller
Kurt Keutzer
+ PDF Chat LiRA: Learning Visual Speech Representations from Audio through Self-supervision 2021 Pingchuan Ma
Rodrigo Mira
Stavros Petridis
Björn Schüller
Maja Pantić
+ PDF Chat Prediction on Mechanical Properties of Non-Equiatomic High-Entropy Alloy by Atomistic Simulation and Machine Learning 2021 Liang Zhang
Kun Qian
Björn Schüller
Yasushi Shibuta
+ The voice of COVID-19: Acoustic correlates of infection in sustained vowels 2021 Katrin D. Bartl-Pokorny
Florian B. Pokorny
Anton Batliner
Shahin Amiriparian
Anastasia Semertzidou
Florian Eyben
Elena Kramer
Florian Schmidt
R. Schönweiler
Markus Wehler
+ Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks 2021 Mina A. Nessiem
Mostafa M. Mohamed
Harry Coppock
Alexander Gaskell
Björn Schüller
+ PDF Chat Speech Emotion Recognition Using Semantic Information 2021 Panagiotis Tzirakis
Anh Gia-Tuan Nguyen
Stefanos Zafeiriou
Björn Schüller
+ An Estimation of Online Video User Engagement from Features of Continuous Emotions 2021 Lukas Stappen
Alice Baird
Michelle Lienhart
Annalena Bätz
Björn Schüller
+ Unsupervised Graph-based Topic Modeling from Video Transcriptions 2021 Lukas Stappen
Gerhard Hagerer
Björn Schüller
Georg Groh
+ PDF Chat Learning audio sequence representations for acoustic event classification 2021 Zixing Zhang
Ding Liu
Jing Han
Kun Qian
Björn Schüller
+ The MuSe 2021 Multimodal Sentiment Analysis Challenge: Sentiment, Emotion, Physiological-Emotion, and Stress 2021 Lukas Stappen
Alice Baird
Lukas Christ
Lea Schumann
Benjamin Sertolli
Eva-Maria Meßner
Erik Cambria
Guoying Zhao
Björn Schüller
+ PDF Chat Recent Advances in Computer Audition for Diagnosing COVID-19: An Overview 2021 Kun Qian
Björn Schüller
Yoshiharu Yamamoto
+ Speech Emotion Recognition using Semantic Information 2021 Panagiotis Tzirakis
Anh Gia-Tuan Nguyen
Stefanos Zafeiriou
Björn Schüller
+ The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates 2021 Björn Schüller
Anton Batliner
Christian Bergler
Cecilia Mascolo
Jing Han
Iulia Lefter
Heysem Kaya
Shahin Amiriparian
Alice Baird
Lukas Stappen
+ The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements 2021 Lukas Stappen
Alice Baird
Lea Schumann
Björn Schüller
+ End-2-End COVID-19 Detection from Breath & Cough Audio 2021 Harry Coppock
Alexander Gaskell
Panagiotis Tzirakis
Alice Baird
Lyn Jones
Björn Schüller
+ Personalized Federated Deep Learning for Pain Estimation From Face Images 2021 Ognjen Rudovic
Nicolas Tobis
Sebastian Kaltwang
Björn Schüller
Daniel Rueckert
Jeffrey F. Cohn
Rosalind W. Picard
+ Deep Attention-based Representation Learning for Heart Sound Classification 2021 Zhao Ren
Kun Qian
Fengquan Dong
Zhenyu Dai
Yoshiharu Yamamoto
Björn Schüller
+ An Enhanced Adversarial Network with Combined Latent Features for Spatio-temporal Facial Affect Estimation in the Wild 2021 Decky Aspandi
Federico M. Sukno
Björn Schüller
Xavier Binefa
+ Computational Emotion Analysis From Images: Recent Advances and Future Directions 2021 Sicheng Zhao
Quanwei Huang
Youbao Tang
Xingxu Yao
Jufeng Yang
Guiguang Ding
Björn Schüller
+ Fitbeat: COVID-19 Estimation based on Wristband Heart Rate 2021 Shuo Liu
Jing Han
Estela Laporta
Spyridon Kontaxis
Shaoxiong Sun
Patrick Locatelli
Judith Dineley
Florian B. Pokorny
Gloria Dalla Costa
Letizia Leocani
+ On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era 2021 Shahin Amiriparian
Artem Sokolov
Ilhan Aslan
Lukas Christ
Maurice Gerczuk
Tobias Hübner
Dmitry Lamanov
Manuel Milling
Sandra Ottl
Ilya Poduremennykh
+ DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing from Decentralised Data 2021 Shahin Amiriparian
Tobias Hübner
Maurice Gerczuk
Sandra Ottl
Björn Schüller
+ PDF Chat Poisson CNN: Convolutional neural networks for the solution of the Poisson equation on a Cartesian mesh 2021 Ali Girayhan Özbay
Arash Hamzehloo
Sylvain Laizet
Panagiotis Tzirakis
Georgios Rizos
Björn Schüller
+ LiRA: Learning Visual Speech Representations from Audio through Self-supervision 2021 Pingchuan Ma
Rodrigo Mira
Stavros Petridis
Björn Schüller
Maja Pantić
+ Affective Image Content Analysis: Two Decades Review and New Perspectives 2021 Sicheng Zhao
Xingxu Yao
Jufeng Yang
Guoli Jia
Guiguang Ding
Tat‐Seng Chua
Björn Schüller
Kurt Keutzer
+ An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation 2021 Xiangheng He
Junjie Chen
Georgios Rizos
Björn Schüller
+ The EIHW-GLAM Deep Attentive Multi-model Fusion System for Cough-based COVID-19 Recognition in the DiCOVA 2021 Challenge 2021 Zhao Ren
Yi Chang
Björn Schüller
+ EIHW-MTG DiCOVA 2021 Challenge System Report 2021 Adria Mallol-Ragolta
Helena Cuesta
Emília Gómez
Björn Schüller
+ EIHW-MTG: Second DiCOVA Challenge System Report 2021 Adria Mallol-Ragolta
Helena Cuesta
Emília Gómez
Björn Schüller
+ Facial Emotion Recognition using Deep Residual Networks in Real-World Environments 2021 Panagiotis Tzirakis
Dénes Boros
Elnar Hajiyev
Björn Schüller
+ Multistage linguistic conditioning of convolutional layers for speech emotion recognition 2021 Andreas Triantafyllopoulos
Uwe D. Reichel
Shuo Liu
Stephan M. Huber
Florian Eyben
Björn Schüller
+ Evaluating the COVID-19 Identification ResNet (CIdeR) on the INTERSPEECH COVID-19 from Audio Challenges 2021 Alican Akman
Harry Coppock
Alexander Gaskell
Panagiotis Tzirakis
Lyn H. Jones
Björn Schüller
+ A Physiologically-Adapted Gold Standard for Arousal during Stress 2021 Alice Baird
Lukas Stappen
Lukas Christ
Lea Schumann
Eva-Maria Meßner
Björn Schüller
+ MuSe-Toolbox: The Multimodal Sentiment Analysis Continuous Annotation Fusion and Discrete Class Transformation Toolbox 2021 Lukas Stappen
Lea Schumann
Benjamin Sertolli
Alice Baird
Benjamin Weigel
Erik Cambria
Björn Schüller
+ An Estimation of Online Video User Engagement from Features of Continuous Emotions 2021 Lukas Stappen
Alice Baird
Michelle Lienhart
Annalena Bätz
Björn Schüller
+ GraphTMT: Unsupervised Graph-based Topic Modeling from Video Transcripts 2021 Lukas Stappen
Jason Thies
Gerhard Hagerer
Björn Schüller
Georg Groh
+ The MuSe 2021 Multimodal Sentiment Analysis Challenge: Sentiment, Emotion, Physiological-Emotion, and Stress 2021 Lukas Stappen
Alice Baird
Lukas Christ
Lea Schumann
Benjamin Sertolli
Eva-Maria Meßner
Erik Cambria
Guoying Zhao
Björn Schüller
+ Speech Emotion Recognition using Semantic Information 2021 Panagiotis Tzirakis
Anh Nguyen
Stefanos Zafeiriou
Björn Schüller
+ The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements 2021 Lukas Stappen
Alice Baird
Lea Schumann
Björn Schüller
+ End-2-End COVID-19 Detection from Breath & Cough Audio 2021 Harry Coppock
Alexander Gaskell
Panagiotis Tzirakis
Alice Baird
Lyn H. Jones
Björn Schüller
+ Fairness and underspecification in acoustic scene classification: The case for disaggregated evaluations 2021 Andreas Triantafyllopoulos
Manuel Milling
Konstantinos Drossos
Björn Schüller
+ EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition 2021 Maurice Gerczuk
Shahin Amiriparian
Sandra Ottl
Björn Schüller
+ A Machine Learning Framework for Automatic Prediction of Human Semen Motility 2021 Sandra Ottl
Shahin Amiriparian
Maurice Gerczuk
Björn Schüller
+ The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates 2021 Björn Schüller
Anton Batliner
Christian Bergler
Cecilia Mascolo
Jing Han
Iulia Lefter
Heysem Kaya
Shahin Amiriparian
Alice Baird
Lukas Stappen
+ Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks 2020 Björn Schüller
Harry Coppock
Alexander Gaskell
+ Audio, Speech, Language, & Signal Processing for COVID-19: A Comprehensive Overview 2020 Gauri Deshpande
Björn Schüller
+ PDF Chat CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification 2020 Zhao Ren
Qiuqiang Kong
Jing Han
Mark D. Plumbley
Björn Schüller
+ PDF Chat Synthesising 3D Facial Motion from “In-the-Wild” Speech 2020 Panagiotis Tzirakis
Athanasios Papaioannou
Alexandros Lattas
Michail Tarasiou
Björn Schüller
Stefanos Zafeiriou
+ PDF Chat Augmenting Generative Adversarial Networks for Speech Emotion Recognition 2020 Siddique Latif
Muhammad Asim
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
+ An Early Study on Intelligent Analysis of Speech Under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety 2020 Jing Han
Kun Qian
Meishu Song
Zijiang Yang
Zhao Ren
Shuo Liu
Juan Liu
Huaiyuan Zheng
Wei Ji
Tomoya Koike
+ PDF Chat Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks, and Cross-Corpus Setting for Speech Emotion Recognition 2020 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
+ Go-CaRD - Generic, Optical Car Part Recognition and Detection: Collection, Insights, and Applications 2020 Lukas Stappen
Xinchen Du
Vincent Karas
Stefan Müller
Björn Schüller
+ High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder 2020 Kazi Nazmul Haque
Rajib Rana
Björn Schüller
+ Augmenting Generative Adversarial Networks for Speech Emotion Recognition 2020 Siddique Latif
Muhammad Asim
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
+ Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks, and Cross-corpus Setting for Speech Emotion Recognition 2020 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
+ An Overview on Audio, Signal, Speech, & Language Processing for COVID-19 2020 Gauri Deshpande
Björn Schüller
+ A Novel Fusion of Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech 2020 Shahin Amiriparian
Pawel Winokurow
Vincent Karas
Sandra Ottl
Maurice Gerczuk
Björn Schüller
+ ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition 2020 Mostafa M. Mohamed
Björn Schüller
+ PDF Chat Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition 2020 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Julien Epps
Björn Schüller
+ Adversarial-based neural networks for affect estimations in the wild 2020 Decky Aspandi
Adria Mallol-Ragolta
Björn Schüller
Xavier Binefa
+ Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends 2020 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn Schüller
+ Guided Generative Adversarial Neural Network for Representation Learning and High Fidelity Audio Generation using Fewer Labelled Audio Data 2020 Kazi Nazmul Haque
Rajib Rana
John H. L. Hansen
Björn Schüller
+ COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis 2020 Björn Schüller
Dagmar Schuller
Kun Qian
Juan Liu
Huaiyuan Zheng
Xiao Li
+ Prediction of mechanical properties of non-equiatomic high-entropy alloy by atomistic simulation and machine learning 2020 Liang Zhang
Kun Qian
Björn Schüller
Cheng Lü
Yasushi Shibuta
Xiaoxu Huang
+ An Early Study on Intelligent Analysis of Speech under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety 2020 Jing Han
Kun Qian
Meishu Song
Zijiang Yang
Zhao Ren
Shuo Liu
Juan Liu
Huaiyuan Zheng
Wei Ji
Tomoya Koike
+ MuSe 2020 -- The First International Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop 2020 Lukas Stappen
Alice Baird
Georgios Rizos
Panagiotis Tzirakis
Xinchen Du
Felix Hafner
Lea Schumann
Adria Mallol-Ragolta
Björn Schüller
Iulia Lefter
+ Cross-lingual Zero- and Few-shot Hate Speech Detection Utilising Frozen Transformer Language Models and AXEL 2020 Lukas Stappen
Fabian Brunn
Björn Schüller
+ deepSELF: An Open Source Deep Self End-to-End Learning Framework 2020 Tomoya Koike
Kun Qian
Björn Schüller
Yoshiharu Yamamoto
+ On Deep Speech Packet Loss Concealment: A Mini-Survey 2020 Mostafa M. Mohamed
Mina A. Nessiem
Björn Schüller
+ "I have vxxx bxx connexxxn!": Facing Packet Loss in Deep Speech Emotion Recognition 2020 Mostafa M. Mohamed
Björn Schüller
+ Deep Reinforcement Learning with Pre-training for Time-efficient Training of Automatic Speech Recognition 2020 Thejan Rajapakshe
Siddique Latif
Rajib Rana
Sara Khalifa
Björn Schüller
+ MeDaS: An open-source platform as service to help break the walls between medicine and informatics 2020 Liang Zhang
Johann Li
Ping Li
Xiaoyuan Lu
Peiyi Shen
Guangming Zhu
Syed Afaq Ali Shah
Mohammed Bennamoun
Kun Qian
Björn Schüller
+ Capturing dynamics of post-earnings-announcement drift using genetic algorithm-optimised supervised learning 2020 Zhengxin Joseph Ye
Björn Schüller
+ PDF Chat High-Fidelity Audio Generation and Representation Learning With Guided Adversarial Autoencoder 2020 Kazi Nazmul Haque
Rajib Rana
Björn Schüller
+ Recent Advances in Computer Audition for Diagnosing COVID-19: An Overview 2020 Kun Qian
Björn Schüller
Yoshiharu Yamamoto
+ Domain Adaptation with Joint Learning for Generic, Optical Car Part Recognition and Detection Systems (Go-CaRD) 2020 Lukas Stappen
Xinchen Du
Vincent Karas
Stefan Müller
Björn Schüller
+ Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks 2020 Björn Schüller
Harry Coppock
Alexander Gaskell
+ Audio, Speech, Language, & Signal Processing for COVID-19: A Comprehensive Overview 2020 Gauri Deshpande
Björn Schüller
+ High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder 2020 Kazi Nazmul Haque
Rajib Rana
Björn Schüller
+ Augmenting Generative Adversarial Networks for Speech Emotion Recognition 2020 Siddique Latif
Muhammad Asim
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
+ Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks, and Cross-corpus Setting for Speech Emotion Recognition 2020 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
+ ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition 2020 Mostafa M. Mohamed
Björn Schüller
+ A Novel Fusion of Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech 2020 Shahin Amiriparian
Pawel Winokurow
Vincent Karas
Sandra Ottl
Maurice Gerczuk
Björn Schüller
+ Adversarial-based neural networks for affect estimations in the wild 2020 Decky Aspandi
Adria Mallol-Ragolta
Björn Schüller
Xavier Binefa
+ An Overview on Audio, Signal, Speech, & Language Processing for COVID-19 2020 Gauri Deshpande
Björn Schüller
+ Convolutional Neural Networks for the Solution of the 2D Poisson Equation with Arbitrary Dirichlet Boundary Conditions, Mesh Sizes and Grid Spacings 2019 Ali Girayhan Özbay
Panagiotis Tzirakis
Georgios Rizos
Björn Schüller
Sylvain Laizet
+ Poisson CNN: Convolutional Neural Networks for the Solution of the Poisson Equation with Varying Meshes and Dirichlet Boundary Conditions 2019 Ali Girayhan Özbay
Sylvain Laizet
Panagiotis Tzirakis
Georgios Rizos
Björn Schüller
+ PDF Chat AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition 2019 Fabien Ringeval
Björn Schüller
Michel Valstar
Nicholas Cummins
Roddy Cowie
Leili Tavabi
Maximilian Schmitt
Sina Alisamir
Shahin Amiriparian
Eva-Maria Meßner
+ PDF Chat Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach 2019 Ognjen Rudovic
Meiru Zhang
Björn Schüller
Rosalind W. Picard
+ PDF Chat SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild 2019 Jean Kossaifi
Robert Walecki
Yannis Panagakis
Jie Shen
Maximilian Schmitt
Fabien Ringeval
Jing Han
Vedhas Pandit
Antoine Toisoul
Björn Schüller
+ On Laughter and Speech-Laugh, Based on Observations of Child-Robot Interaction 2019 Anton Batliner
Stefan Steidl
Florian Eyben
Björn Schüller
+ Presenting the Acoustic Sounds for Wellbeing Dataset and Baseline Classification Results 2019 Alice Baird
Björn Schüller
+ PDF Chat EmoBed: Strengthening Monomodal Emotion Recognition via Training with Crossmodal Emotion Embeddings 2019 Jing Han
Zixing Zhang
Zhao Ren
Björn Schüller
+ Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach 2019 Ognjen Rudovic
Meiru Zhang
Björn Schüller
Rosalind W. Picard
+ A comparison of online automatic speech recognition systems and the nonverbal responses to unintelligible speech 2019 Joshua Y. Kim
Chunfeng Liu
Rafael A. Calvo
Kathryn McCabe
Silas Taylor
Björn Schüller
Kaihang Wu
+ PDF Chat Attention-augmented End-to-end Multi-task Learning for Emotion Prediction from Speech 2019 Zixing Zhang
Bingwen Wu
Björn Schüller
+ Synthesising 3D Facial Motion from "In-the-Wild" Speech 2019 Panagiotis Tzirakis
Athanasios Papaioannou
Alexandros Lattas
Michail Tarasiou
Björn Schüller
Stefanos Zafeiriou
+ PDF Chat Snore-GANs: Improving Automatic Snore Sound Classification With Synthesized Data 2019 Zixing Zhang
Jing Han
Kun Qian
Christoph Janott
Yanan Guo
Björn Schüller
+ Responsible and Representative Multimodal Data Acquisition and Analysis: On Auditability, Benchmarking, Confidence, Data-Reliance & Explainability 2019 Alice Baird
Simone Hantke
Björn Schüller
+ On Many-to-Many Mapping Between Concordance Correlation Coefficient and Mean Square Error 2019 Vedhas Pandit
Björn Schüller
+ The Many-to-Many Mapping Between the Concordance Correlation Coefficient and the Mean Square Error 2019 Vedhas Pandit
Björn Schüller
+ PDF Chat Deep Affect Prediction in-the-Wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond 2019 Dimitrios Kollias
Panagiotis Tzirakis
Mihalis A. Nicolaou
Athanasios Papaioannou
Guoying Zhao
Björn Schüller
Irene Kotsia
Stefanos Zafeiriou
+ Voice command generation using Progressive Wavegans 2019 Thomas Wiest
Nicholas Cummins
Alice Baird
Simone Hantke
Judith Dineley
Björn Schüller
+ Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data 2019 Zixing Zhang
Jing Han
Kun Qian
Christoph Janott
Yanan Guo
Björn Schüller
+ Attention-Augmented End-to-End Multi-Task Learning for Emotion Prediction from Speech 2019 Zixing Zhang
Bingwen Wu
Björn Schüller
+ Single-Channel Speech Separation with Auxiliary Speaker Embeddings 2019 Shuo Liu
Gil Keren
Björn Schüller
+ Acoustic Sounds for Wellbeing: A Novel Dataset and Baseline Results 2019 Alice Baird
Björn Schüller
+ Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition 2019 Thejan Rajapakshe
Rajib Rana
Siddique Latif
Sara Khalifa
Björn Schüller
+ N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System 2019 Shuo Liu
Gil Keren
Björn Schüller
+ AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition 2019 Fabien Ringeval
Björn Schüller
Michel Valstar
Nicholas Cummins
Roddy Cowie
Leili Tavabi
Maximilian Schmitt
Sina Alisamir
Shahin Amiriparian
Eva-Maria Meßner
+ Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition 2019 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Julien Epps
Björn Schüller
+ Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach 2019 Ognjen Rudovic
Meiru Zhang
Björn Schüller
Rosalind W. Picard
+ A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech 2019 Joshua Y. Kim
Chunfeng Liu
Rafael A. Calvo
Kathryn McCabe
Silas Taylor
Björn Schüller
Kaihang Wu
+ Synthesising 3D Facial Motion from "In-the-Wild" Speech 2019 Panagiotis Tzirakis
Αθανάσιος Παπαϊωάννου
Alexandros Lattas
Michail Tarasiou
Björn Schüller
Stefanos Zafeiriou
+ The Many-to-Many Mapping Between the Concordance Correlation Coefficient and the Mean Square Error 2019 Vedhas Pandit
Björn Schüller
+ On Laughter and Speech-Laugh, Based on Observations of Child-Robot Interaction 2019 Anton Batliner
Stefan Steidl
Florian Eyben
Björn Schüller
+ Responsible and Representative Multimodal Data Acquisition and Analysis: On Auditability, Benchmarking, Confidence, Data-Reliance & Explainability 2019 Alice Baird
Simone Hantke
Björn Schüller
+ PDF Chat Fast Single-Class Classification and the Principle of Logit Separation 2018 Gil Keren
Sivan Sabato
Björn Schüller
+ PDF Chat Dynamic Difficulty Awareness Training for Continuous Emotion Prediction 2018 Zixing Zhang
Jing Han
Eduardo Coutinho
Björn Schüller
+ PDF Chat Scaling speech enhancement in unseen environments with noise embeddings 2018 Gil Keren
Jing Han
Björn Schüller
+ PDF Chat Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification 2018 Siyang Song
Shuimei Zhang
Björn Schüller
Linlin Shen
Michel Valstar
+ Personalized machine learning for robot perception of affect and engagement in autism therapy 2018 Ognjen Rudovic
Jaeryoung Lee
Miles Dai
Björn Schüller
Rosalind W. Picard
+ audEERING's approach to the One-Minute-Gradual Emotion Challenge 2018 Andreas Triantafyllopoulos
Hesam Sagha
Florian Eyben
Björn Schüller
+ End2You -- The Imperial Toolkit for Multimodal Profiling by End-to-End Learning 2018 Panagiotis Tzirakis
Stefanos Zafeiriou
Björn Schüller
+ Weakly Supervised One-Shot Detection with Attention Siamese Networks 2018 Gil Keren
Maximilian Schmitt
Thomas Kehrenberg
Björn Schüller
+ Applying Cooperative Machine Learning to Speed Up the Annotation of Social Signals in Large Multi-modal Corpora 2018 Johannes Wagner
Tobias Baur
Yue Zhang
Michel Valstar
Björn Schüller
Elisabeth André
+ Calibrated Prediction Intervals for Neural Network Regressors 2018 Gil Keren
Nicholas Cummins
Björn Schüller
+ Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification 2018 Siyang Song
Shuimei Zhang
Björn Schüller
Linlin Shen
Michel Valstar
+ Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives 2018 Jing Han
Zixing Zhang
Nicholas Cummins
Björn Schüller
+ Scaling Speech Enhancement in Unseen Environments with Noise Embeddings 2018 Gil Keren
Jing Han
Björn Schüller
+ audEERING's approach to the One-Minute-Gradual Emotion Challenge 2018 Andreas Triantafyllopoulos
Hesam Sagha
Florian Eyben
Björn Schüller
+ Weakly Supervised One-Shot Detection with Attention Similarity Networks 2018 Gil Keren
Maximilian Schmitt
Thomas Kehrenberg
Björn Schüller
+ auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks 2017 Michael Freitag
Shahin Amiriparian
Sergey Pugachevskiy
Nicholas Cummins
Björn Schüller
+ PDF Chat End-to-End Multimodal Emotion Recognition Using Deep Neural Networks 2017 Panagiotis Tzirakis
George Trigeorgis
Mihalis A. Nicolaou
Björn Schüller
Stefanos Zafeiriou
+ PDF Chat DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding 2017 Dieu Linh Tran
Robert Walecki
Ognjen Rudovic
Stefanos Eleftheriadis
Björn Schüller
Maja Pantić
+ PDF Chat Deep Structured Learning for Facial Action Unit Intensity Estimation 2017 Robert Walecki
Ognjen Rudovic
Vladimir Pavlović
Björn Schüller
Maja Pantić
+ Deep Structured Learning for Facial Action Unit Intensity Estimation 2017 Robert Walecki
Ognjen Rudovic
Vladimir Pavlović
Björn Schüller
Maja Pantić
+ DeepCoder: Semi-parametric Variational Autoencoders for Facial Action Unit Intensity Estimation. 2017 Dieu Linh Tran
Robert Walecki
Ognjen Rudovic
Stefanos Eleftheriadis
Björn Schüller
Maja Pantić
+ DeepCoder: Semi-parametric Variational Autoencoders for Automatic Facial Action Coding 2017 Dieu Linh Tran
Robert Walecki
Ognjen Rudovic
Stefanos Eleftheriadis
Björn Schüller
Maja Pantić
+ PDF Chat Tunable Sensitivity to Large Errors in Neural Network Training 2017 Gil Keren
Sivan Sabato
Björn Schüller
+ Fast Single-Class Classification and the Principle of Logit Separation 2017 Gil Keren
Sivan Sabato
Björn Schüller
+ Learning audio sequence representations for acoustic event classification 2017 Zixing Zhang
Ding Liu
Jing Han
Björn Schüller
+ Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments 2017 Zixing Zhang
Jürgen T. Geiger
Jouni Pohjalainen
Amr El-Desoky Mousa
Wenyu Jin
Björn Schüller
+ PDF Chat Detecting road surface wetness from audio: A deep learning approach 2016 Irman Abdić
Lex Fridman
Daniel E. Brown
William Angell
Bryan Reimer
Erik Marchi
Björn Schüller
+ PDF Chat Convolutional RNN: An enhanced model for extracting features from sequential data 2016 Gil Keren
Björn Schüller
+ openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit 2016 Maximilian Schmitt
Björn Schüller
+ PDF Chat A Deep Matrix Factorization Method for Learning Attribute Representations 2016 George Trigeorgis
Konstantinos Bousmalis
Stefanos Zafeiriou
Björn Schüller
+ Convolutional RNN: an Enhanced Model for Extracting Features from Sequential Data 2016 Gil Keren
Björn Schüller
+ Tunable Sensitivity to Large Errors in Neural Network Training 2016 Gil Keren
Sivan Sabato
Björn Schüller
+ AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge 2016 Michel Valstar
Jonathan Gratch
Björn Schüller
Fabien Ringeval
Denis Lalanne
Mercedes Torres Torres
Stefan Scherer
Giota Stratou
Roddy Cowie
Maja Pantić
+ Detecting Road Surface Wetness from Audio: A Deep Learning Approach 2015 Irman Abdić
Lex Fridman
Erik Marchi
Daniel Brown
William Angell
Bryan Reimer
Björn Schüller
+ The ICSTM+TUM+UP Approach to the 3rd CHiME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models 2015 Amr El-Desoky Mousa
Erik Marchi
Björn Schüller
+ A deep matrix factorization method for learning attribute representations 2015 George Trigeorgis
Konstantinos Bousmalis
Stefanos Zafeiriou
Björn Schüller
+ Detecting Road Surface Wetness from Audio: A Deep Learning Approach 2015 Irman Abdić
Lex Fridman
Erik Marchi
Daniel E. Brown
William W. Angell
Bryan Reimer
Björn Schüller
+ A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems 2014 Felix Weninger
Björn Schüller
Florian Eyben
Martin Wöllmer
Gerhard Rigoll
+ PDF Chat Acoustic Gait-based Person Identification using Hidden Markov Models 2014 Jürgen T. Geiger
Maximilian Kneißl
Björn Schüller
Gerhard Rigoll
+ Acoustic Gait-based Person Identification using Hidden Markov Models 2014 Jürgen T. Geiger
Maximilian Kneißl
Björn Schüller
Gerhard Rigoll
+ The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions 2014 Björn Schüller
Erik Marchi
Simon Baron‐Cohen
Helen O’Reilly
Delia Pigat
Peter Robinson
Ian Davies
+ The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions 2014 Björn Schüller
Erik Marchi
Simon Baron‐Cohen
Helen O’Reilly
Delia Pigat
Peter Robinson
Ian Davies
+ 6th International Symposium on Attention in Cognitive Systems 2013 2013 Lucas Paletta
Laurent Itti
Björn Schüller
Fang Fang
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
39
+ Very Deep Convolutional Networks for Large-Scale Image Recognition 2014 Karen Simonyan
Andrew Zisserman
17
+ PDF Chat End-to-End Multimodal Emotion Recognition Using Deep Neural Networks 2017 Panagiotis Tzirakis
George Trigeorgis
Mihalis A. Nicolaou
Björn Schüller
Stefanos Zafeiriou
16
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
15
+ A Concordance Correlation Coefficient to Evaluate Reproducibility 1989 Liang‐In Lin
14
+ PDF Chat Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks 2016 Kaipeng Zhang
Zhanpeng Zhang
Zhifeng Li
Yu Qiao
13
+ PDF Chat SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild 2019 Jean Kossaifi
Robert Walecki
Yannis Panagakis
Jie Shen
Maximilian Schmitt
Fabien Ringeval
Jing Han
Vedhas Pandit
Antoine Toisoul
Björn Schüller
12
+ PDF Chat Densely Connected Convolutional Networks 2017 Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
11
+ Deep Speech 2: End-to-End Speech Recognition in English and Mandarin 2015 Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
Greg Diamos
11
+ Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift 2015 Sergey Ioffe
Christian Szegedy
10
+ Neural Machine Translation by Jointly Learning to Align and Translate 2015 Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
9
+ COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis 2020 Björn Schüller
Dagmar Schuller
Kun Qian
Juan Liu
Huaiyuan Zheng
Xiao Li
9
+ Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data 2020 Chloë Brown
Jagmohan Chauhan
Andreas Grammenos
Jing Han
Apinan Hasthanasombat
Dimitris Spathis
Xia Tong
Pietro Cicuta
Cecilia Mascolo
9
+ Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends 2020 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn Schüller
9
+ PDF Chat Tacotron: Towards End-to-End Speech Synthesis 2017 Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
Navdeep Jaitly
Zongheng Yang
Ying Xiao
Zhifeng Chen
Samy Bengio
9
+ PDF Chat Direct Modelling of Speech Emotion from Raw Speech 2019 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Julien Epps
9
+ PDF Chat PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition 2020 Qiuqiang Kong
Yin Cao
Turab Iqbal
Yuxuan Wang
Wenwu Wang
Mark D. Plumbley
9
+ PDF Chat Adversarial Auto-Encoders for Speech Based Emotion Recognition 2017 Saurabh Sahu
Rahul Gupta
Ganesh Sivaraman
Wael AbdAlmageed
Carol Espy-Wilson
9
+ WaveNet: A Generative Model for Raw Audio 2016 Aäron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alexander Graves
Nal Kalchbrenner
Andrew Senior
Koray Kavukcuoglu
8
+ AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app 2020 Ali Imran
Iryna Posokhova
Haneya Naeem Qureshi
Usama Masood
Muhammad Sajid Riaz
Kamran Ali
Charles N. John
MD Iftikhar Hussain
Muhammad Nabeel
8
+ Very Deep Convolutional Networks for Large-Scale Image Recognition 2014 Karen Simonyan
Andrew Zisserman
8
+ PDF Chat Transfer Learning for Improving Speech Emotion Classification Accuracy 2018 Siddique Latif
Rajib Rana
Muhammad Shahzad Younis
Junaid Qadir
Julien Epps
8
+ Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling 2014 Jun‐Young Chung
Çaǧlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
8
+ PDF Chat Deep Learning Face Attributes in the Wild 2015 Ziwei Liu
Ping Luo
Xiaogang Wang
Xiaoou Tang
8
+ PDF Chat Learning representations of emotional speech with deep convolutional generative adversarial networks 2017 Jonathan Chang
Stefan Scherer
8
+ PDF Chat ImageNet Large Scale Visual Recognition Challenge 2015 Olga Russakovsky
Jia Deng
Hao Su
Jonathan Krause
Sanjeev Satheesh
Sean Ma
Zhiheng Huang
Andrej Karpathy
Aditya Khosla
Michael S. Bernstein
8
+ PDF Chat Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks 2017 Jun-Yan Zhu
Taesung Park
Phillip Isola
Alexei A. Efros
8
+ PDF Chat CNN architectures for large-scale audio classification 2017 Shawn Hershey
Sourish Chaudhuri
Daniel P. W. Ellis
Jort F. Gemmeke
Aren Jansen
Robert C. Moore
Manoj Plakal
Devin Platt
Rif A. Saurous
Bryan Seybold
8
+ auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks 2017 Michael Freitag
Shahin Amiriparian
Sergey Pugachevskiy
Nicholas Cummins
Björn W. Schuller
7
+ PDF Chat Attentive Convolutional Neural Network Based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech 2017 Michael Neumann
Ngoc Thang Vu
7
+ PDF Chat Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap 2023 Johannes Wagner
Andreas Triantafyllopoulos
Hagen Wierstorf
Maximilian Schmitt
Felix Burkhardt
Florian Eyben
Björn Schüller
7
+ PDF Chat Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition 2020 Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Julien Epps
Björn Schüller
7
+ PDF Chat SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition 2019 Daniel Park
William Chan
Yu Zhang
Chung‐Cheng Chiu
Barret Zoph
Ekin D. Cubuk
Quoc V. Le
7
+ PDF Chat Rethinking the Inception Architecture for Computer Vision 2016 Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jon Shlens
Zbigniew Wojna
7
+ Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift 2015 Sergey Ioffe
Christian Szegedy
7
+ PDF Chat Variational Autoencoders for Learning Latent Representations of Speech Emotion: A Preliminary Study 2018 Siddique Latif
Rajib Rana
Junaid Qadir
Julien Epps
6
+ BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018 Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
6
+ Cough Against COVID: Evidence of COVID-19 Signature in Cough Sounds 2020 Piyush Bagad
Aman Dalmia
Jigar Doshi
Arsha Nagrani
Parag Bhamare
Amrita Mahale
Saurabh Rane
Neeraj Agarwal
Rahul Panicker
6
+ COVID-19 Patient Detection from Telephone Quality Speech Data 2020 Kotra Venkata Sai Ritwik
Shareef Babu Kalluri
Deepu Vijayasenan
6
+ PDF Chat Going deeper with convolutions 2015 Christian Szegedy
Wei Liu
Yangqing Jia
Pierre Sermanet
Scott Reed
Dragomir Anguelov
Dumitru Erhan
Vincent Vanhoucke
Andrew Rabinovich
6
+ PDF Chat Image-to-Image Translation with Conditional Adversarial Networks 2017 Phillip Isola
Jun-Yan Zhu
Tinghui Zhou
Alexei A. Efros
6
+ End2You -- The Imperial Toolkit for Multimodal Profiling by End-to-End Learning 2018 Panagiotis Tzirakis
Stefanos Zafeiriou
Björn Schüller
6
+ PDF Chat Tunable Sensitivity to Large Errors in Neural Network Training 2017 Gil Keren
Sivan Sabato
Björn Schüller
6
+ PDF Chat Deep Affect Prediction in-the-Wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond 2019 Dimitrios Kollias
Panagiotis Tzirakis
Mihalis A. Nicolaou
Athanasios Papaioannou
Guoying Zhao
Björn Schüller
Irene Kotsia
Stefanos Zafeiriou
6
+ Pixel Recurrent Neural Networks 2016 Aäron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
6
+ PDF Chat Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks 2019 Santiago Pascual
Mirco Ravanelli
Joan Serrà
Antonio Bonafonte
Yoshua Bengio
6
+ PDF Chat Representation Learning: A Review and New Perspectives 2013 Yoshua Bengio
Aaron Courville
Pascal Vincent
6
+ Conditional Generative Adversarial Nets 2014 Mehdi Mirza
Simon Osindero
6
+ wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations 2020 Alexei Baevski
Henry Zhou
Abdelrahman Mohamed
Michael Auli
6
+ PDF Chat MobileNetV2: Inverted Residuals and Linear Bottlenecks 2018 Mark Sandler
Andrew Howard
Menglong Zhu
Andrey Zhmoginov
Liang-Chieh Chen
5