+
PDF
Chat
|
Parameterised Quantum Circuits for Novel Representation Learning in
Speech Emotion Recognition
|
2025
|
Thejan Rajapakshe
Rajib Rana
Farhan Riaz
Sara Khalifa
Björn W. Schuller
|
+
PDF
Chat
|
DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching
Dataset
|
2025
|
Yupei Li
Wei Zhang
Heng Yu
Huichi Zhou
Björn W. Schuller
|
+
PDF
Chat
|
DFingerNet: Noise-Adaptive Speech Enhancement for Hearing Aids
|
2025
|
Iosif Tsangko
Andreas Triantafyllopoulos
Michael G. Müller
Hendrik Schröter
Björn W. Schuller
|
+
PDF
Chat
|
MAD-UV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound
Vocalization Challenge
|
2025
|
Zijiang Yang
Meishu Song
Jing Xin
Haojie Zhang
Kun Qian
Bin Hu
Kota Tamada
Toru Takumi
Björn W. Schuller
Yoshiharu Yamamoto
|
+
|
Automating Airborne Pollen Classification: Identifying and Interpreting Hard Samples for Classifiers
|
2025
|
Manuel Milling
Simon Rampp
Andreas Triantafyllopoulos
Maria Pilar Plaza
Jens O. Brunner
Claudia Traidl‐Hoffmann
Björn W. Schuller
Athanasios Damialis
|
+
PDF
Chat
|
Gender Bias in Text-to-Video Generation Models: A case study of Sora
|
2024
|
Mohammad Nadeem
Shahab Saquib Sohail
Erik Cambria
Björn W. Schuller
Amir Hussain
|
+
|
Explainable Artificial Intelligence for Medical Applications: A Review
|
2024
|
Qiyang Sun
Alican Akman
Björn Schüller
|
+
PDF
Chat
|
Towards Friendly AI: A Comprehensive Review and New Perspectives on
Human-AI Alignment
|
2024
|
Qiyang Sun
Yupei Li
Emran Alturki
S Murthy
Björn Schüller
|
+
PDF
Chat
|
Detecting Document-level Paraphrased Machine Generated Content:
Mimicking Human Writing Style and Involving Discourse Features
|
2024
|
Yupei Li
Manuel Milling
Lucia Specia
Björn Schüller
|
+
PDF
Chat
|
Detecting Machine-Generated Music with Explainability -- A Challenge and
Early Benchmarks
|
2024
|
Yupei Li
Qiyang Sun
Hui Li
Lucia Specia
Björn Schüller
|
+
PDF
Chat
|
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible
Speech Synthesis
|
2024
|
Xiangheng He
Junjie Chen
Zixing Zhang
Björn Schüller
|
+
PDF
Chat
|
autrainer: A Modular and Extensible Deep Learning Toolkit for Computer
Audition Tasks
|
2024
|
Simon Rampp
Andreas Triantafyllopoulos
Manuel Milling
Björn Schüller
|
+
PDF
Chat
|
M6: Multi-generator, Multi-domain, Multi-lingual and cultural,
Multi-genres, Multi-instrument Machine-Generated Music Detection Databases
|
2024
|
Yupei Li
Hui Li
Lucia Specia
Björn Schüller
|
+
PDF
Chat
|
From Audio Deepfake Detection to AI-Generated Music Detection -- A
Pathway and Overview
|
2024
|
Yupei Li
Manuel Milling
Lucia Specia
Björn Schüller
|
+
PDF
Chat
|
Raw Audio Classification with Cosine Convolutional Neural Network
(CosCovNN)
|
2024
|
Kazi Nazmul Haque
Rajib Rana
T. Jarin
Björn Schüller
|
+
PDF
Chat
|
Using voice analysis as an early indicator of risk for depression in
young adults
|
2024
|
Klaus R. Scherer
Felix Burkhardt
Uwe D. Reichel
Florian Eyben
Björn Schüller
|
+
PDF
Chat
|
Explainable Artificial Intelligence for Medical Applications: A Review
|
2024
|
Qiyang Sun
Alican Akman
Björn Schüller
|
+
PDF
Chat
|
Non-Invasive Suicide Risk Prediction Through Speech Analysis
|
2024
|
Shahin Amiriparian
Maurice Gerczuk
Justina Lutz
Wolfgang Strube
Irina Papazova
Alkomiet Hasan
Alexander Kathan
Björn Schüller
|
+
PDF
Chat
|
Does the Definition of Difficulty Matter? Scoring Functions and their
Role for Curriculum Learning
|
2024
|
Simon Rampp
Manuel Milling
Andreas Triantafyllopoulos
Björn Schüller
|
+
PDF
Chat
|
Audio-based Kinship Verification Using Age Domain Conversion
|
2024
|
Qiyang Sun
Alican Akman
Xin Jing
Manuel Milling
Björn Schüller
|
+
PDF
Chat
|
Audio Explanation Synthesis with Generative Foundation Models
|
2024
|
Alican Akman
Qiyang Sun
Björn Schüller
|
+
PDF
Chat
|
PerCo (SD): Open Perceptual Compression
|
2024
|
Nikolai Körber
Eduard Kromer
Andreas Siebert
Sascha Hauke
Daniel Mueller-Gritschneder
Björn Schüller
|
+
PDF
Chat
|
Trading through Earnings Seasons using Self-Supervised Contrastive
Representation Learning
|
2024
|
Zhengxin Joseph Ye
Björn Schüller
|
+
PDF
Chat
|
Affective Computing Has Changed: The Foundation Model Disruption
|
2024
|
Björn Schüller
Adria Mallol-Ragolta
Alejandro Peña Almansa
Iosif Tsangko
Mostafa M. Amin
Anastasia Semertzidou
Lukas Christ
Shahin Amiriparian
|
+
PDF
Chat
|
Enhancing Emotional Text-to-Speech Controllability with Natural Language
Guidance through Contrastive Learning and Diffusion Models
|
2024
|
Xin Jing
Kun Zhou
Andreas Triantafyllopoulos
Björn Schüller
|
+
PDF
Chat
|
ParaCLAP – Towards a general language-audio model for computational paralinguistic tasks
|
2024
|
Xin Jing
Andreas Triantafyllopoulos
Björn Schüller
|
+
PDF
Chat
|
Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition
|
2024
|
Oliver Schrüfer
Manuel Milling
Felix Burkhardt
Florian Eyben
Björn Schüller
|
+
PDF
Chat
|
Sustained Vowels for Pre- vs Post-Treatment COPD Classification
|
2024
|
Andreas Triantafyllopoulos
Anton Batliner
Wolfgang Mayr
Markus Fendler
Florian B. Pokorny
Maurice Gerczuk
Shahin Amiriparian
Thomas M. Berghaus
Björn Schüller
|
+
PDF
Chat
|
Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment
|
2024
|
Maurice Gerczuk
Shahin Amiriparian
Justina Lutz
Wolfgang Strube
Irina Papazova
Alkomiet Hasan
Björn Schüller
|
+
PDF
Chat
|
Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition
|
2024
|
Andreas Triantafyllopoulos
Björn Schüller
|
+
PDF
Chat
|
ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets
|
2024
|
Shahin Amiriparian
Filip Packań
Maurice Gerczuk
Björn Schüller
|
+
PDF
Chat
|
This Paper Had the Smartest Reviewers - Flattery Detection Utilising an Audio-Textual Transformer-Based Approach
|
2024
|
Lukas Christ
Shahin Amiriparian
Friederike Hawighorst
Ann-Kathrin Schill
Angelo Boutalikakis
Lorenz Graf‐Vlachy
Andreas König
Björn Schüller
|
+
PDF
Chat
|
Negation Blindness in Large Language Models: Unveiling the NO Syndrome
in Image Generation
|
2024
|
Mohammad Nadeem
Shahab Saquib Sohail
Erik Cambria
Björn Schüller
Amir Hussain
|
+
PDF
Chat
|
Audio-Based Step-Count Estimation for Running - Windowing and Neural Network Baselines
|
2024
|
Philipp Wagner
Andreas Triantafyllopoulos
Alexander Gebhard
Björn Schüller
|
+
PDF
Chat
|
Wav2Small: Distilling Wav2Vec2 to 72K parameters for Low-Resource Speech
emotion recognition
|
2024
|
Dionyssos Kounadis-Bastian
Oliver Schrüfer
Anna Derington
Hagen Wierstorf
Florian Eyben
Felix Burkhardt
Björn Schüller
|
+
PDF
Chat
|
Computer Audition: From Task-Specific Machine Learning to Foundation
Models
|
2024
|
Andreas Triantafyllopoulos
Iosif Tsangko
Alexander Gebhard
Annamaria Mesaros
Tuomas Virtanen
Björn Schüller
|
+
PDF
Chat
|
A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era
|
2024
|
Zhao Ren
Yi Chang
Thành Tâm Nguyên
Yang Tan
Kun Qian
Björn Schüller
|
+
PDF
Chat
|
Emotion and Intent Joint Understanding in Multimodal Conversation: A
Benchmarking Dataset
|
2024
|
Rui Liu
Haolin Zuo
Zheng Lian
Xiaofen Xing
Björn Schüller
Haizhou Li
|
+
PDF
Chat
|
Are you sure? Analysing Uncertainty Quantification Approaches for
Real-world Speech Emotion Recognition
|
2024
|
Oliver Schrüfer
Manuel Milling
Felix Burkhardt
Florian Eyben
Björn Schüller
|
+
PDF
Chat
|
Audio Enhancement for Computer Audition—An Iterative Training Paradigm Using Sample Importance
|
2024
|
Manuel Milling
Shuo Liu
Andreas Triantafyllopoulos
Ilhan Aslan
Björn Schüller
|
+
PDF
Chat
|
A Wide Evaluation of ChatGPT on Affective Computing Tasks
|
2024
|
Mostafa M. Amin
Rui Mao
Erik Cambria
Björn Schüller
|
+
PDF
Chat
|
Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk
Assessment
|
2024
|
Maurice Gerczuk
Shahin Amiriparian
Justina Lutz
Wolfgang Strube
Irina Papazova
Alkomiet Hasan
Björn Schüller
|
+
PDF
Chat
|
This Paper Had the Smartest Reviewers -- Flattery Detection Utilising an
Audio-Textual Transformer-Based Approach
|
2024
|
Lukas Christ
Shahin Amiriparian
Friederike Hawighorst
Ann-Kathrin Schill
Angelo Boutalikakis
Lorenz Graf‐Vlachy
Andreas König
Björn Schüller
|
+
PDF
Chat
|
Speech Emotion Recognition under Resource Constraints with Data
Distillation
|
2024
|
Yi Chang
Zhao Ren
Zhonghao Zhao
Thành Tâm Nguyên
Kun Qian
Tanja Schultz
Björn Schüller
|
+
PDF
Chat
|
ParaCLAP -- Towards a general language-audio model for computational
paralinguistic tasks
|
2024
|
Xin Jing
Andreas Triantafyllopoulos
Björn Schüller
|
+
PDF
Chat
|
The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception
and Humor Recognition
|
2024
|
Shahin Amiriparian
Lukas Christ
Alexander Kathan
Maurice Gerczuk
Niklas Müller
Steffen Klug
Lukas Stappen
Andreas König
Erik Cambria
Björn Schüller
|
+
PDF
Chat
|
DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus
Bird Species Recognition
|
2024
|
Xin Jing
Luyang Zhang
Jiangjian Xie
Alexander Gebhard
Alice Baird
Björn Schüller
|
+
PDF
Chat
|
ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37
Emotion Datasets
|
2024
|
Shahin Amiriparian
Filip Packań
Maurice Gerczuk
Björn Schüller
|
+
PDF
Chat
|
An automatic analysis of ultrasound vocalisations for the prediction of
interaction context in captive Egyptian fruit bats
|
2024
|
Andreas Triantafyllopoulos
Alexander Gebhard
Manuel Milling
Simon Rampp
Björn Schüller
|
+
PDF
Chat
|
Audio-based Step-count Estimation for Running -- Windowing and Neural
Network Baselines
|
2024
|
Philipp Wagner
Andreas Triantafyllopoulos
Alexander Gebhard
Björn Schüller
|
+
PDF
Chat
|
Sustained Vowels for Pre- vs Post-Treatment COPD Classification
|
2024
|
Andreas Triantafyllopoulos
Anton Batliner
Wolfgang Mayr
Markus Fendler
Florian B. Pokorny
Maurice Gerczuk
Shahin Amiriparian
Thomas M. Berghaus
Björn Schüller
|
+
PDF
Chat
|
INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of
Progress in Speech Emotion Recognition
|
2024
|
Andreas Triantafyllopoulos
Anton Batliner
Simon Rampp
Manuel Milling
Björn Schüller
|
+
PDF
Chat
|
Enrolment-based personalisation for improving individual-level fairness
in speech emotion recognition
|
2024
|
Andreas Triantafyllopoulos
Björn Schüller
|
+
PDF
Chat
|
Modeling Emotional Trajectories in Written Stories Utilizing
Transformers and Weakly-Supervised Learning
|
2024
|
Lukas Christ
Shahin Amiriparian
Manuel Milling
Ilhan Aslan
Björn Schüller
|
+
PDF
Chat
|
Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models
|
2024
|
Zixing Zhang
Liyizhe Peng
Tao Pang
Jing Han
Huan Zhao
Björn Schüller
|
+
PDF
Chat
|
Identity-free Artificial Emotional Intelligence via Micro-Gesture
Understanding
|
2024
|
Rong Gao
Xin Liu
Bohao Xing
Zitong Yu
Björn Schüller
Heikki Kälviäinen
|
+
PDF
Chat
|
HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's
Disease Detection From Spontaneous Speech
|
2024
|
Zhongren Dong
Zixing Zhang
Weixiang Xu
Jing Han
Jianjun Ou
Björn Schüller
|
+
PDF
Chat
|
Intelligent Cardiac Auscultation for Murmur Detection via
Parallel-Attentive Models with Uncertainty Estimation
|
2024
|
Zixing Zhang
Tao Pang
Jing Han
Björn Schüller
|
+
PDF
Chat
|
Expressivity and Speech Synthesis
|
2024
|
Andreas Triantafyllopoulos
Björn Schüller
|
+
PDF
Chat
|
MER 2024: Semi-Supervised Learning, Noise Robustness, and
Open-Vocabulary Multimodal Emotion Recognition
|
2024
|
Zheng Lian
Haiyang Sun
Licai Sun
Zhuofan Wen
Siyuan Zhang
Shun Chen
Hao Gu
Jinming Zhao
Ziyang Ma
Xie Chen
|
+
PDF
Chat
|
Enhancing Suicide Risk Assessment: A Speech-Based Automated Approach in
Emergency Medicine
|
2024
|
Shahin Amiriparian
Maurice Gerczuk
Justina Lutz
Wolfgang Strube
Irina Papazova
Alkomiet Hasan
Alexander Kathan
Björn Schüller
|
+
|
Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model
|
2024
|
Yuezhou Zhang
Amos Folarin
Judith Dineley
Pauline Conde
Valeria de Angel
Shaoxiong Sun
Yatharth Ranjan
Zulqarnain Rashid
Callum Stewart
Petroula Laiou
|
+
PDF
Chat
|
On Prompt Sensitivity of ChatGPT in Affective Computing
|
2024
|
Mostafa M. Amin
Björn Schüller
|
+
PDF
Chat
|
emoDARTS: Joint Optimisation of CNN & Sequential Neural Network
Architectures for Superior Speech Emotion Recognition
|
2024
|
Thejan Rajapakshe
Rajib Rana
Sara Khalifa
Berrak Şişman
Björn Schüller
Carlos Busso
|
+
|
Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition
|
2024
|
Yong Wang
Cheng Lu
Hailun Lian
Zhao Yan
Björn Schüller
Yuan Zong
Wenming Zheng
|
+
|
Intelligent Cardiac Auscultation for Murmur Detection via Parallel-Attentive Models with Uncertainty Estimation
|
2024
|
Zixing Zhang
Tao Pang
Jing Han
Björn Schüller
|
+
|
Customising General Large Language Models for Specialised Emotion Recognition Tasks
|
2024
|
Liyizhe Peng
Zixing Zhang
Tao Pang
Jing Han
Huan Zhao
Hao Chen
Björn Schüller
|
+
|
Improving Speaker-Independent Speech Emotion Recognition using Dynamic Joint Distribution Adaptation
|
2024
|
Cheng Lu
Yuan Zong
Hailun Lian
Yan Zhao
Björn Schüller
Wenming Zheng
|
+
|
HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer’s Disease Detection From Spontaneous Speech
|
2024
|
Zhongren Dong
Zixing Zhang
Weixiang Xu
Jing Han
Jianjun Ou
Björn Schüller
|
+
|
Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion Recognition
|
2024
|
Yan Zhao
Jincen Wang
Cheng Lu
Sunan Li
Björn Schüller
Yuan Zong
Wenming Zheng
|
+
|
Bringing the Discussion of Minima Sharpness to the Audio Domain: A Filter-Normalised Evaluation for Acoustic Scene Classification
|
2024
|
Manuel Milling
Andreas Triantafyllopoulos
Iosif Tsangko
Simon Rampp
Björn Schüller
|
+
|
Synthia’s Melody: A Benchmark Framework for Unsupervised Domain Adaptation in Audio
|
2024
|
Chiahsin Lin
Charles Jones
Björn Schüller
Harry Coppock
Alican Akman
|
+
|
Task Selection and Assignment for Multi-Modal Multi-Task Dialogue Act Classification with Non-Stationary Multi-Armed Bandits
|
2024
|
Xiangheng He
Junjie Chen
Björn Schüller
|
+
PDF
Chat
|
Propagating variational model uncertainty for bioacoustic call label smoothing
|
2024
|
Georgios Rizos
Jenna Lawson
Sımon F. Mıtchell
Pranay Shah
Xin Wen
Cristina Banks‐Leite
Robert M. Ewers
Björn Schüller
|
+
PDF
Chat
|
STAA-Net: A Sparse and Transferable Adversarial Attack for Speech
Emotion Recognition
|
2024
|
Yi Chang
Zhao Ren
Zixing Zhang
Xin Jing
Kun Qian
Xi Shao
Bin Hu
Tanja Schultz
Björn Schüller
|
+
|
Improving Speaker-independent Speech Emotion Recognition Using Dynamic Joint Distribution Adaptation
|
2024
|
Cheng Lu
Yuan Zong
Hailun Lian
Yan Zhao
Björn Schüller
Wenming Zheng
|
+
|
Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition
|
2024
|
Yong Wang
Cheng Lu
Hailun Lian
Yan Zhao
Björn Schüller
Yuan Zong
Wenming Zheng
|
+
|
Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion Recognition
|
2024
|
Yan Zhao
Jincen Wang
Cheng Lu
Sunan Li
Björn Schüller
Yuan Zong
Wenming Zheng
|
+
|
emoDARTS: Joint Optimization of CNN and Sequential Neural Network Architectures for Superior Speech Emotion Recognition
|
2024
|
Thejan Rajapakshe
Rajib Rana
Sara Khalifa
Berrak Şişman
Björn Schüller
Carlos Busso
|
+
PDF
Chat
|
Computational charisma—A brick by brick blueprint for building charismatic artificial intelligence
|
2023
|
Björn Schüller
Shahin Amiriparian
Anton Batliner
Alexander Gebhard
Maurice Gerczuk
Vincent Karas
Alexander Kathan
Lennart Seizer
Johanna Löchner
|
+
|
The UK COVID-19 Vocal Audio Dataset
|
2023
|
Harry Coppock
The Alan Turing Institute
UK Health Security Agency
Jobie Budd
Emma Karoune
Chris Holmes
Kieran Baker
Davide Pigoli
George Nicholson
Richard Payne
|
+
|
The UK COVID-19 Vocal Audio Dataset
|
2023
|
Harry Coppock
The Alan Turing Institute
UK Health Security Agency
Jobie Budd
Emma Karoune
Chris Holmes
Kieran Baker
Davide Pigoli
George Nicholson
Richard Payne
|
+
|
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning
|
2023
|
Zheng Lian
Haiyang Sun
Licai Sun
Kang Chen
Mngyu Xu
Kexin Wang
Ke Xu
Yu He
Ying Li
Jinming Zhao
|
+
PDF
Chat
|
The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions, Cross-Cultural Humour, and Personalisation
|
2023
|
Lukas Christ
Shahin Amiriparian
Alice Baird
Alexander Kathan
Niklas Müller
Steffen Klug
Chris Gagne
Panagiotis Tzirakis
Lukas Stappen
Eva-Maria Meßner
|
+
PDF
Chat
|
COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition
|
2023
|
Mani Kumar Tellamekala
Shahin Amiriparian
Björn Schüller
Elisabeth André
Timo Giesbrecht
Michel Valstar
|
+
PDF
Chat
|
Integrating Generative Artificial Intelligence in Intelligent Vehicle Systems
|
2023
|
Lukas Stappen
Jeremy Dillmann
Serena Striegel
Hans J. Vogel
Nicolas Flores-Herr
Björn Schüller
|
+
PDF
Chat
|
HEAR4Health: a blueprint for making computer audition a staple of modern healthcare
|
2023
|
Andreas Triantafyllopoulos
Alexander Kathan
Alice Baird
Lukas Christ
Alexander Gebhard
Maurice Gerczuk
Vincent Karas
Tobias Hübner
Xin Jing
Shuo Liu
|
+
PDF
Chat
|
Can ChatGPT’s Responses Boost Traditional Natural Language Processing?
|
2023
|
Mostafa M. Amin
Erik Cambria
Björn Schüller
|
+
PDF
Chat
|
A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model
|
2023
|
Mohammad Ibrahim Malik
Siddique Latif
Raja Jurdak
Björn Schüller
|
+
PDF
Chat
|
Abusive Speech Detection in Indic Languages Using Acoustic Features
|
2023
|
Anika A. Spiesberger
Andreas Triantafyllopoulos
Iosif Tsangko
Björn Schüller
|
+
PDF
Chat
|
Executive Voiced Laughter and Social Approval: An Explorative Machine Learning Study
|
2023
|
Niklas Mueller
Steffen Klug
Alexander Kathan
Lukas Christ
Björn Schüller
Shahin Amiriparian
|
+
PDF
Chat
|
Are 3D Face Shapes Expressive Enough for Recognising Continuous Emotions and Action Unit Intensities?
|
2023
|
Mani Kumar Tellamekala
Ömer Sümer
Björn Schüller
Elisabeth André
Timo Giesbrecht
Michel Valstar
|
+
|
Hierarchical Network with Decoupled Knowledge Distillation for Speech Emotion Recognition
|
2023
|
Ziping Zhao
Huan Wang
Haishuai Wang
Björn Schüller
|
+
|
Audio Barlow Twins: Self-Supervised Audio Representation Learning
|
2023
|
Jonah Anton
Harry Coppock
Pancham Shukla
Björn Schüller
|
+
|
Knowledge Transfer for on-Device Speech Emotion Recognition With Neural Structured Learning
|
2023
|
Yi Chang
Zhao Ren
Thành Tâm Nguyên
Kun Qian
Björn Schüller
|
+
|
Fast Yet Effective Speech Emotion Recognition with Self-Distillation
|
2023
|
Zhao Ren
Thành Tâm Nguyên
Yi Hua Chang
Björn Schüller
|
+
PDF
Chat
|
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap
|
2023
|
Johannes Wagner
Andreas Triantafyllopoulos
Hagen Wierstorf
Maximilian Schmitt
Felix Burkhardt
Florian Eyben
Björn Schüller
|
+
PDF
Chat
|
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era
|
2023
|
Andreas Triantafyllopoulos
Björn Schüller
Gökçe İymen
Tevfik Metin Sezgin
Xiangheng He
Zijiang Yang
Panagiotis Tzirakis
Shuo Liu
Silvan Mertes
Elisabeth André
|
+
PDF
Chat
|
A summary of the ComParE COVID-19 challenges
|
2023
|
Harry Coppock
Alican Akman
Christian Bergler
Maurice Gerczuk
Chloë Brown
Jagmohan Chauhan
Andreas Grammenos
Apinan Hasthanasombat
Dimitris Spathis
Xia Tong
|
+
PDF
Chat
|
Multistage linguistic conditioning of convolutional layers for speech emotion recognition
|
2023
|
Andreas Triantafyllopoulos
Uwe D. Reichel
Shuo Liu
Stephan Huber
Florian Eyben
Björn Schüller
|
+
|
Computational Charisma -- A Brick by Brick Blueprint for Building Charismatic Artificial Intelligence
|
2023
|
Björn Schüller
Shahin Amiriparian
Anton Batliner
Alexander Gebhard
Maurice Gerzcuk
Vincent Karas
Alexander Kathan
Lennart Seizer
Johanna Löchner
|
+
|
A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era
|
2023
|
Zhao Ren
Yi Chang
Thanh Tam Nguyen
Yang Tan
Kun Qian
Björn Schüller
|
+
|
HEAR4Health: A blueprint for making computer audition a staple of modern healthcare
|
2023
|
Andreas Triantafyllopoulos
Alexander Kathan
Alice Baird
Lukas Christ
Alexander Gebhard
Maurice Gerczuk
Vincent Karas
Tobias Hübner
Xin Jing
Shuo Liu
|
+
|
audb -- Sharing and Versioning of Audio and Annotation Data in Python
|
2023
|
Hagen Wierstorf
Johannes Wagner
Florian Eyben
Felix Burkhardt
Björn Schüller
|
+
|
Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT
|
2023
|
Mostafa M. Amin
Erik Cambria
Björn Schüller
|
+
|
hierarchical network with decoupled knowledge distillation for speech emotion recognition
|
2023
|
Ziping Zhao
Huan Wang
Haishuai Wang
Björn Schüller
|
+
|
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning
|
2023
|
Zheng Lian
Haiyang Sun
Licai Sun
Jinming Zhao
Ye Liu
Bin Liu
Jiangyan Yi
Meng Wang
Erik Cambria
Guoying Zhao
|
+
|
The ACM Multimedia 2023 Computational Paralinguistics Challenge: Emotion Share & Requests
|
2023
|
Björn Schüller
Anton Batliner
Shahin Amiriparian
Alexander Barnhill
Maurice Gerczuk
Andreas Triantafyllopoulos
Alice Baird
Panagiotis Tzirakis
Chris Gagne
Alan Cowen
|
+
|
The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions, Cross-Cultural Humour, and Personalisation
|
2023
|
Lukas Christ
Shahin Amiriparian
Alice Baird
Alexander Kathan
Niklas Müller
Steffen Klug
Chris Gagne
Panagiotis Tzirakis
Eva-Maria Meßner
Andreas König
|
+
|
Executive Voiced Laughter and Social Approval: An Explorative Machine Learning Study
|
2023
|
Niklas Mueller
Steffen Klug
Andreas Koenig
Alexander Kathan
Lukas Christ
Björn Schüller
Shahin Amiriparian
|
+
|
A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model
|
2023
|
Ibrahim Malik
Siddique Latif
Raja Jurdak
Björn Schüller
|
+
|
U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech
|
2023
|
Xin Jing
Yi Chang
Zijiang Yang
Jiangjian Xie
Andreas Triantafyllopoulos
Björn Schüller
|
+
|
Enhancing Speech Emotion Recognition Through Differentiable Architecture Search
|
2023
|
Thejan Rajapakshe
Rajib Rana
Sara Khalifa
Berrak Şişman
Björn Schüller
|
+
|
Happy or Evil Laughter? Analysing a Database of Natural Audio Samples
|
2023
|
Aljoscha Düsterhöft
Felix Burkhardt
Björn Schüller
|
+
|
Integrating Generative Artificial Intelligence in Intelligent Vehicle Systems
|
2023
|
Lukas Stappen
Jeremy Dillmann
Serena Striegel
Hans J. Vogel
Nicolas Flores-Herr
Björn Schüller
|
+
|
Speech-based Age and Gender Prediction with Transformers
|
2023
|
Felix Burkhardt
Johannes Wagner
Hagen Wierstorf
Florian Eyben
Björn Schüller
|
+
|
Going Retro: Astonishingly Simple Yet Effective Rule-based Prosody Modelling for Speech Synthesis Simulating Emotion Dimensions
|
2023
|
Felix Burkhardt
Uwe D. Reichel
Florian Eyben
Björn Schüller
|
+
|
Can ChatGPT's Responses Boost Traditional Natural Language Processing?
|
2023
|
Mostafa M. Amin
Erik Cambria
Björn Schüller
|
+
|
Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers
|
2023
|
Siddique Latif
Muhammad Usama
Mohammad Ibrahim Malik
Björn Schüller
|
+
|
Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models
|
2023
|
Zixing Zhang
Liyizhe Peng
Tao Pang
Jing Han
Huan Zhao
Björn Schüller
|
+
|
Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model
|
2023
|
Yuezhou Zhang
Amos Folarin
Judith Dineley
Pauline Conde
Valeria de Angel
Shaoxiong Sun
Yatharth Ranjan
Zulqarnain Rashid
Callum Stewart
Petroula Laiou
|
+
|
Sparks of Large Audio Models: A Survey and Outlook
|
2023
|
Siddique Latif
Moazzam Shoukat
Fahad Shamshad
Muhammad Usama
Heriberto Cuayáhuitl
Björn Schüller
|
+
|
A Wide Evaluation of ChatGPT on Affective Computing Tasks
|
2023
|
Mostafa M. Amin
Rui Mao
Erik Cambria
Björn Schüller
|
+
|
Exploring Meta Information for Audio-based Zero-shot Bird Classification
|
2023
|
Alexander Gebhard
Andreas Triantafyllopoulos
Teresa Bez
Lukas Christ
Alexander Kathan
Björn Schüller
|
+
|
Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits
|
2023
|
Xiangheng He
Junjie Chen
Björn Schüller
|
+
|
Synthia's Melody: A Benchmark Framework for Unsupervised Domain Adaptation in Audio
|
2023
|
Chiahsin Lin
Charles Jones
Björn Schüller
Harry Coppock
|
+
|
Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification
|
2023
|
Manuel Milling
Andreas Triantafyllopoulos
Iosif Tsangko
Simon Rampp
Björn Schüller
|
+
|
Customising General Large Language Models for Specialised Emotion Recognition Tasks
|
2023
|
Liyizhe Peng
Zixing Zhang
Tao Pang
Jing Han
Huan Zhao
Hao Chen
Björn Schüller
|
+
|
Testing Speech Emotion Recognition Machine Learning Models
|
2023
|
Anna Derington
Hagen Wierstorf
Ali Gürcan Özkil
Florian Eyben
Felix Burkhardt
Björn Schüller
|
+
PDF
Chat
|
Speech Synthesis With Mixed Emotions
|
2022
|
Kun Zhou
Berrak Şişman
Rajib Rana
Björn Schüller
Haizhou Li
|
+
PDF
Chat
|
Audio self-supervised learning: A survey
|
2022
|
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada‐Cabaleiro
Kun Qian
Xin Jing
Alexander Kathan
Bin Hu
Björn Schüller
|
+
PDF
Chat
|
Multitask Learning From Augmented Auxiliary Data for Improving Speech Emotion Recognition
|
2022
|
Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
|
+
PDF
Chat
|
The MuSe 2022 Multimodal Sentiment Analysis Challenge
|
2022
|
Lukas Christ
Shahin Amiriparian
Alice Baird
Panagiotis Tzirakis
Alexander Kathan
Niklas Müller
Lukas Stappen
Eva-Maria Meßner
Andreas König
Alan Cowen
|
+
PDF
Chat
|
A Temporal-oriented Broadcast ResNet for COVID-19 Detection
|
2022
|
Xin Jing
Shuo Liu
Emilia Parada‐Cabaleiro
Andreas Triantafyllopoulos
Meishu Song
Zijiang Yang
Björn Schüller
|
+
PDF
Chat
|
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
|
2022
|
Rui Liu
Berrak Şişman
Björn Schüller
Guanglai Gao
Haizhou Li
|
+
PDF
Chat
|
Data Augmentation for Dementia Detection in Spoken Language.
|
2022
|
Dominika Woszczyk
Anna Hedlikova
Alican Akman
Soteris Demetriou
Björn Schüller
|
+
PDF
Chat
|
SVTS: Scalable Video-to-Speech Synthesis
|
2022
|
Rodrigo Mira
Alexandros Haliassos
Stavros Petridis
Björn Schüller
Maja Pantić
|
+
PDF
Chat
|
Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis
|
2022
|
Yi Chang
Zhao Ren
Thành Tâm Nguyên
Wolfgang Nejdl
Björn Schüller
|
+
PDF
Chat
|
Distinguishing between pre- and post-treatment in the speech of patients with chronic obstructive pulmonary disease
|
2022
|
Andreas Triantafyllopoulos
Markus Fendler
Anton Batliner
Maurice Gerczuk
Shahin Amiriparian
Thomas M. Berghaus
Björn Schüller
|
+
PDF
Chat
|
Probing speech emotion recognition transformers for linguistic knowledge
|
2022
|
Andreas Triantafyllopoulos
Johannes Wagner
Hagen Wierstorf
Maximilian Schmitt
Uwe D. Reichel
Florian Eyben
Felix Burkhardt
Björn Schüller
|
+
|
Introducing the COVID-19 YouTube (COVYT) speech dataset featuring the same speakers with and without infection
|
2022
|
Andreas Triantafyllopoulos
Anastasia Semertzidou
Meishu Song
Florian B. Pokorny
Björn Schüller
|
+
|
Introducing the COVID-19 YouTube (COVYT) speech dataset featuring the same speakers with and without infection
|
2022
|
Andreas Triantafyllopoulos
Anastasia Semertzidou
Meishu Song
Florian B. Pokorny
Björn Schüller
|
+
PDF
Chat
|
Depression Diagnosis and Forecast based on Mobile Phone Sensor Data
|
2022
|
Xiangheng He
Andreas Triantafyllopoulos
Alexander Kathan
Manuel Milling
Tian‐Hao Yan
Srividya Tirunellai Rajamani
Ludwig Küster
Mathias Harrer
Elena Heber
Inga Grossmann
|
+
PDF
Chat
|
Journaling Data for Daily PHQ-2 Depression Prediction and Forecasting
|
2022
|
Alexander Kathan
Andreas Triantafyllopoulos
Xiangheng He
Manuel Milling
Tianhao Yan
Srividya Tirunellai Rajamani
Ludwig Küster
Mathias Harrer
Elena Heber
Inga Grossmann
|
+
PDF
Chat
|
Insights on Modelling Physiological, Appraisal, and Affective Indicators of Stress using Audio Features
|
2022
|
Andreas Triantafyllopoulos
Sandra Zänkert
Alice Baird
Julian Konzok
Brigitte M. Kudielka
Björn Schüller
|
+
PDF
Chat
|
Fatigue Prediction in Outdoor Running Conditions using Audio Data
|
2022
|
Andreas Triantafyllopoulos
Sandra Ottl
Alexander Gebhard
Esther Rituerto-González
Mirko Jaumann
Steffen Huttner
Valerie Dieter
Patrick Schneeweiß
Inga Krauß
Maurice Gerczuk
|
+
PDF
Chat
|
Evaluating the COVID-19 Identification ResNet (CIdeR) on the INTERSPEECH COVID-19 From Audio Challenges
|
2022
|
Alican Akman
Harry Coppock
Alexander Gaskell
Panagiotis Tzirakis
Lyn Jones
Björn Schüller
|
+
PDF
Chat
|
Emotion Intensity and its Control for Emotional Voice Conversion
|
2022
|
Kun Zhou
Berrak Şişman
Rajib Rana
Björn Schüller
Haizhou Li
|
+
PDF
Chat
|
End-to-End Video-to-Speech Synthesis Using Generative Adversarial Networks
|
2022
|
Rodrigo Mira
Konstantinos Vougioukas
Pingchuan Ma
Stavros Petridis
Björn Schüller
Maja Pantić
|
+
PDF
Chat
|
Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition
|
2022
|
Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
|
+
|
Fairness and underspecification in acoustic scene classification: The case for disaggregated evaluations
|
2022
|
Andreas Triantafyllopoulos
Manuel Milling
Konstantinos Drossos
Björn Schüller
|
+
PDF
Chat
|
A Novel Policy for Pre-trained Deep Reinforcement Learning for Speech Emotion Recognition
|
2022
|
Thejan Rajapakshe
Rajib Rana
Sara Khalifa
Jiajun Liu
Björn Schüller
|
+
PDF
Chat
|
MEDAS: an open-source platform as a service to help break the walls between medicine and informatics
|
2022
|
Liang Zhang
Johann Li
Ping Li
Xiaoyuan Lu
Maoguo Gong
Peiyi Shen
Guangming Zhu
Syed Afaq Ali Shah
Mohammed Bennamoun
Kun Qian
|
+
|
Continuous-Time Audiovisual Fusion with Recurrence vs. Attention for In-The-Wild Affect Recognition
|
2022
|
Vincent Karas
Mani Kumar Tellamekala
Adria Mallol-Ragolta
Michel Valstar
Björn Schüller
|
+
|
Audiovisual Affect Assessment and Autonomous Automobiles: Applications
|
2022
|
Björn Schüller
Dagmar Schuller
|
+
|
Climate Change & Computer Audition: A Call to Action and Overview on Audio Intelligence to Help Save the Planet
|
2022
|
Björn Schüller
Alican Akman
Yi Chang
Harry Coppock
Alexander Gebhard
Alexander Kathan
Esther Rituerto-González
Andreas Triantafyllopoulos
Florian B. Pokorny
|
+
|
Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis
|
2022
|
Yi Chang
Zhao Ren
Thanh Tam Nguyen
Wolfgang Nejdl
Björn Schüller
|
+
|
HEAR: Holistic Evaluation of Audio Representations
|
2022
|
Joseph Turian
Jordie Shier
Humair Raj Khan
Bhiksha Raj
Björn Schüller
Christian J. Steinmetz
Colin Malloy
George Tzanetakis
Gissel Velarde
Kirk McNally
|
+
|
An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion
|
2022
|
Zijiang Yang
Xin Jing
Andreas Triantafyllopoulos
Meishu Song
Ilhan Aslan
Björn Schüller
|
+
|
Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition
|
2022
|
Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
|
+
|
Normalise for Fairness: A Simple Normalisation Technique for Fairness in Regression Machine Learning Problems
|
2022
|
Mostafa M. Mohamed
Björn Schüller
|
+
|
Audio Self-supervised Learning: A Survey
|
2022
|
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xin Jing
Alexander Kathan
Bin Hu
Björn Schüller
|
+
|
A Temporal-oriented Broadcast ResNet for COVID-19 Detection
|
2022
|
Xin Jing
Shuo Liu
Emilia Parada‐Cabaleiro
Andreas Triantafyllopoulos
Meishu Song
Zijiang Yang
Björn Schüller
|
+
|
Robust Federated Learning Against Adversarial Attacks for Speech Emotion Recognition
|
2022
|
Yi Chang
Sofiane Laridi
Zhao Ren
Gregory M. Palmer
Björn Schüller
Marco Fisichella
|
+
|
Evaluating Deep Music Generation Methods Using Data Augmentation
|
2022
|
Toby Godwin
Georgios Rizos
Alice Baird
Najla D. Al Futaisi
Vincent Brisse
Björn Schüller
|
+
|
A Summary of the ComParE COVID-19 Challenges
|
2022
|
Harry Coppock
Alican Akman
Christian Bergler
Maurice Gerczuk
Chloë Brown
Jagmohan Chauhan
Andreas Grammenos
Apinan Hasthanasombat
Dimitris Spathis
Xia Tong
|
+
|
Probing Speech Emotion Recognition Transformers for Linguistic Knowledge
|
2022
|
Andreas Triantafyllopoulos
Johannes Wagner
Hagen Wierstorf
Maximilian Schmitt
Uwe D. Reichel
Florian Eyben
Felix Burkhardt
Björn Schüller
|
+
|
Dawn of the transformer era in speech emotion recognition: closing the valence gap
|
2022
|
Johannes Wagner
Andreas Triantafyllopoulos
Hagen Wierstorf
Maximilian Schmitt
Felix Burkhardt
Florian Eyben
Björn Schüller
|
+
|
Predicting Sex and Stroke Success -- Computer-aided Player Grunt Analysis in Tennis Matches
|
2022
|
Lukas Stappen
Manuel Milling
Valentin Munst
Korakot Hoffmann
Björn Schüller
|
+
|
SVTS: Scalable Video-to-Speech Synthesis
|
2022
|
Rodrigo Mira
Alexandros Haliassos
Stavros Petridis
Björn Schüller
Maja Pantić
|
+
|
The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts
|
2022
|
Alice Baird
Panagiotis Tzirakis
Gauthier Gidel
Marco Jiralerspong
Eilif Müller
Kory W. Mathewson
Björn Schüller
Erik Cambria
Dacher Keltner
Alan Cowen
|
+
|
Journaling Data for Daily PHQ-2 Depression Prediction and Forecasting
|
2022
|
Alexander Kathan
Andreas Triantafyllopoulos
Xiangheng He
Manuel Milling
Tian‐Hao Yan
Srividya Tirunellai Rajamani
Ludwig Küster
Mathias Harrer
Elena Heber
Inga Großmann
|
+
|
The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes
|
2022
|
Björn Schüller
Anton Batliner
Shahin Amiriparian
Christian Bergler
Maurice Gerczuk
Natalie Holz
Pauline Larrouy-Maestri
Sebastian P. Bayerl
Korbinian Riedhammer
Adria Mallol-Ragolta
|
+
|
COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition
|
2022
|
Mani Kumar Tellamekala
Shahin Amiriparian
Björn Schüller
Elisabeth André
Timo Giesbrecht
Michel Valstar
|
+
|
Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction
|
2022
|
Andreas Triantafyllopoulos
Meishu Song
Zijiang Yang
Xin Jing
Björn Schüller
|
+
|
Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression
|
2022
|
Xin Jing
Meishu Song
Andreas Triantafyllopoulos
Zijiang Yang
Björn Schüller
|
+
|
Dynamic Restrained Uncertainty Weighting Loss for Multitask Learning of Vocal Expression
|
2022
|
Meishu Song
Zijiang Yang
Andreas Triantafyllopoulos
Xin Jing
Vincent Karas
Jiangjian Xie
Zixing Zhang
Yoshiharu Yamamoto
Björn Schüller
|
+
|
COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset featuring the same speakers with and without infection
|
2022
|
Andreas Triantafyllopoulos
Anastasia Semertzidou
Meishu Song
Florian B. Pokorny
Björn Schüller
|
+
|
Data Augmentation for Dementia Detection in Spoken Language
|
2022
|
Anna Hlédiková
Dominika Woszczyk
Alican Acman
Soteris Demetriou
Björn Schüller
|
+
|
Are 3D Face Shapes Expressive Enough for Recognising Continuous Emotions and Action Unit Intensities?
|
2022
|
Mani Kumar Tellamekala
Ömer Sümer
Björn Schüller
Elisabeth André
Timo Giesbrecht
Michel Valstar
|
+
|
The ACII 2022 Affective Vocal Bursts Workshop & Competition: Understanding a critically understudied modality of emotional expression
|
2022
|
Alice Baird
Panagiotis Tzirakis
Jeffrey A. Brooks
Christopher B. Gregory
Björn Schüller
Anton Batliner
Dacher Keltner
Alan Cowen
|
+
|
Multitask Learning from Augmented Auxiliary Data for Improving Speech Emotion Recognition
|
2022
|
Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
|
+
|
The MuSe 2022 Multimodal Sentiment Analysis Challenge: Humor, Emotional Reactions, and Stress
|
2022
|
Lukas Christ
Shahin Amiriparian
Alice Baird
Panagiotis Tzirakis
Alexander Kathan
Niklas Müller
Lukas Stappen
Eva-Maria Meßner
Andreas König
Alan Cowen
|
+
|
Proceedings of the ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts
|
2022
|
Alice Baird
Panagiotis Tzirakis
Gauthier Gidel
Marco Jiralerspong
Eilif Müller
Kory W. Mathewson
Björn Schüller
Erik Cambria
Dacher Keltner
Alan Cowen
|
+
|
Distinguishing between pre- and post-treatment in the speech of patients with chronic obstructive pulmonary disease
|
2022
|
Andreas Triantafyllopoulos
Markus Fendler
Anton Batliner
Maurice Gerczuk
Shahin Amiriparian
Thomas M. Berghaus
Björn Schüller
|
+
|
Self-Supervised Attention Networks and Uncertainty Loss Weighting for Multi-Task Emotion Recognition on Vocal Bursts
|
2022
|
Vincent Karas
Andreas Triantafyllopoulos
Meishu Song
Björn Schüller
|
+
|
Audio Barlow Twins: Self-Supervised Audio Representation Learning
|
2022
|
Jonah Anton
Harry Coppock
Pancham Shukla
Björn Schüller
|
+
|
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era
|
2022
|
Andreas Triantafyllopoulos
Björn Schüller
Gökçe İymen
Tevfik Metin Sezgin
Xiangheng He
Zijiang Yang
Panagiotis Tzirakis
Shuo Liu
Silvan Mertes
Elisabeth André
|
+
|
Propagating Variational Model Uncertainty for Bioacoustic Call Label Smoothing
|
2022
|
Georgios Rizos
Jenna Lawson
Sımon F. Mıtchell
Pranay Shah
Xin Wen
Cristina Banks‐Leite
Robert M. Ewers
Björn Schüller
|
+
|
Fast Yet Effective Speech Emotion Recognition with Self-distillation
|
2022
|
Zhao Ren
Thành Tâm Nguyên
Yi Chang
Björn Schüller
|
+
|
Knowledge Transfer For On-Device Speech Emotion Recognition with Neural Structured Learning
|
2022
|
Yi Chang
Zhao Ren
Thành Tâm Nguyên
Kun Qian
Björn Schüller
|
+
|
Depression Diagnosis and Forecast based on Mobile Phone Sensor Data
|
2022
|
Xiangheng He
Andreas Triantafyllopoulos
Alexander Kathan
Manuel Milling
Tian‐Hao Yan
Srividya Tirunellai Rajamani
Ludwig Küster
Mathias Harrer
Elena Heber
Inga Großmann
|
+
|
Proceedings of the ACII Affective Vocal Bursts Workshop and Competition 2022 (A-VB): Understanding a critically understudied modality of emotional expression
|
2022
|
Alice Baird
Panagiotis Tzirakis
Jeffrey A. Brooks
Christopher B. Gregory
Björn Schüller
Anton Batliner
Dacher Keltner
Alan Cowen
|
+
|
AI-Based Emotion Recognition: Promise, Peril, and Prescriptions for Prosocial Path
|
2022
|
Siddique Latif
Hafiz Shehbaz Ali
Muhammad Usama
Rajib Rana
Björn Schüller
Junaid Qadir
|
+
|
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
|
2022
|
Rui Liu
Berrak Şişman
Björn Schüller
Guanglai Gao
Haizhou Li
|
+
|
A large-scale and PCR-referenced vocal audio dataset for COVID-19
|
2022
|
Jobie Budd
Kieran Baker
Emma Karoune
Harry Coppock
Selina Patel
Ana Tendero Cañadas
Alexander Titcomb
Richard Payne
David J. Hurley
Sabrina Egglestone
|
+
|
Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19
|
2022
|
Davide Pigoli
Kieran Baker
Jobie Budd
Lorraine Butler
Harry Coppock
Sabrina Egglestone
Steven G. Gilmour
Chris Holmes
David J. Hurley
Radka Jersakova
|
+
|
Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers
|
2022
|
Harry Coppock
George Nicholson
Ivan Kiskin
Vasiliki Koutra
Kieran Baker
Jobie Budd
Richard Payne
Emma Karoune
David J. Hurley
Alexander Titcomb
|
+
|
Automatic Emotion Modelling in Written Stories
|
2022
|
Lukas Christ
Shahin Amiriparian
Manuel Milling
Ilhan Aslan
Björn Schüller
|
+
|
Fatigue Prediction in Outdoor Running Conditions using Audio Data
|
2022
|
Andreas Triantafyllopoulos
Sandra Ottl
Alexander Gebhard
Esther Rituerto-González
Mirko Jaumann
Steffen Hüttner
Valerie Dieter
Patrick Schneeweiß
Inga Krauß
Maurice Gerczuk
|
+
|
Insights on Modelling Physiological, Appraisal, and Affective Indicators of Stress using Audio Features
|
2022
|
Andreas Triantafyllopoulos
Sandra Zänkert
Alice Baird
Julian Konzok
Brigitte M. Kudielka
Björn Schüller
|
+
|
Towards Multimodal Prediction of Spontaneous Humour: A Novel Dataset and First Results
|
2022
|
Lukas Christ
Shahin Amiriparian
Alexander Kathan
Niklas Müller
Andreas König
Björn Schüller
|
+
PDF
Chat
|
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition
|
2021
|
Maurice Gerczuk
Shahin Amiriparian
Sandra Ottl
Björn Schüller
|
+
PDF
Chat
|
Bias and privacy in AI's cough-based COVID-19 recognition – Authors' reply
|
2021
|
Harry Coppock
Lyn Jones
Ivan Kiskin
Björn Schüller
|
+
|
Facial Emotion Recognition using Deep Residual Networks in Real-World Environments.
|
2021
|
Panagiotis Tzirakis
Dénes Boros
Elnar Hajiyev
Björn Schüller
|
+
PDF
Chat
|
GraphTMT: Unsupervised Graph-based Topic Modeling from Video Transcripts
|
2021
|
Jason Thies
Lukas Stappen
Gerhard Hagerer
Björn Schüller
Georg Groh
|
+
|
Fitbeat: COVID-19 estimation based on wristband heart rate using a contrastive convolutional auto-encoder
|
2021
|
Shuo Liu
Jing Han
Estela Laporta
Spyridon Kontaxis
Shaoxiong Sun
Patrick Locatelli
Judith Dineley
Florian B. Pokorny
Gloria Dalla Costa
Letizia Leocani
|
+
PDF
Chat
|
A Physiologically-Adapted Gold Standard for Arousal during Stress
|
2021
|
Alice Baird
Lukas Stappen
Lukas Christ
Lea Schumann
Eva-Maria Meßner
Björn Schüller
|
+
PDF
Chat
|
Evaluating Deep Music Generation Methods Using Data Augmentation
|
2021
|
Toby Godwin
Georgios Rizos
Alice Baird
Najla D. Al Futaisi
Vincent Brisse
Björn Schüller
|
+
PDF
Chat
|
Fairness and underspecification in acoustic scene classification: The case for disaggregated evaluations
|
2021
|
Andreas Triantafyllopoulos
Manuel Milling
Konstantinos Drossos
Björn Schüller
|
+
|
A Machine Learning Framework for Automatic Prediction of Human Semen Motility.
|
2021
|
Sandra Ottl
Shahin Amiriparian
Maurice Gerczuk
Björn Schüller
|
+
PDF
Chat
|
Remote Smartphone-Based Speech Collection: Acceptance and Barriers in Individuals with Major Depressive Disorder
|
2021
|
Judith Dineley
Grace Lavelle
Daniel Leightley
Faith Matcham
Sara Siddi
Maria Teresa Peñarrubia‐María
Katie M White
Alina Ivan
Carolin Oetzmann
Sara Simblett
|
+
PDF
Chat
|
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation
|
2021
|
Xiangheng He
Junjie Chen
Georgios Rizos
Björn Schüller
|
+
PDF
Chat
|
LiRA: Learning Visual Speech Representations from Audio Through Self-Supervision
|
2021
|
Pingchuan Ma
Rodrigo Mira
Stavros Petridis
Björn Schüller
Maja Pantić
|
+
PDF
Chat
|
The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements
|
2021
|
Lukas Stappen
Alice Baird
Lea Schumann
Björn Schüller
|
+
|
A Physiologically-adapted Gold Standard for Arousal During a Stress Induced Scenario
|
2021
|
Alice Baird
Lukas Stappen
Lukas Christ
Lea Schumann
Eva-Maria Meßner
Björn Schüller
|
+
|
A Physiologically-Adapted Gold Standard for Arousal during Stress
|
2021
|
Alice Baird
Lukas Stappen
Lukas Christ
Lea Schumann
Eva-Maria Meßner
Björn Schüller
|
+
|
MuSe-Toolbox: The Multimodal Sentiment Analysis Continuous Annotation Fusion and Discrete Class Transformation Toolbox
|
2021
|
Lukas Stappen
Lea Schumann
Benjamin Sertolli
Alice Baird
Benjamin Weigel
Erik Cambria
Björn Schüller
|
+
PDF
Chat
|
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice
Quality and Data Augmentation
|
2021
|
Xiangheng He
Junjie Chen
Georgios Rizos
Björn Schüller
|
+
PDF
Chat
|
Affective Image Content Analysis: Two Decades Review and New Perspectives
|
2021
|
Sicheng Zhao
Xingxu Yao
Jufeng Yang
Guoli Jia
Guiguang Ding
Tat‐Seng Chua
Björn Schüller
Kurt Keutzer
|
+
PDF
Chat
|
LiRA: Learning Visual Speech Representations from Audio through
Self-supervision
|
2021
|
Pingchuan Ma
Rodrigo Mira
Stavros Petridis
Björn Schüller
Maja Pantić
|
+
PDF
Chat
|
Prediction on Mechanical Properties of Non-Equiatomic High-Entropy Alloy by Atomistic Simulation and Machine Learning
|
2021
|
Liang Zhang
Kun Qian
Björn Schüller
Yasushi Shibuta
|
+
|
The voice of COVID-19: Acoustic correlates of infection in sustained vowels
|
2021
|
Katrin D. Bartl-Pokorny
Florian B. Pokorny
Anton Batliner
Shahin Amiriparian
Anastasia Semertzidou
Florian Eyben
Elena Kramer
Florian Schmidt
R. Schönweiler
Markus Wehler
|
+
|
Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks
|
2021
|
Mina A. Nessiem
Mostafa M. Mohamed
Harry Coppock
Alexander Gaskell
Björn Schüller
|
+
PDF
Chat
|
Speech Emotion Recognition Using Semantic Information
|
2021
|
Panagiotis Tzirakis
Anh Gia-Tuan Nguyen
Stefanos Zafeiriou
Björn Schüller
|
+
|
An Estimation of Online Video User Engagement from Features of Continuous Emotions.
|
2021
|
Lukas Stappen
Alice Baird
Michelle Lienhart
Annalena Bätz
Björn Schüller
|
+
|
Unsupervised Graph-based Topic Modeling from Video Transcriptions.
|
2021
|
Lukas Stappen
Gerhard Hagerer
Björn Schüller
Georg Groh
|
+
PDF
Chat
|
Learning audio sequence representations for acoustic event classification
|
2021
|
Zixing Zhang
Ding Liu
Jing Han
Kun Qian
Björn Schüller
|
+
|
The MuSe 2021 Multimodal Sentiment Analysis Challenge: Sentiment, Emotion, Physiological-Emotion, and Stress
|
2021
|
Lukas Stappen
Alice Baird
Lukas Christ
Lea Schumann
Benjamin Sertolli
Eva-Maria Meßner
Erik Cambria
Guoying Zhao
Björn Schüller
|
+
PDF
Chat
|
Recent Advances in Computer Audition for Diagnosing COVID-19: An Overview
|
2021
|
Kun Qian
Björn Schüller
Yoshiharu Yamamoto
|
+
|
Speech Emotion Recognition using Semantic Information
|
2021
|
Panagiotis Tzirakis
Anh Gia-Tuan Nguyen
Stefanos Zafeiriou
Björn Schüller
|
+
|
The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates
|
2021
|
Björn Schüller
Anton Batliner
Christian Bergler
Cecilia Mascolo
Jing Han
Iulia Lefter
Heysem Kaya
Shahin Amiriparian
Alice Baird
Lukas Stappen
|
+
|
The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements
|
2021
|
Lukas Stappen
Alice Baird
Lea Schumann
Björn Schüller
|
+
|
End-2-End COVID-19 Detection from Breath & Cough Audio
|
2021
|
Harry Coppock
Alexander Gaskell
Panagiotis Tzirakis
Alice Baird
Lyn Jones
Björn Schüller
|
+
|
Personalized Federated Deep Learning for Pain Estimation From Face Images
|
2021
|
Ognjen Rudovic
Nicolas Tobis
Sebastian Kaltwang
Björn Schüller
Daniel Rueckert
Jeffrey F. Cohn
Rosalind W. Picard
|
+
|
Deep Attention-based Representation Learning for Heart Sound Classification
|
2021
|
Zhao Ren
Kun Qian
Fengquan Dong
Zhenyu Dai
Yoshiharu Yamamoto
Björn Schüller
|
+
|
An Enhanced Adversarial Network with Combined Latent Features for Spatio-temporal Facial Affect Estimation in the Wild
|
2021
|
Decky Aspandi
Federico M. Sukno
Björn Schüller
Xavier Binefa
|
+
|
Computational Emotion Analysis From Images: Recent Advances and Future Directions
|
2021
|
Sicheng Zhao
Quanwei Huang
Youbao Tang
Xingxu Yao
Jufeng Yang
Guiguang Ding
Björn Schüller
|
+
|
Fitbeat: COVID-19 Estimation based on Wristband Heart Rate
|
2021
|
Shuo Liu
Jing Han
Estela Laporta
Spyridon Kontaxis
Shaoxiong Sun
Patrick Locatelli
Judith Dineley
Florian B. Pokorny
Gloria Dalla Costa
Letizia Leocani
|
+
|
On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era
|
2021
|
Shahin Amiriparian
Artem Sokolov
Ilhan Aslan
Lukas Christ
Maurice Gerczuk
Tobias Hübner
Dmitry Lamanov
Manuel Milling
Sandra Ottl
Ilya Poduremennykh
|
+
|
DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing from Decentralised Data
|
2021
|
Shahin Amiriparian
Tobias Hübner
Maurice Gerczuk
Sandra Ottl
Björn Schüller
|
+
PDF
Chat
|
Poisson CNN: Convolutional neural networks for the solution of the Poisson equation on a Cartesian mesh
|
2021
|
Ali Girayhan Özbay
Arash Hamzehloo
Sylvain Laizet
Panagiotis Tzirakis
Georgios Rizos
Björn Schüller
|
+
|
LiRA: Learning Visual Speech Representations from Audio through Self-supervision
|
2021
|
Pingchuan Ma
Rodrigo Mira
Stavros Petridis
Björn Schüller
Maja Pantić
|
+
|
Affective Image Content Analysis: Two Decades Review and New Perspectives
|
2021
|
Sicheng Zhao
Xingxu Yao
Jufeng Yang
Guoli Jia
Guiguang Ding
Tat‐Seng Chua
Björn Schüller
Kurt Keutzer
|
+
|
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation
|
2021
|
Xiangheng He
Junjie Chen
Georgios Rizos
Björn Schüller
|
+
|
The EIHW-GLAM Deep Attentive Multi-model Fusion System for Cough-based COVID-19 Recognition in the DiCOVA 2021 Challenge
|
2021
|
Zhao Ren
Yi Chang
Björn Schüller
|
+
|
EIHW-MTG DiCOVA 2021 Challenge System Report
|
2021
|
Adria Mallol-Ragolta
Helena Cuesta
Emília Gómez
Björn Schüller
|
+
|
EIHW-MTG: Second DiCOVA Challenge System Report
|
2021
|
Adria Mallol-Ragolta
Helena Cuesta
Emília Gómez
Björn Schüller
|
+
|
Facial Emotion Recognition using Deep Residual Networks in Real-World Environments
|
2021
|
Panagiotis Tzirakis
Dénes Boros
Elnar Hajiyev
Björn Schüller
|
+
|
Multistage linguistic conditioning of convolutional layers for speech emotion recognition
|
2021
|
Andreas Triantafyllopoulos
Uwe D. Reichel
Shuo Liu
Stephan M. Huber
Florian Eyben
Björn Schüller
|
+
|
Evaluating the COVID-19 Identification ResNet (CIdeR) on the INTERSPEECH COVID-19 from Audio Challenges
|
2021
|
Alican Akman
Harry Coppock
Alexander Gaskell
Panagiotis Tzirakis
Lyn H. Jones
Björn Schüller
|
+
|
A Physiologically-Adapted Gold Standard for Arousal during Stress
|
2021
|
Alice Baird
Lukas Stappen
Lukas Christ
Lea Schumann
Eva-Maria Meßner
Björn Schüller
|
+
|
MuSe-Toolbox: The Multimodal Sentiment Analysis Continuous Annotation Fusion and Discrete Class Transformation Toolbox
|
2021
|
Lukas Stappen
Lea Schumann
Benjamin Sertolli
Alice Baird
Benjamin Weigel
Erik Cambria
Björn Schüller
|
+
|
An Estimation of Online Video User Engagement from Features of Continuous Emotions
|
2021
|
Lukas Stappen
Alice Baird
Michelle Lienhart
Annalena Bätz
Björn Schüller
|
+
|
GraphTMT: Unsupervised Graph-based Topic Modeling from Video Transcripts
|
2021
|
Lukas Stappen
Jason Thies
Gerhard Hagerer
Björn Schüller
Georg Groh
|
+
|
The MuSe 2021 Multimodal Sentiment Analysis Challenge: Sentiment, Emotion, Physiological-Emotion, and Stress
|
2021
|
Lukas Stappen
Alice Baird
Lukas Christ
Lea Schumann
Benjamin Sertolli
Eva-Maria Meßner
Erik Cambria
Guoying Zhao
Björn Schüller
|
+
|
Speech Emotion Recognition using Semantic Information
|
2021
|
Panagiotis Tzirakis
Anh Nguyen
Stefanos Zafeiriou
Björn Schüller
|
+
|
The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements
|
2021
|
Lukas Stappen
Alice Baird
Lea Schumann
Björn Schüller
|
+
|
End-2-End COVID-19 Detection from Breath & Cough Audio
|
2021
|
Harry Coppock
Alexander Gaskell
Panagiotis Tzirakis
Alice Baird
Lyn H. Jones
Björn Schüller
|
+
|
Fairness and underspecification in acoustic scene classification: The case for disaggregated evaluations
|
2021
|
Andreas Triantafyllopoulos
Manuel Milling
Konstantinos Drossos
Björn Schüller
|
+
|
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition
|
2021
|
Maurice Gerczuk
Shahin Amiriparian
Sandra Ottl
Björn Schüller
|
+
|
A Machine Learning Framework for Automatic Prediction of Human Semen Motility
|
2021
|
Sandra Ottl
Shahin Amiriparian
Maurice Gerczuk
Björn Schüller
|
+
|
The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates
|
2021
|
Björn Schüller
Anton Batliner
Christian Bergler
Cecilia Mascolo
Jing Han
Iulia Lefter
Heysem Kaya
Shahin Amiriparian
Alice Baird
Lukas Stappen
|
+
|
Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks.
|
2020
|
Björn Schüller
Harry Coppock
Alexander Gaskell
|
+
|
Audio, Speech, Language, & Signal Processing for COVID-19: A Comprehensive Overview.
|
2020
|
Gauri Deshpande
Björn Schüller
|
+
PDF
Chat
|
CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification
|
2020
|
Zhao Ren
Qiuqiang Kong
Jing Han
Mark D. Plumbley
Björn Schüller
|
+
PDF
Chat
|
Synthesising 3D Facial Motion from “In-the-Wild” Speech
|
2020
|
Panagiotis Tzirakis
Athanasios Papaioannou
Alexandros Lattas
Michail Tarasiou
Björn Schüller
Stefanos Zafeiriou
|
+
PDF
Chat
|
Augmenting Generative Adversarial Networks for Speech Emotion Recognition
|
2020
|
Siddique Latif
Muhammad Asim
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
|
+
|
An Early Study on Intelligent Analysis of Speech Under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety
|
2020
|
Jing Han
Kun Qian
Meishu Song
Zijiang Yang
Zhao Ren
Shuo Liu
Juan Liu
Huaiyuan Zheng
Wei Ji
Tomoya Koike
|
+
PDF
Chat
|
Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks, and Cross-Corpus Setting for Speech Emotion Recognition
|
2020
|
Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
|
+
|
Go-CaRD - Generic, Optical Car Part Recognition and Detection: Collection, Insights, and Applications.
|
2020
|
Lukas Stappen
Xinchen Du
Vincent Karas
Stefan Müller
Björn Schüller
|
+
|
High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder
|
2020
|
Kazi Nazmul Haque
Rajib Rana
Björn Schüller
|
+
|
Augmenting Generative Adversarial Networks for Speech Emotion Recognition
|
2020
|
Siddique Latif
Muhammad Asim
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
|
+
|
Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks, and Cross-corpus Setting for Speech Emotion Recognition
|
2020
|
Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
|
+
|
An Overview on Audio, Signal, Speech, & Language Processing for COVID-19
|
2020
|
Gauri Deshpande
Björn Schüller
|
+
|
A Novel Fusion of Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech.
|
2020
|
Shahin Amiriparian
Pawel Winokurow
Vincent Karas
Sandra Ottl
Maurice Gerczuk
Björn Schüller
|
+
|
ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition.
|
2020
|
Mostafa M. Mohamed
Björn Schüller
|
+
PDF
Chat
|
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition
|
2020
|
Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Julien Epps
Björn Schüller
|
+
|
Adversarial-based neural networks for affect estimations in the wild
|
2020
|
Decky Aspandi
Adria Mallol-Ragolta
Björn Schüller
Xavier Binefa
|
+
|
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
|
2020
|
Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn Schüller
|
+
|
Guided Generative Adversarial Neural Network for Representation Learning and High Fidelity Audio Generation using Fewer Labelled Audio Data
|
2020
|
Kazi Nazmul Haque
Rajib Rana
John H. L. Hansen
Björn Schüller
|
+
|
COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis
|
2020
|
Björn Schüller
Dagmar Schuller
Kun Qian
Juan Liu
Huaiyuan Zheng
Xiao Li
|
+
|
Prediction of mechanical properties of non-equiatomic high-entropy alloy by atomistic simulation and machine learning
|
2020
|
Liang Zhang
Kun Qian
Björn Schüller
Cheng Lü
Yasushi Shibuta
Xiaoxu Huang
|
+
|
An Early Study on Intelligent Analysis of Speech under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety
|
2020
|
Jing Han
Kun Qian
Meishu Song
Zijiang Yang
Zhao Ren
Shuo Liu
Juan Liu
Huaiyuan Zheng
Wei Ji
Tomoya Koike
|
+
|
MuSe 2020 -- The First International Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop
|
2020
|
Lukas Stappen
Alice Baird
Georgios Rizos
Panagiotis Tzirakis
Xinchen Du
Felix Hafner
Lea Schumann
Adria Mallol-Ragolta
Björn Schüller
Iulia Lefter
|
+
|
Cross-lingual Zero- and Few-shot Hate Speech Detection Utilising Frozen Transformer Language Models and AXEL
|
2020
|
Lukas Stappen
Fabian Brunn
Björn Schüller
|
+
|
deepSELF: An Open Source Deep Self End-to-End Learning Framework
|
2020
|
Tomoya Koike
Kun Qian
Björn Schüller
Yoshiharu Yamamoto
|
+
|
On Deep Speech Packet Loss Concealment: A Mini-Survey
|
2020
|
Mostafa M. Mohamed
Mina A. Nessiem
Björn Schüller
|
+
|
"I have vxxx bxx connexxxn!": Facing Packet Loss in Deep Speech Emotion Recognition
|
2020
|
Mostafa M. Mohamed
Björn Schüller
|
+
|
Deep Reinforcement Learning with Pre-training for Time-efficient Training of Automatic Speech Recognition
|
2020
|
Thejan Rajapakshe
Siddique Latif
Rajib Rana
Sara Khalifa
Björn Schüller
|
+
|
MeDaS: An open-source platform as service to help break the walls between medicine and informatics
|
2020
|
Liang Zhang
Johann Li
Ping Li
Xiaoyuan Lu
Peiyi Shen
Guangming Zhu
Syed Afaq Ali Shah
Mohammed Bennamoun
Kun Qian
Björn Schüller
|
+
|
Capturing dynamics of post-earnings-announcement drift using genetic algorithm-optimised supervised learning
|
2020
|
Zhengxin Joseph Ye
Björn Schüller
|
+
PDF
Chat
|
High-Fidelity Audio Generation and Representation Learning With Guided Adversarial Autoencoder
|
2020
|
Kazi Nazmul Haque
Rajib Rana
Björn Schüller
|
+
|
Recent Advances in Computer Audition for Diagnosing COVID-19: An Overview
|
2020
|
Kun Qian
Björn Schüller
Yoshiharu Yamamoto
|
+
|
Domain Adaptation with Joint Learning for Generic, Optical Car Part Recognition and Detection Systems (Go-CaRD)
|
2020
|
Lukas Stappen
Xinchen Du
Vincent Karas
Stefan Müller
Björn Schüller
|
+
|
Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks
|
2020
|
Björn Schüller
Harry Coppock
Alexander Gaskell
|
+
|
Audio, Speech, Language, & Signal Processing for COVID-19: A Comprehensive Overview
|
2020
|
Gauri Deshpande
Björn Schüller
|
+
|
High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder
|
2020
|
Kazi Nazmul Haque
Rajib Rana
Björn Schüller
|
+
|
Augmenting Generative Adversarial Networks for Speech Emotion Recognition
|
2020
|
Siddique Latif
Muhammad Asim
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
|
+
|
Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks, and Cross-corpus Setting for Speech Emotion Recognition
|
2020
|
Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Björn Schüller
|
+
|
ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition
|
2020
|
Mostafa M. Mohamed
Björn Schüller
|
+
|
A Novel Fusion of Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech
|
2020
|
Shahin Amiriparian
Pawel Winokurow
Vincent Karas
Sandra Ottl
Maurice Gerczuk
Björn Schüller
|
+
|
Adversarial-based neural networks for affect estimations in the wild
|
2020
|
Decky Aspandi
Adria Mallol-Ragolta
Björn Schüller
Xavier Binefa
|
+
|
An Overview on Audio, Signal, Speech, & Language Processing for COVID-19
|
2020
|
Gauri Deshpande
Björn Schüller
|
+
|
Convolutional Neural Networks for the Solution of the 2D Poisson Equation with Arbitrary Dirichlet Boundary Conditions, Mesh Sizes and Grid Spacings
|
2019
|
Ali Girayhan Özbay
Panagiotis Tzirakis
Georgios Rizos
Björn Schüller
Sylvain Laizet
|
+
|
Poisson CNN: Convolutional Neural Networks for the Solution of the Poisson Equation with Varying Meshes and Dirichlet Boundary Conditions
|
2019
|
Ali Girayhan Özbay
Sylvain Laizet
Panagiotis Tzirakis
Georgios Rizos
Björn Schüller
|
+
PDF
Chat
|
AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition
|
2019
|
Fabien Ringeval
Björn Schüller
Michel Valstar
Nicholas Cummins
Roddy Cowie
Leili Tavabi
Maximilian Schmitt
Sina Alisamir
Shahin Amiriparian
Eva-Maria Meßner
|
+
PDF
Chat
|
Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach
|
2019
|
Ognjen Rudovic
Meiru Zhang
Björn Schüller
Rosalind W. Picard
|
+
PDF
Chat
|
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild
|
2019
|
Jean Kossaifi
Robert Walecki
Yannis Panagakis
Jie Shen
Maximilian Schmitt
Fabien Ringeval
Jing Han
Vedhas Pandit
Antoine Toisoul
Björn Schüller
|
+
|
On Laughter and Speech-Laugh, Based on Observations of Child-Robot Interaction.
|
2019
|
Anton Batliner
Stefan Steidl
Florian Eyben
Björn Schüller
|
+
|
Presenting the Acoustic Sounds for Wellbeing Dataset and Baseline Classification Results.
|
2019
|
Alice Baird
Björn Schüller
|
+
PDF
Chat
|
EmoBed: Strengthening Monomodal Emotion Recognition via Training with Crossmodal Emotion Embeddings
|
2019
|
Jing Han
Zixing Zhang
Zhao Ren
Björn Schüller
|
+
|
Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach
|
2019
|
Ognjen Rudovic
Meiru Zhang
Björn Schüller
Rosalind W. Picard
|
+
|
A comparison of online automatic speech recognition systems and the nonverbal responses to unintelligible speech
|
2019
|
Joshua Y. Kim
Chunfeng Liu
Rafael A. Calvo
Kathryn McCabe
Silas Taylor
Björn Schüller
Kaihang Wu
|
+
PDF
Chat
|
Attention-augmented End-to-end Multi-task Learning for Emotion Prediction from Speech
|
2019
|
Zixing Zhang
Bingwen Wu
Björn Schüller
|
+
|
Synthesising 3D Facial Motion from "In-the-Wild" Speech
|
2019
|
Panagiotis Tzirakis
Athanasios Papaioannou
Alexandros Lattas
Michail Tarasiou
Björn Schüller
Stefanos Zafeiriou
|
+
PDF
Chat
|
Snore-GANs: Improving Automatic Snore Sound Classification With Synthesized Data
|
2019
|
Zixing Zhang
Jing Han
Kun Qian
Christoph Janott
Yanan Guo
Björn Schüller
|
+
|
Responsible and Representative Multimodal Data Acquisition and Analysis: On Auditability, Benchmarking, Confidence, Data-Reliance & Explainability.
|
2019
|
Alice Baird
Simone Hantke
Björn Schüller
|
+
|
On Many-to-Many Mapping Between Concordance Correlation Coefficient and Mean Square Error.
|
2019
|
Vedhas Pandit
Björn Schüller
|
+
|
The Many-to-Many Mapping Between the Concordance Correlation Coefficient and the Mean Square Error
|
2019
|
Vedhas Pandit
Björn Schüller
|
+
PDF
Chat
|
Deep Affect Prediction in-the-Wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond
|
2019
|
Dimitrios Kollias
Panagiotis Tzirakis
Mihalis A. Nicolaou
Athanasios Papaioannou
Guoying Zhao
Björn Schüller
Irene Kotsia
Stefanos Zafeiriou
|
+
|
Voice command generation using Progressive Wavegans
|
2019
|
Thomas Wiest
Nicholas Cummins
Alice Baird
Simone Hantke
Judith Dineley
Björn Schüller
|
+
|
Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data
|
2019
|
Zixing Zhang
Jing Han
Kun Qian
Christoph Janott
Yanan Guo
Björn Schüller
|
+
|
Attention-Augmented End-to-End Multi-Task Learning for Emotion Prediction from Speech
|
2019
|
Zixing Zhang
Bingwen Wu
Björn Schüller
|
+
|
Single-Channel Speech Separation with Auxiliary Speaker Embeddings
|
2019
|
Shuo Liu
Gil Keren
Björn Schüller
|
+
|
Acoustic Sounds for Wellbeing: A Novel Dataset and Baseline Results
|
2019
|
Alice Baird
Björn Schüller
|
+
|
Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition
|
2019
|
Thejan Rajapakshe
Rajib Rana
Siddique Latif
Sara Khalifa
Björn Schüller
|
+
|
N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System
|
2019
|
Shuo Liu
Gil Keren
Björn Schüller
|
+
|
AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition
|
2019
|
Fabien Ringeval
Björn Schüller
Michel Valstar
Nicholas Cummins
Roddy Cowie
Leili Tavabi
Maximilian Schmitt
Sina Alisamir
Shahin Amiriparian
Eva-Maria Meßner
|
+
|
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition
|
2019
|
Siddique Latif
Rajib Rana
Sara Khalifa
Raja Jurdak
Julien Epps
Björn Schüller
|
+
|
Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach
|
2019
|
Ognjen Rudovic
Meiru Zhang
Björn Schüller
Rosalind W. Picard
|
+
|
A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech
|
2019
|
Joshua Y. Kim
Chunfeng Liu
Rafael A. Calvo
Kathryn McCabe
Silas Taylor
Björn Schüller
Kaihang Wu
|
+
|
Synthesising 3D Facial Motion from "In-the-Wild" Speech
|
2019
|
Panagiotis Tzirakis
Αθανάσιος Παπαϊωάννου
Alexandros Lattas
Michail Tarasiou
Björn Schüller
Stefanos Zafeiriou
|
+
|
The Many-to-Many Mapping Between the Concordance Correlation Coefficient and the Mean Square Error
|
2019
|
Vedhas Pandit
Björn Schüller
|
+
|
On Laughter and Speech-Laugh, Based on Observations of Child-Robot Interaction
|
2019
|
Anton Batliner
Stefan Steidl
Florian Eyben
Björn Schüller
|
+
|
Responsible and Representative Multimodal Data Acquisition and Analysis: On Auditability, Benchmarking, Confidence, Data-Reliance & Explainability
|
2019
|
Alice Baird
Simone Hantke
Björn Schüller
|
+
PDF
Chat
|
Fast Single-Class Classification and the Principle of Logit Separation
|
2018
|
Gil Keren
Sivan Sabato
Björn Schüller
|
+
PDF
Chat
|
Dynamic Difficulty Awareness Training for Continuous Emotion Prediction
|
2018
|
Zixing Zhang
Jing Han
Eduardo Coutinho
Björn Schüller
|
+
PDF
Chat
|
Scaling speech enhancement in unseen environments with noise embeddings
|
2018
|
Gil Keren
Jing Han
Björn Schüller
|
+
PDF
Chat
|
Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification
|
2018
|
Siyang Song
Shuimei Zhang
Björn Schüller
Linlin Shen
Michel Valstar
|
+
|
Personalized machine learning for robot perception of affect and engagement in autism therapy
|
2018
|
Ognjen Rudovic
Jaeryoung Lee
Miles Dai
Björn Schüller
Rosalind W. Picard
|
+
|
audEERING's approach to the One-Minute-Gradual Emotion Challenge.
|
2018
|
Andreas Triantafyllopoulos
Hesam Sagha
Florian Eyben
Björn Schüller
|
+
|
End2You -- The Imperial Toolkit for Multimodal Profiling by End-to-End Learning
|
2018
|
Panagiotis Tzirakis
Stefanos Zafeiriou
Björn Schüller
|
+
|
Weakly Supervised One-Shot Detection with Attention Siamese Networks
|
2018
|
Gil Keren
Maximilian Schmitt
Thomas Kehrenberg
Björn Schüller
|
+
|
Applying Cooperative Machine Learning to Speed Up the Annotation of Social Signals in Large Multi-modal Corpora
|
2018
|
Johannes Wagner
Tobias Baur
Yue Zhang
Michel Valstar
Björn Schüller
Elisabeth André
|
+
|
Calibrated Prediction Intervals for Neural Network Regressors
|
2018
|
Gil Keren
Nicholas Cummins
Björn Schüller
|
+
|
Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification
|
2018
|
Siyang Song
Shuimei Zhang
Björn Schüller
Linlin Shen
Michel Valstar
|
+
|
Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives
|
2018
|
Jing Han
Zixing Zhang
Nicholas Cummins
Björn Schüller
|
+
|
Scaling Speech Enhancement in Unseen Environments with Noise Embeddings
|
2018
|
Gil Keren
Jing Han
Björn Schüller
|
+
|
Calibrated Prediction Intervals for Neural Network Regressors
|
2018
|
Gil Keren
Nicholas Cummins
Björn Schüller
|
+
|
audEERING's approach to the One-Minute-Gradual Emotion Challenge
|
2018
|
Andreas Triantafyllopoulos
Hesam Sagha
Florian Eyben
Björn Schüller
|
+
|
End2You -- The Imperial Toolkit for Multimodal Profiling by End-to-End Learning
|
2018
|
Panagiotis Tzirakis
Stefanos Zafeiriou
Björn Schüller
|
+
|
Weakly Supervised One-Shot Detection with Attention Similarity Networks
|
2018
|
Gil Keren
Maximilian Schmitt
Thomas Kehrenberg
Björn Schüller
|
+
|
auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks
|
2017
|
Michael Freitag
Shahin Amiriparian
Sergey Pugachevskiy
Nicholas Cummins
Björn Schüller
|
+
PDF
Chat
|
End-to-End Multimodal Emotion Recognition Using Deep Neural Networks
|
2017
|
Panagiotis Tzirakis
George Trigeorgis
Mihalis A. Nicolaou
Björn Schüller
Stefanos Zafeiriou
|
+
PDF
Chat
|
DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding
|
2017
|
Dieu Linh Tran
Robert Walecki
Ognjen Rudovic
Stefanos Eleftheriadis
Björn Schüller
Maja Pantić
|
+
PDF
Chat
|
Deep Structured Learning for Facial Action Unit Intensity Estimation
|
2017
|
Robert Walecki
Ognjen Rudovic
Vladimir Pavlović
Björn Schüller
Maja Pantić
|
+
|
Deep Structured Learning for Facial Action Unit Intensity Estimation
|
2017
|
Robert Walecki
Ognjen
Rudovic
Vladimir Pavlović
Björn Schüller
Maja Pantić
|
+
|
DeepCoder: Semi-parametric Variational Autoencoders for Facial Action Unit Intensity Estimation.
|
2017
|
Dieu Linh Tran
Robert Walecki
Ognjen Rudovic
Stefanos Eleftheriadis
Björn Schüller
Maja Pantić
|
+
|
DeepCoder: Semi-parametric Variational Autoencoders for Automatic Facial Action Coding
|
2017
|
Dieu Linh Tran
Robert Walecki
Ognjen Rudovic
Stefanos Eleftheriadis
Björn Schüller
Maja Pantić
|
+
PDF
Chat
|
Tunable Sensitivity to Large Errors in Neural Network Training
|
2017
|
Gil Keren
Sivan Sabato
Björn Schüller
|
+
|
Fast Single-Class Classification and the Principle of Logit Separation
|
2017
|
Gil Keren
Sivan Sabato
Björn Schüller
|
+
|
Learning audio sequence representations for acoustic event classification
|
2017
|
Zixing Zhang
Ding Liu
Jing Han
Björn Schüller
|
+
|
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments
|
2017
|
Zixing Zhang
Jürgen T. Geiger
Jouni Pohjalainen
Amr El-Desoky Mousa
Wenyu Jin
Björn Schüller
|
+
|
DeepCoder: Semi-parametric Variational Autoencoders for Automatic Facial Action Coding
|
2017
|
Dieu Linh Tran
Robert Walecki
Ognjen Rudovic
Stefanos Eleftheriadis
Björn Schüller
Maja Pantić
|
+
|
Deep Structured Learning for Facial Action Unit Intensity Estimation
|
2017
|
Robert Walecki
Ognjen
Rudovic
Vladimir Pavlović
Björn Schüller
Maja Pantić
|
+
PDF
Chat
|
Detecting road surface wetness from audio: A deep learning approach
|
2016
|
Irman Abdić
Lex Fridman
Daniel E. Brown
William Angell
Bryan Reimer
Erik Marchi
Björn Schüller
|
+
PDF
Chat
|
Convolutional RNN: An enhanced model for extracting features from sequential data
|
2016
|
Gil Keren
Björn Schüller
|
+
|
openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit
|
2016
|
Maximilian Schmitt
Björn Schüller
|
+
PDF
Chat
|
A Deep Matrix Factorization Method for Learning Attribute Representations
|
2016
|
George Trigeorgis
Konstantinos Bousmalis
Stefanos Zafeiriou
Björn Schüller
|
+
|
Convolutional RNN: an Enhanced Model for Extracting Features from Sequential Data
|
2016
|
Gil Keren
Björn Schüller
|
+
|
Tunable Sensitivity to Large Errors in Neural Network Training
|
2016
|
Gil Keren
Sivan Sabato
Björn Schüller
|
+
|
AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge
|
2016
|
Michel Valstar
Jonathan Gratch
Björn Schüller
Fabien Ringeval
Denis Lalanne
Mercedes Torres Torres
Stefan Scherer
Guiota Stratou
Roddy Cowie
Maja Pantić
|
+
|
openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit
|
2016
|
Maximilian Schmitt
Björn Schüller
|
+
|
Detecting Road Surface Wetness from Audio: A Deep Learning Approach
|
2015
|
Irman Abdić
Lex Fridman
Erik Marchi
Daniel Brown
William Angell
Bryan Reimer
Björn Schüller
|
+
|
The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models
|
2015
|
Amr El-Desoky Mousa
Erik Marchi
Björn Schüller
|
+
|
A deep matrix factorization method for learning attribute representations
|
2015
|
George Trigeorgis
Konstantinos Bousmalis
Stefanos Zafeiriou
Björn Schüller
|
+
|
The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models
|
2015
|
Amr El-Desoky Mousa
Erik Marchi
Björn Schüller
|
+
|
A deep matrix factorization method for learning attribute representations
|
2015
|
George Trigeorgis
Konstantinos Bousmalis
Stefanos Zafeiriou
Björn Schüller
|
+
|
Detecting Road Surface Wetness from Audio: A Deep Learning Approach
|
2015
|
Irman Abdić
Lex Fridman
Erik Marchi
Daniel E. Brown
William W. Angell
Bryan Reimer
Björn Schüller
|
+
|
A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems
|
2014
|
Felix Weninger
Björn Schüller
Florian Eyben
Martin Wöllmer
Gerhard Rigoll
|
+
PDF
Chat
|
Acoustic Gait-based Person Identification using Hidden Markov Models
|
2014
|
Jürgen T. Geiger
Maximilian Kneißl
Björn Schüller
Gerhard Rigoll
|
+
|
Acoustic Gait-based Person Identification using Hidden Markov Models
|
2014
|
Jürgen T. Geiger
Maximilian Kneißl
Björn Schüller
Gerhard Rigoll
|
+
|
The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions
|
2014
|
Björn Schüller
Erik Marchi
Simon Baron‐Cohen
Helen O’Reilly
Delia Pigat
Peter Robinson
Ian Daves
|
+
|
The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions
|
2014
|
Björn Schüller
Erik Marchi
Simon Baron‐Cohen
Helen O’Reilly
Delia Pigat
Peter Robinson
Ian Davies
|
+
|
Acoustic Gait-based Person Identification using Hidden Markov Models
|
2014
|
Jürgen T. Geiger
Maximilian Kneißl
Björn Schüller
Gerhard Rigoll
|
+
|
A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems
|
2014
|
Felix Weninger
Björn Schüller
Florian Eyben
Martin Wöllmer
Gerhard Rigoll
|
+
|
The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions
|
2014
|
Björn Schüller
Erik Marchi
Simon Baron‐Cohen
Helen O’Reilly
Delia Pigat
Peter Robinson
Ian Daves
|
+
|
6th International Symposium on Attention in Cognitive Systems 2013
|
2013
|
Lucas Paletta
Laurent Itti
Björn Schüller
Fang Fang
|