Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset

Type: Preprint

Publication Date: 2022-01-01

Citations: 1

DOI: https://doi.org/10.48550/arxiv.2207.10664

Locations

  • arXiv (Cornell University) - View
  • Edinburgh Research Explorer (University of Edinburgh) - View - PDF
  • DataCite API - View

Similar Works

Action Title Year Authors
+ PDF Chat Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset 2022 Grant Van Horn
Rui Qian
Michael J. Wilber
Hartwig Adam
Oisin Mac Aodha
Serge Belongie
+ Large Scale Audiovisual Learning of Sounds with Weakly Labeled Data 2020 Haytham M. Fayek
Anurag Kumar
+ Large Scale Audiovisual Learning of Sounds with Weakly Labeled Data 2020 Haytham M. Fayek
Anurag Kumar
+ Large Scale Audiovisual Learning of Sounds with Weakly Labeled Data 2020 Haytham M. Fayek
Anurag Kumar
+ PDF Chat ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning 2021 Sangho Lee
Jiwan Chung
Youngjae Yu
Gunhee Kim
Thomas M. Breuel
Gal Chechik
Yale Song
+ ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning 2021 Sangho Lee
Jiwan Chung
Youngjae Yu
Gunhee Kim
Thomas M. Breuel
Gal Chechik
Yale Song
+ Cross-domain Deep Feature Combination for Bird Species Classification with Audio-visual Data 2018 Naranchimeg Bold
Chao Zhang
Takuya Akashi
+ PDF Chat Audio-Visual Instance Discrimination with Cross-Modal Agreement 2021 Pedro Morgado
Nuno Vasconcelos
Ishan Misra
+ Audio-Visual Instance Discrimination with Cross-Modal Agreement 2020 Pedro Morgado
Nuno Vasconcelos
Ishan Misra
+ PDF Chat Vggsound: A Large-Scale Audio-Visual Dataset 2020 Honglie Chen
Weidi Xie
Andrea Vedaldi
Andrew Zisserman
+ VGGSound: A Large-scale Audio-Visual Dataset 2020 Honglie Chen
Weidi Xie
Andrea Vedaldi
Andrew Zisserman
+ Audio-Visual Instance Discrimination with Cross-Modal Agreement 2020 Pedro Morgado
Nuno Vasconcelos
Ishan Misra
+ UAVM: Towards Unifying Audio and Visual Models 2022 Yuan Gong
Alexander H. Liu
Andrew Rouditchenko
James Glass
+ PDF Chat Audioclip: Extending Clip to Image, Text and Audio 2022 Andrey Guzhov
Federico Raue
J.J. van Hees
Andreas Dengel
+ PDF Chat Efficient Audio-Visual Fusion for Video Classification 2024 Mahrukh Awan
Asmar Nadeem
Armin Mustafa
+ Cross-Modality and Within-Modality Regularization for Audio-Visual Deepfake Detection 2024 Heqing Zou
Meng Shen
Yu‐Chen Hu
Chen Chen
Eng Siong Chng
Deepu Rajan
+ Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection 2024 Heqing Zou
Meng Shen
Yu‐Chen Hu
Chen Chen
Eng Siong Chng
Deepu Rajan
+ PDF Chat Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification 2024 Mahrukh Awan
Asmar Nadeem
Muhammad Junaid Awan
Armin Mustafa
Syed Sameed Husain
+ ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities 2023 Peng Wang
Shijie Wang
Junyang Lin
Shuai Bai
Xiaohuan Zhou
Jingren Zhou
Xinggang Wang
Chang Zhou
+ AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models 2023 Yuan Tseng
Layne Berry
Yi‐Ting Chen
I-Hsiang Chiu
Hsuan-Hao Lin
Max Liu
Puyuan Peng
Yi-Jen Shih
Hung‐Yu Wang
Haibin Wu

Works That Cite This (0)

Action Title Year Authors

Works Cited by This (0)

Action Title Year Authors