Type: Article
Publication Date: 2023-06-01
Citations: 28
DOI: https://doi.org/10.1109/cvpr52729.2023.01428
The goal of multimodal summarization is to extract the most important information from different modalities to form summaries. Unlike unimodal summarization, the multimodal summarization task explicitly leverages cross-modal information to help generate more reliable and high-quality summaries. However, existing methods fail to lever-age the temporal correspondence between different modal-ities and ignore the intrinsic correlation between different samples. To address this issue, we introduce Align and Attend Multimodal Summarization (A2Summ), a unified multimodal transformer-based model which can effectively align and attend the multimodal input. In addition, we propose two novel contrastive losses to model both inter-sample and intra-sample correlations. Extensive experiments on two standard video summarization datasets (TVSum and SumMe) and two multimodal summarization datasets (Daily Mail and CNN) demonstrate the superiority of A2Summ, achieving state-of-the-art performances on all datasets. Moreover, we collected a large-scale multimodal summarization dataset BLiSS, which contains livestream videos and transcribed texts with annotated summaries. Our code and dataset are publicly available at https://boheumd.github.io/A2Summ/.
Action | Title | Year | Authors |
---|---|---|---|
+ | None | 1999 |
Ming Liao |
+ | None | 2001 |
I. N. Kostin |
+ | None | 1999 |
Yong-Gao Chen Imre Z. Ruzsa |
+ | None | 2003 |
Paul Sablonnière |
+ | None | 2001 |
Emmanuel Fragnière Jacek Gondzio Robert Sarkissian |
+ | None | 1998 |
G. Sardanashvily |
+ | None | 1998 |
Hans Keiding |
+ | None | 2003 |
Haihua Feng Vincenzo Galdi David A. Castañón |
+ | None | 2003 |
V. Z. Kanchukoev B. S. Karamurzov В. А. Созаев Vladimir Chernov |
+ | None | 2001 |
Petr Habala Nicole Tomczak-Jaegermann |
+ | None | 2001 |
S. E. Kozlov |
+ PDF Chat | None | 2008 |
田村 直義 |
+ | None | 2001 |
Joaquin Soriano |
+ | None | 2001 |
Shigetaka Fukuda |
+ | None | 2003 |
Solomon Friedberg |
+ | None | 2003 |
Igor Belegradek |
+ | None | 1997 |
Salih Çelïk |
+ | None | 2001 |
M. de Montigny Hubert de Guise |
+ | None | 2001 |
A. Yu. Kolesov Н. Х. Розов |
+ | None | 2002 |
D. G. Djumbayeva Erlan Nursultanov |