Align and Attend: Multimodal Summarization with Dual Contrastive Losses

Type: Preprint

Publication Date: 2023-01-01

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2303.07284

Locations

  • arXiv (Cornell University) - View - PDF
  • DataCite API - View

Similar Works

Action Title Year Authors
+ PDF Chat SITransformer: Shared Information-Guided Transformer for Extreme Multimodal Summarization 2024 Sicheng Liu
Lintao Wang
X.W. Zhu
Xuequan Lu
Zhiyong Wang
Kun Hu
+ ICAF: Iterative Contrastive Alignment Framework for Multimodal Abstractive Summarization 2021 Zijian Zhang
Chang Shu
Youxin Chen
Jing Xiao
Qian Zhang
Lu Zheng
+ See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization 2021 Yash Kumar Atri
Shraman Pramanick
Vikram Goyal
Tanmoy Chakraborty
+ See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization 2021 Yash Kumar Atri
Shraman Pramanick
Vikram Goyal
Tanmoy Chakraborty
+ PDF Chat ICAF: Iterative Contrastive Alignment Framework for Multimodal Abstractive Summarization 2022 Zijian Zhang
Chang Shu
Youxin Chen
Jing Xiao
Qian Zhang
Lu Zheng
+ Hierarchical3D Adapters for Long Video-to-text Summarization 2022 Pinelopi Papalampidi
Mirella Lapata
+ Hierarchical3D Adapters for Long Video-to-text Summarization 2023 Pinelopi Papalampidi
Mirella Lapata
+ Multimodal Frame-Scoring Transformer for Video Summarization 2022 Jeiyoon Park
Kiho Kwoun
Chanhee Lee
Heuiseok Lim
+ Progressive Video Summarization via Multimodal Self-supervised Learning 2022 Haopeng Li
Qiuhong Ke
Mingming Gong
Rui Zhang
+ MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos 2023 Jielin Qiu
Jiacheng Zhu
William Han
Aditesh Kumar
Karthik Mittal
Claire Jin
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Bo Li
+ PDF Chat VideoXum: Cross-Modal Visual and Textural Summarization of Videos 2023 Jingyang Lin
Hang Hua
Ming Chen
Yikang Li
Jen-Hao Hsiao
Chiuman Ho
Jiebo Luo
+ Fusing Multimodal Signals on Hyper-complex Space for Extreme Abstractive Text Summarization (TL;DR) of Scientific Contents 2023 Yash Kumar Atri
Vikram Goyal
Tanmoy Chakraborty
+ TLDW: Extreme Multimodal Summarisation of News Videos 2022 Peggy Tang
Kun Hu
Lei Zhang
Jiebo Luo
Zhiyong Wang
+ Abstractive Sentence Summarization with Guidance of Selective Multimodal Reference. 2021 Zijian Zhang
Chenxi Zhang
Qinpei Zhao
Jiangfeng Li
+ VideoXum: Cross-modal Visual and Textural Summarization of Videos 2023 Jingyang Lin
Hang Hua
Ming Chen
Yikang Li
Jen-Hao Hsiao
Chiuman Ho
Jiebo Luo
+ D$^2$TV: Dual Knowledge Distillation and Target-oriented Vision Modeling for Many-to-Many Multimodal Summarization 2023 Yunlong Liang
Fandong Meng
Jiaan Wang
Jinan Xu
Yufeng Chen
Jie Zhou
+ Hierarchical Cross-Modality Semantic Correlation Learning Model for Multimodal Summarization 2021 Litian Zhang
Xiaoming Zhang
Junshu Pan
Feiran Huang
+ PDF Chat Hierarchical Cross-Modality Semantic Correlation Learning Model for Multimodal Summarization 2022 Litian Zhang
Xiaoming Zhang
Junshu Pan
+ Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization 2022 Yunlong Liang
Fandong Meng
Jinan Xu
Jiaan Wang
Yufeng Chen
Jie Zhou
+ D2TV: Dual Knowledge Distillation and Target-oriented Vision Modeling for Many-to-Many Multimodal Summarization 2023 Yunlong Liang
Fandong Meng
Jiaan Wang
Jinan Xu
Yufeng Chen
Jie Zhou

Works That Cite This (0)

Action Title Year Authors

Works Cited by This (0)

Action Title Year Authors