+
|
RoBERTa: A Robustly Optimized BERT Pretraining Approach
|
2019
|
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
Mike Lewis
Luke Zettlemoyer
Veselin Stoyanov
|
4
|
+
|
CIDEr: Consensus-based image description evaluation
|
2015
|
Ramakrishna Vedantam
C. Lawrence Zitnick
Devi Parikh
|
3
|
+
|
Show and tell: A neural image caption generator
|
2015
|
Oriol Vinyals
Alexander Toshev
Samy Bengio
Dumitru Erhan
|
3
|
+
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
3
|
+
|
Entity-aware Image Caption Generation
|
2018
|
Di Lu
Spencer Whitehead
Lifu Huang
Heng Ji
Shih-Fu Chang
|
3
|
+
|
Variations of the Similarity Function of TextRank for Automated Summarization
|
2016
|
Federico Barrios
Federico López
Luis Argerich
Rosa Wachenchauzer
|
3
|
+
|
Microsoft COCO Captions: Data Collection and Evaluation Server
|
2015
|
Xinlei Chen
Hao Fang
Tsung-Yi Lin
Ramakrishna Vedantam
Saurabh Gupta
Piotr Dollár
C. Lawrence Zitnick
|
3
|
+
|
Adam: A Method for Stochastic Optimization
|
2014
|
Diederik P. Kingma
Jimmy Ba
|
3
|
+
|
Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning
|
2017
|
Jiasen Lu
Caiming Xiong
Devi Parikh
Richard Socher
|
3
|
+
|
Get To The Point: Summarization with Pointer-Generator Networks
|
2017
|
Abigail See
Peter J. Liu
Christopher D. Manning
|
3
|
+
|
Good News, Everyone! Context Driven Entity-Aware Captioning for News Images
|
2019
|
Ali Furkan Biten
Lluís Gómez
Marçal Rusiñol
Dimosthenis Karatzas
|
3
|
+
|
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
|
2018
|
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
|
3
|
+
|
Attention on Attention for Image Captioning
|
2019
|
Lun Huang
Wenmin Wang
Jie Chen
Xiao-Yong Wei
|
3
|
+
|
Long-term recurrent convolutional networks for visual recognition and description
|
2015
|
Jeff Donahue
Lisa Anne Hendricks
Sergio Guadarrama
Marcus Rohrbach
Subhashini Venugopalan
Trevor Darrell
Kate Saenko
|
3
|
+
|
Informative Image Captioning with External Sources of Information
|
2019
|
Sanqiang Zhao
Piyush Sharma
Tomer Levinboim
Radu Soricut
|
2
|
+
|
Image Captioning with Semantic Attention
|
2016
|
Quanzeng You
Hailin Jin
Zhaowen Wang
Fang Chen
Jiebo Luo
|
2
|
+
|
BreakingNews: Article Annotation by Image and Text Processing
|
2017
|
Arnau Ramisa
Fei Yan
Francesc Moreno-Noguer
Krystian Mikolajczyk
|
2
|
+
|
Transform and Tell: Entity-Aware News Image Captioning
|
2020
|
Alasdair Tran
A. P. Mathews
Lexing Xie
|
2
|
+
|
Attention is All you Need
|
2017
|
Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
|
2
|
+
|
ROUGE 2.0: Updated and Improved Measures for Evaluation of Summarization Tasks
|
2018
|
Kavita Ganesan
|
2
|
+
|
Deep visual-semantic alignments for generating image descriptions
|
2015
|
Andrej Karpathy
Li Fei-Fei
|
2
|
+
|
From captions to visual concepts and back
|
2015
|
Hao Fang
Saurabh Gupta
Forrest Iandola
Rupesh K. Srivastava
Li Deng
Piotr Dollár
Jianfeng Gao
Xiaodong He
Margaret Mitchell
John Platt
|
2
|
+
|
BERTweet: A pre-trained language model for English Tweets
|
2020
|
Dat Quoc Nguyen
Thanh Vu
Anh Tuan Nguyen
|
1
|
+
|
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
|
2020
|
Jie Lei
Licheng Yu
Tamara L. Berg
Mohit Bansal
|
1
|
+
|
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training
|
2020
|
Linjie Li
Yen-Chun Chen
Yu Cheng
Zhe Gan
Licheng Yu
Jingjing Liu
|
1
|
+
|
Automatic Fact-Checking Using Context and Discourse Information
|
2019
|
Pepa Atanasova
Preslav Nakov
Lluís Màrquez
Alberto Barrón-Cedeño
Georgi Karadzhov
Tsvetomila Mihaylova
Mitra Mohtarami
James Glass
|
1
|
+
|
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
|
2021
|
Huaishao Luo
Lei Ji
Ming Zhong
Yang Chen
Wen Lei
Nan Duan
Tianrui Li
|
1
|
+
|
Multi-modal Semantic Inconsistency Detection in Social Media News Posts
|
2022
|
Scott McCrae
Kehan Wang
Avideh Zakhor
|
1
|
+
|
Learning Transferable Visual Models From Natural Language Supervision
|
2021
|
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya Ramesh
Gabriel Goh
Sandhini Agarwal
Girish Sastry
Amanda Askell
Pamela Mishkin
Jack Clark
|
1
|
+
|
Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
|
2021
|
Tom Hope
Aida Amini
David Wadden
Madeleine van Zuylen
Sravanthi Parasa
Eric Horvitz
Daniel S. Weld
Roy Schwartz
Hannaneh Hajishirzi
|
1
|
+
|
COVID-Fact: Fact Extraction and Verification of Real-World Claims on COVID-19 Pandemic
|
2021
|
Arkadiy Saakyan
Tuhin Chakrabarty
Smaranda Muresan
|
1
|
+
|
CLIP2Video: Mastering Video-Text Retrieval via Image CLIP
|
2021
|
Han Fang
Pengfei Xiong
Luhui Xu
Yu Chen
|
1
|
+
|
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
|
2021
|
Hu Xu
Gargi Ghosh
Po-Yao Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Christoph Feichtenhofer
|
1
|
+
|
Visual News: Benchmark and Challenges in News Image Captioning
|
2021
|
Fuxiao Liu
Yinghan Wang
Tianlu Wang
Vicente Ordóñez
|
1
|
+
|
A Review on Fact Extraction and Verification
|
2021
|
Giannis Bekoulis
Christina Papagiannopoulou
Nikos Deligiannis
|
1
|
+
|
Misinformation Detection in Social Media Video Posts
|
2022
|
Kehan Wang
David Chan
Seth Z. Zhao
John Canny
Avideh Zakhor
|
1
|
+
|
NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media
|
2021
|
Grace Luo
Trevor Darrell
Anna Rohrbach
|
1
|
+
|
Variations of the Similarity Function of TextRank for Automated Summarization
|
2016
|
Federico Barrios
Federico López
Luis Argerich
Rosa Wachenchauzer
|
1
|
+
|
Defending Against Neural Fake News
|
2019
|
Rowan Zellers
Ari Holtzman
Hannah Rashkin
Yonatan Bisk
Ali Farhadi
Franziska Roesner
Yejin Choi
|
1
|
+
|
Attention Is All You Need
|
2017
|
Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
|
1
|
+
|
A dataset for Movie Description
|
2015
|
Anna Rohrbach
Marcus Rohrbach
Niket Tandon
Bernt Schiele
|
1
|
+
|
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
|
2018
|
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
|
1
|
+
|
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
|
2015
|
Kelvin Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
Richard S. Zemel
Yoshua Bengio
|
1
|
+
|
Neural Baby Talk
|
2018
|
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
|
1
|
+
|
Informative Image Captioning with External Sources of Information
|
2019
|
Sanqiang Zhao
Piyush Sharma
Tomer Levinboim
Radu Soricut
|
1
|
+
|
Image Captioning with Semantic Attention
|
2016
|
Quanzeng You
Hailin Jin
Zhaowen Wang
Fang Chen
Jiebo Luo
|
1
|
+
|
Localizing Moments in Video with Natural Language
|
2017
|
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Šivic
Trevor Darrell
Bryan Russell
|
1
|
+
|
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
|
2018
|
Adina Williams
Nikita Nangia
Samuel Bowman
|
1
|
+
|
Dense-Captioning Events in Videos
|
2017
|
Ranjay Krishna
Kenji Hata
Frederic Ren
Li Fei-Fei
Juan Carlos Niebles
|
1
|
+
|
FEVER: a Large-scale Dataset for Fact Extraction and VERification
|
2018
|
James Thorne
Andreas Vlachos
Christos Christodoulopoulos
Arpit Mittal
|
1
|