+
|
Visual Semantic Role Labeling
|
2015
|
Saurabh Gupta
Jitendra Malik
|
1
|
+
|
Microsoft COCO Captions: Data Collection and Evaluation Server
|
2015
|
Xinlei Chen
Hao Fang
Tsung-Yi Lin
Ramakrishna Vedantam
Saurabh Gupta
Piotr Dollár
C. Lawrence Zitnick
|
1
|
+
PDF
Chat
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
1
|
+
PDF
Chat
|
Stacked Hourglass Networks for Human Pose Estimation
|
2016
|
Alejandro Newell
Kaiyu Yang
Jia Deng
|
1
|
+
PDF
Chat
|
Aggregated Residual Transformations for Deep Neural Networks
|
2017
|
Saining Xie
Ross Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
|
1
|
+
PDF
Chat
|
Scene Graph Generation by Iterative Message Passing
|
2017
|
Danfei Xu
Yuke Zhu
Christopher Choy
Li Fei-Fei
|
1
|
+
PDF
Chat
|
Factorizable Net: An Efficient Subgraph-Based Framework for Scene Graph Generation
|
2018
|
Yikang Li
Wanli Ouyang
Bolei Zhou
Jianping Shi
Chao Zhang
Xiaogang Wang
|
1
|
+
PDF
Chat
|
Graph R-CNN for Scene Graph Generation
|
2018
|
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
|
1
|
+
PDF
Chat
|
Learning Human-Object Interactions by Graph Parsing Neural Networks
|
2018
|
Siyuan Qi
Wenguan Wang
Baoxiong Jia
Jianbing Shen
Song‐Chun Zhu
|
1
|
+
|
iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection
|
2018
|
Chen Gao
Yuliang Zou
Jia‐Bin Huang
|
1
|
+
PDF
Chat
|
Exploring Visual Relationship for Image Captioning
|
2018
|
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
|
1
|
+
|
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
|
2018
|
Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
|
1
|
+
|
The Curious Case of Neural Text Degeneration
|
2019
|
Ari Holtzman
Jan Buys
Li Du
Maxwell Forbes
Yejin Choi
|
1
|
+
PDF
Chat
|
Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression
|
2019
|
Hamid Rezatofighi
Nathan Tsoi
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
|
1
|
+
PDF
Chat
|
Knowledge-Embedded Routing Network for Scene Graph Generation
|
2019
|
Tianshui Chen
Weihao Yu
Riquan Chen
Liang Lin
|
1
|
+
PDF
Chat
|
Transferable Interactiveness Knowledge for Human-Object Interaction Detection
|
2019
|
Yong-Lu Li
Siyuan Zhou
Xijie Huang
Liang Xu
Ze Ma
Hao-Shu Fang
Yanfeng Wang
Cewu Lu
|
1
|
+
PDF
Chat
|
Detecting and Recognizing Human-Object Interactions
|
2018
|
Georgia Gkioxari
Ross Girshick
Piotr Dollár
Kaiming He
|
1
|
+
PDF
Chat
|
Focal Loss for Dense Object Detection
|
2017
|
Tsung-Yi Lin
Priya Goyal
Ross Girshick
Kaiming He
Piotr Dollár
|
1
|
+
PDF
Chat
|
Squeeze-and-Excitation Networks
|
2019
|
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
|
1
|
+
PDF
Chat
|
Graphical Contrastive Losses for Scene Graph Parsing
|
2019
|
Ji Zhang
Kevin J. Shih
Ahmed Elgammal
Andrew Tao
Bryan Catanzaro
|
1
|
+
PDF
Chat
|
Neural Motifs: Scene Graph Parsing with Global Context
|
2018
|
Rowan Zellers
Mark Yatskar
Sam Thomson
Yejin Choi
|
1
|
+
PDF
Chat
|
Learning to Compose Dynamic Tree Structures for Visual Contexts
|
2019
|
Kaihua Tang
Hanwang Zhang
Baoyuan Wu
Wenhan Luo
Wei Liu
|
1
|
+
|
RoBERTa: A Robustly Optimized BERT Pretraining Approach
|
2019
|
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
Mike Lewis
Luke Zettlemoyer
Veselin Stoyanov
|
1
|
+
PDF
Chat
|
Rethinking ImageNet Pre-Training
|
2019
|
Kaiming He
Ross Girshick
Piotr Dollár
|
1
|
+
PDF
Chat
|
Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval
|
2020
|
Sijin Wang
Ruiping Wang
Ziwei Yao
Shiguang Shan
Xilin Chen
|
1
|
+
PDF
Chat
|
GPS-Net: Graph Property Sensing Network for Scene Graph Generation
|
2020
|
Xin Lin
Changxing Ding
Jinquan Zeng
Dacheng Tao
|
1
|
+
|
Rethinking Pre-training and Self-training
|
2020
|
Barret Zoph
Golnaz Ghiasi
Tsung-Yi Lin
Yin Cui
Hanxiao Liu
Ekin D. Cubuk
Quoc V. Le
|
1
|
+
PDF
Chat
|
Unbiased Scene Graph Generation From Biased Training
|
2020
|
Kaihua Tang
Yulei Niu
Jianqiang Huang
Jiaxin Shi
Hanwang Zhang
|
1
|
+
PDF
Chat
|
PPDM: Parallel Point Detection and Matching for Real-Time Human-Object Interaction Detection
|
2020
|
Yue Liao
Si Liu
Fei Wang
Yanjie Chen
Chen Qian
Jiashi Feng
|
1
|
+
|
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
|
2020
|
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
Thomas Unterthiner
Mostafa Dehghani
Matthias Minderer
Georg Heigold
Sylvain Gelly
|
1
|
+
PDF
Chat
|
DRG: Dual Relation Graph for Human-Object Interaction Detection
|
2020
|
Chen Gao
Jiarui Xu
Yuliang Zou
Jia‐Bin Huang
|
1
|
+
PDF
Chat
|
End-to-End Object Detection with Transformers
|
2020
|
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
|
1
|
+
|
HOI Analysis: Integrating and Decomposing Human-Object Interaction
|
2020
|
Yong-Lu Li
Xinpeng Liu
Xiaoqian Wu
Yizhuo Li
Cewu Lu
|
1
|
+
PDF
Chat
|
Visual Compositional Learning for Human-Object Interaction Detection
|
2020
|
Zhi Hou
Xiaojiang Peng
Yu Qiao
Dacheng Tao
|
1
|
+
PDF
Chat
|
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
|
2021
|
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
Baining Guo
|
1
|
+
PDF
Chat
|
MDETR - Modulated Detection for End-to-End Multi-Modal Understanding
|
2021
|
Aishwarya Kamath
Mannat Singh
Yann LeCun
Gabriel Synnaeve
Ishan Misra
Nicolas Carion
|
1
|
+
|
Learning Transferable Visual Models From Natural Language Supervision
|
2021
|
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya Ramesh
Gabriel Goh
Sandhini Agarwal
Girish Sastry
Amanda Askell
Pamela Mishkin
Jack Clark
|
1
|
+
PDF
Chat
|
Affordance Transfer Learning for Human-Object Interaction Detection
|
2021
|
Zhi Hou
Baosheng Yu
Yu Qiao
Xiaojiang Peng
Dacheng Tao
|
1
|
+
PDF
Chat
|
Reformulating HOI Detection as Adaptive Set Prediction
|
2021
|
Chen Ming-fei
Yue Liao
Si Liu
Zhiyuan Chen
Fei Wang
Chen Qian
|
1
|
+
PDF
Chat
|
End-to-End Human Object Interaction Detection with HOI Transformer
|
2021
|
Cheng Zou
Bohan Wang
Yue Hu
Junqi Liu
Qian Wu
Yu Zhao
Boxun Li
Chenguang Zhang
Chi Zhang
Yichen Wei
|
1
|
+
PDF
Chat
|
Dynamic Head: Unifying Object Detection Heads with Attentions
|
2021
|
Xiyang Dai
Yinpeng Chen
Bin Xiao
Dongdong Chen
Mengchen Liu
Lu Yuan
Lei Zhang
|
1
|
+
PDF
Chat
|
Fully Convolutional Scene Graph Generation
|
2021
|
Hengyue Liu
Ning Yan
Masood Mortazavi
Bir Bhanu
|
1
|
+
PDF
Chat
|
QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information
|
2021
|
Masato Tamura
Hiroki Ohashi
Tomoaki Yoshinaga
|
1
|
+
PDF
Chat
|
HOTR: End-to-End Human-Object Interaction Detection with Transformers
|
2021
|
Bumsoo Kim
Junhyun Lee
Jaewoo Kang
Eun‐Sol Kim
Hyunwoo J. Kim
|
1
|
+
PDF
Chat
|
Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation
|
2021
|
Rongjie Li
Songyang Zhang
Bo Wan
Xuming He
|
1
|
+
PDF
Chat
|
Glance and Gaze: Inferring Action-aware Points for One-Stage Human-Object Interaction Detection
|
2021
|
Xubin Zhong
Xian Qu
Changxing Ding
Dacheng Tao
|
1
|
+
|
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
|
2021
|
Junnan Li
Ramprasaath R. Selvaraju
Akhilesh Gotmare
Shafiq Joty
Caiming Xiong
Steven C. H. Hoi
|
1
|
+
|
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
|
2021
|
Zirui Wang
Jiahui Yu
Adams Wei Yu
Zihang Dai
Yulia Tsvetkov
Yuan Cao
|
1
|
+
PDF
Chat
|
Detecting Human-Object Interaction via Fabricated Compositional Learning
|
2021
|
Zhi Hou
Baosheng Yu
Yu Qiao
Xiaojiang Peng
Dacheng Tao
|
1
|
+
PDF
Chat
|
Learning to Generate Scene Graph from Natural Language Supervision
|
2021
|
Yiwu Zhong
Jing Shi
Jianwei Yang
Chenliang Xu
Li Yin
|
1
|