Projects
Reading
People
Chat
SU\G
(𝔸)
/K·U
Projects
Reading
People
Chat
Sign Up
Light
Dark
System
Ruifei Zhang
Follow
Share
Generating author description...
All published works
Action
Title
Year
Authors
+
PDF
Chat
Advancing Visual Grounding with Scene Knowledge: Benchmark and Method
2023
Zhihong Chen
Ruifei Zhang
Yibing Song
Xiang Wan
Guanbin Li
+
Advancing Visual Grounding with Scene Knowledge: Benchmark and Method
2023
Zhihong Chen
Ruifei Zhang
Yibing Song
Xiang Wan
Guanbin Li
Common Coauthors
Coauthor
Papers Together
Xiang Wan
2
Guanbin Li
2
Yibing Song
2
Zhihong Chen
1
Zhihong Chen
1
Commonly Cited References
Action
Title
Year
Authors
# of times referenced
+
PDF
Chat
Modeling Context in Referring Expressions
2016
Licheng Yu
Patrick Poirson
Shan Yang
Alexander C. Berg
Tamara L. Berg
1
+
PDF
Chat
Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks
2019
Peng Wang
Qi Wu
Jiewei Cao
Chunhua Shen
Lianli Gao
Anton van den Hengel
1
+
PDF
Chat
CLEVR-Ref+: Diagnosing Visual Reasoning With Referring Expressions
2019
Runtao Liu
Chenxi Liu
Yutong Bai
Alan Yuille
1
+
PDF
Chat
LVIS: A Dataset for Large Vocabulary Instance Segmentation
2019
Agrim Gupta
Piotr Dollár
Ross Girshick
1
+
Microsoft COCO: Common Objects in Context
2014
Tsung-Yi Lin
Michael Maire
Serge Belongie
Lubomir Bourdev
Ross Girshick
James Hays
Pietro Perona
Deva Ramanan
C. Lawrence Zitnick
Piotr Dollár
1
+
PDF
Chat
Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing
2019
Xihui Liu
Zihao Wang
Jing Shao
Xiaogang Wang
Hongsheng Li
1
+
PDF
Chat
Visual7W: Grounded Question Answering in Images
2016
Yuke Zhu
Oliver Groth
Michael S. Bernstein
Li Fei-Fei
1
+
PDF
Chat
Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression
2019
Hamid Rezatofighi
Nathan Tsoi
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
1
+
PDF
Chat
You Only Look Once: Unified, Real-Time Object Detection
2016
Joseph Redmon
Santosh Divvala
Ross Girshick
Ali Farhadi
1
+
PDF
Chat
Generation and Comprehension of Unambiguous Object Descriptions
2016
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana Camburu
Alan Yuille
Kevin Murphy
1
+
PDF
Chat
From Recognition to Cognition: Visual Commonsense Reasoning
2019
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
1
+
PDF
Chat
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
2017
Chuang Gan
Yandong Li
Haoxiang Li
Chen Sun
Boqing Gong
1
+
PDF
Chat
Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments
2018
Peter Anderson
Qi Wu
Damien Teney
Jake Bruce
Mark Johnson
Niko Sünderhauf
Ian Reid
Stephen Jay Gould
Anton van den Hengel
1
+
PDF
Chat
MAttNet: Modular Attention Network for Referring Expression Comprehension
2018
Licheng Yu
Zhe Lin
Xiaohui Shen
Shuicheng Yan
Xin Lu
Mohit Bansal
Tamara L. Berg
1
+
PDF
Chat
Learning to Assemble Neural Module Tree Networks for Visual Grounding
2019
Daqing Liu
Hanwang Zhang
Zheng-Jun Zha
Feng Wu
1
+
PDF
Chat
Dynamic Graph Attention for Referring Expression Comprehension
2019
Sibei Yang
Guanbin Li
Yizhou Yu
1
+
PDF
Chat
A Fast and Accurate One-Stage Approach to Visual Grounding
2019
Zhengyuan Yang
Boqing Gong
Liwei Wang
Wenbing Huang
Dong Yu
Jiebo Luo
1
+
PDF
Chat
PhraseCut: Language-Based Image Segmentation in the Wild
2020
Chenyun Wu
Zhe Lin
Scott Cohen
Trung Bui
Subhransu Maji
1
+
PDF
Chat
Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension
2020
Zhenfang Chen
Peng Wang
Lin Ma
Kenneth K. Wong
Qi Wu
1
+
PDF
Chat
Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
2020
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Liujuan Cao
Chenglin Wu
Cheng Deng
Rongrong Ji
1
+
PDF
Chat
Graph-Structured Referring Expression Reasoning in the Wild
2020
Sibei Yang
Guanbin Li
Yizhou Yu
1
+
PDF
Chat
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
2020
Xinyu Wang
Yuliang Liu
Chunhua Shen
Chun Chet Ng
Canjie Luo
Lianwen Jin
Chee Seng Chan
Anton van den Hengel
Handong Wang
1
+
PDF
Chat
Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge
2020
Peng Wang
Dongyang Liu
Hui Li
Qi Wu
1
+
PDF
Chat
End-to-End Object Detection with Transformers
2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
1
+
SSD: Single Shot MultiBox Detector
2016
Wei Liu
Dragomir Anguelov
Dumitru Erhan
Christian Szegedy
Scott Reed
Cheng-Yang Fu
Alexander C. Berg
1
+
PDF
Chat
Improving One-Stage Visual Grounding by Recursive Sub-query Construction
2020
Zhengyuan Yang
Tianlang Chen
Liwei Wang
Jiebo Luo
1
+
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning
2021
Zhenfang Chen
Jiayuan Mao
Jiajun Wu
Kenneth K. Wong
Joshua B. Tenenbaum
Chuang Gan
1
+
PDF
Chat
MDETR - Modulated Detection for End-to-End Multi-Modal Understanding
2021
Aishwarya Kamath
Mannat Singh
Yann LeCun
Gabriel Synnaeve
Ishan Misra
Nicolas Carion
1
+
PDF
Chat
Dynamic Head: Unifying Object Detection Heads with Attentions
2021
Xiyang Dai
Yinpeng Chen
Bin Xiao
Dongdong Chen
Mengchen Liu
Lu Yuan
Lei Zhang
1
+
PDF
Chat
Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding
2021
Binbin Huang
Dongze Lian
Weixin Luo
Shenghua Gao
1
+
PDF
Chat
A Real-Time Global Inference Network for One-Stage Referring Expression Comprehension
2021
Yiyi Zhou
Rongrong Ji
Gen Luo
Xiaoshuai Sun
Jinsong Su
Xinghao Ding
Chia‐Wen Lin
Qi Tian
1
+
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos
2022
Zhenfang Chen
Kexin Yi
Yunzhu Li
Mingyu Ding
Antonio Torralba
Joshua B. Tenenbaum
Chuang Gan
1
+
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
2021
Mingyu Ding
Zhenfang Chen
Tao Du
Ping Luo
Joshua B. Tenenbaum
Chuang Gan
1
+
PDF
Chat
TransVG: End-to-End Visual Grounding with Transformers
2021
Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wengang Zhou
Houqiang Li
1
+
GLIPv2: Unifying Localization and Vision-Language Understanding
2022
Haotian Zhang
Pengchuan Zhang
Xiaowei Hu
Yen‐Chun Chen
Liunian Harold Li
Xiyang Dai
Lijuan Wang
Lu Yuan
Jenq‐Neng Hwang
Jianfeng Gao
1
+
PDF
Chat
Grounded Language-Image Pre-training
2022
Liunian Harold Li
Pengchuan Zhang
Haotian Zhang
Jianwei Yang
Chunyuan Li
Yiwu Zhong
Lijuan Wang
Lu Yuan
Lei Zhang
Jenq‐Neng Hwang
1
+
PDF
Chat
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling
2022
Zhengyuan Yang
Zhe Gan
Jianfeng Wang
Xiaowei Hu
Faisal Ahmed
Zicheng Liu
Yumao Lu
Lijuan Wang
1
+
Find Someone Who: Visual Commonsense Understanding in Human-Centric Grounding
2022
Haoxuan You
Rui Sun
Zhecan Wang
Kai-Wei Chang
Shih‐Fu Chang
1
+
PDF
Chat
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
2016
Shaoqing Ren
Kaiming He
Ross Girshick
Jian Sun
1