Ruifei Zhang

Follow

Generating author description...

Common Coauthors
Coauthor Papers Together
Xiang Wan 2
Guanbin Li 2
Yibing Song 2
Zhihong Chen 1
Zhihong Chen 1
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Modeling Context in Referring Expressions 2016 Licheng Yu
Patrick Poirson
Shan Yang
Alexander C. Berg
Tamara L. Berg
1
+ PDF Chat Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks 2019 Peng Wang
Qi Wu
Jiewei Cao
Chunhua Shen
Lianli Gao
Anton van den Hengel
1
+ PDF Chat CLEVR-Ref+: Diagnosing Visual Reasoning With Referring Expressions 2019 Runtao Liu
Chenxi Liu
Yutong Bai
Alan Yuille
1
+ PDF Chat LVIS: A Dataset for Large Vocabulary Instance Segmentation 2019 Agrim Gupta
Piotr Dollár
Ross Girshick
1
+ Microsoft COCO: Common Objects in Context 2014 Tsung-Yi Lin
Michael Maire
Serge Belongie
Lubomir Bourdev
Ross Girshick
James Hays
Pietro Perona
Deva Ramanan
C. Lawrence Zitnick
Piotr Dollár
1
+ PDF Chat Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing 2019 Xihui Liu
Zihao Wang
Jing Shao
Xiaogang Wang
Hongsheng Li
1
+ PDF Chat Visual7W: Grounded Question Answering in Images 2016 Yuke Zhu
Oliver Groth
Michael S. Bernstein
Li Fei-Fei
1
+ PDF Chat Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression 2019 Hamid Rezatofighi
Nathan Tsoi
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
1
+ PDF Chat You Only Look Once: Unified, Real-Time Object Detection 2016 Joseph Redmon
Santosh Divvala
Ross Girshick
Ali Farhadi
1
+ PDF Chat Generation and Comprehension of Unambiguous Object Descriptions 2016 Junhua Mao
Jonathan Huang
Alexander Toshev
Oana Camburu
Alan Yuille
Kevin Murphy
1
+ PDF Chat From Recognition to Cognition: Visual Commonsense Reasoning 2019 Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
1
+ PDF Chat VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation 2017 Chuang Gan
Yandong Li
Haoxiang Li
Chen Sun
Boqing Gong
1
+ PDF Chat Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments 2018 Peter Anderson
Qi Wu
Damien Teney
Jake Bruce
Mark Johnson
Niko Sünderhauf
Ian Reid
Stephen Jay Gould
Anton van den Hengel
1
+ PDF Chat MAttNet: Modular Attention Network for Referring Expression Comprehension 2018 Licheng Yu
Zhe Lin
Xiaohui Shen
Shuicheng Yan
Xin Lu
Mohit Bansal
Tamara L. Berg
1
+ PDF Chat Learning to Assemble Neural Module Tree Networks for Visual Grounding 2019 Daqing Liu
Hanwang Zhang
Zheng-Jun Zha
Feng Wu
1
+ PDF Chat Dynamic Graph Attention for Referring Expression Comprehension 2019 Sibei Yang
Guanbin Li
Yizhou Yu
1
+ PDF Chat A Fast and Accurate One-Stage Approach to Visual Grounding 2019 Zhengyuan Yang
Boqing Gong
Liwei Wang
Wenbing Huang
Dong Yu
Jiebo Luo
1
+ PDF Chat PhraseCut: Language-Based Image Segmentation in the Wild 2020 Chenyun Wu
Zhe Lin
Scott Cohen
Trung Bui
Subhransu Maji
1
+ PDF Chat Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension 2020 Zhenfang Chen
Peng Wang
Lin Ma
Kenneth K. Wong
Qi Wu
1
+ PDF Chat Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation 2020 Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Liujuan Cao
Chenglin Wu
Cheng Deng
Rongrong Ji
1
+ PDF Chat Graph-Structured Referring Expression Reasoning in the Wild 2020 Sibei Yang
Guanbin Li
Yizhou Yu
1
+ PDF Chat On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering 2020 Xinyu Wang
Yuliang Liu
Chunhua Shen
Chun Chet Ng
Canjie Luo
Lianwen Jin
Chee Seng Chan
Anton van den Hengel
Handong Wang
1
+ PDF Chat Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge 2020 Peng Wang
Dongyang Liu
Hui Li
Qi Wu
1
+ PDF Chat End-to-End Object Detection with Transformers 2020 Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
1
+ SSD: Single Shot MultiBox Detector 2016 Wei Liu
Dragomir Anguelov
Dumitru Erhan
Christian Szegedy
Scott Reed
Cheng-Yang Fu
Alexander C. Berg
1
+ PDF Chat Improving One-Stage Visual Grounding by Recursive Sub-query Construction 2020 Zhengyuan Yang
Tianlang Chen
Liwei Wang
Jiebo Luo
1
+ Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning 2021 Zhenfang Chen
Jiayuan Mao
Jiajun Wu
Kenneth K. Wong
Joshua B. Tenenbaum
Chuang Gan
1
+ PDF Chat MDETR - Modulated Detection for End-to-End Multi-Modal Understanding 2021 Aishwarya Kamath
Mannat Singh
Yann LeCun
Gabriel Synnaeve
Ishan Misra
Nicolas Carion
1
+ PDF Chat Dynamic Head: Unifying Object Detection Heads with Attentions 2021 Xiyang Dai
Yinpeng Chen
Bin Xiao
Dongdong Chen
Mengchen Liu
Lu Yuan
Lei Zhang
1
+ PDF Chat Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding 2021 Binbin Huang
Dongze Lian
Weixin Luo
Shenghua Gao
1
+ PDF Chat A Real-Time Global Inference Network for One-Stage Referring Expression Comprehension 2021 Yiyi Zhou
Rongrong Ji
Gen Luo
Xiaoshuai Sun
Jinsong Su
Xinghao Ding
Chia‐Wen Lin
Qi Tian
1
+ ComPhy: Compositional Physical Reasoning of Objects and Events from Videos 2022 Zhenfang Chen
Kexin Yi
Yunzhu Li
Mingyu Ding
Antonio Torralba
Joshua B. Tenenbaum
Chuang Gan
1
+ Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language 2021 Mingyu Ding
Zhenfang Chen
Tao Du
Ping Luo
Joshua B. Tenenbaum
Chuang Gan
1
+ PDF Chat TransVG: End-to-End Visual Grounding with Transformers 2021 Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wengang Zhou
Houqiang Li
1
+ GLIPv2: Unifying Localization and Vision-Language Understanding 2022 Haotian Zhang
Pengchuan Zhang
Xiaowei Hu
Yen‐Chun Chen
Liunian Harold Li
Xiyang Dai
Lijuan Wang
Lu Yuan
Jenq‐Neng Hwang
Jianfeng Gao
1
+ PDF Chat Grounded Language-Image Pre-training 2022 Liunian Harold Li
Pengchuan Zhang
Haotian Zhang
Jianwei Yang
Chunyuan Li
Yiwu Zhong
Lijuan Wang
Lu Yuan
Lei Zhang
Jenq‐Neng Hwang
1
+ PDF Chat UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling 2022 Zhengyuan Yang
Zhe Gan
Jianfeng Wang
Xiaowei Hu
Faisal Ahmed
Zicheng Liu
Yumao Lu
Lijuan Wang
1
+ Find Someone Who: Visual Commonsense Understanding in Human-Centric Grounding 2022 Haoxuan You
Rui Sun
Zhecan Wang
Kai-Wei Chang
Shih‐Fu Chang
1
+ PDF Chat Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks 2016 Shaoqing Ren
Kaiming He
Ross Girshick
Jian Sun
1