Feature Pyramid Networks for Object Detection

Type: Preprint

Publication Date: 2017-07-01

Citations: 22469

DOI: https://doi.org/10.1109/cvpr.2017.106

Download PDF

Abstract

Feature pyramids are a basic component in recognition systems for detecting objects at different scales. But pyramid representations have been avoided in recent object detectors that are based on deep convolutional networks, partially because they are slow to compute and memory intensive. In this paper, we exploit the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost. A top-down architecture with lateral connections is developed for building high-level semantic feature maps at all scales. This architecture, called a Feature Pyramid Network (FPN), shows significant improvement as a generic feature extractor in several applications. Using a basic Faster R-CNN system, our method achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles, surpassing all existing single-model entries including those from the COCO 2016 challenge winners. In addition, our method can run at 5 FPS on a GPU and thus is a practical and accurate solution to multi-scale object detection. Code will be made publicly available.

Locations

  • arXiv (Cornell University) - View - PDF

Works That Cite This (5193)

Action Title Year Authors
+ ReGroup: Recursive Neural Networks for Hierarchical Grouping of Vector Graphic Primitives 2021 Sumit Kumar Chaturvedi
Michal Lukáč
Siddhartha Chaudhuri
+ PDF Chat One Step Learning, One Step Review 2024 Xiaolong Huang
Qiankun Li
Xueran Li
Xuesong Gao
+ PDF Chat Leveraging Swin Transformer for Local-to-Global Weakly Supervised Semantic Segmentation 2024 Rozhan Ahmadi
Shohreh Kasaei
+ PDF Chat Grafting Vision Transformers 2024 Jongwoo Park
Kumara Kahatapitiya
Donghyun Kim
Shivchander Sudalairaj
Quanfu Fan
Michael S. Ryoo
+ PDF Chat Efficient Transferability Assessment for Selection of Pre-trained Detectors 2024 Wang Zhao
Aoxue Li
Zhenguo Li
Qi Dou
+ You Better Look Twice: a new perspective for designing accurate detectors with reduced computations. 2021 Alexandra Dana
Maor Shutman
Yotam Perlitz
Ran Vitek
Tomer Peleg
Roy J. Jevnisek
+ UVO Challenge on Video-based Open-World Segmentation 2021: 1st Place Solution 2021 Yuming Du
W. Guo
Yang Xiao
Vincent Lepetit
+ UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model 2021 Haonan Yan
Jiaqi Chen
Xujie Zhang
Shengkai Zhang
Nianhong Jiao
Xiaodan Liang
Tianxiang Zheng
+ PDF Chat SCPMan: Shape context and prior constrained multi-scale attention network for pancreatic segmentation 2024 Leilei Zeng
Xuechen Li
Xinquan Yang
Wenting Chen
Jingxin Liu
Linlin Shen
Song Wu
+ PDF Chat Cerberus: Attribute-based person re-identification using semantic IDs 2024 Chanho Eom
Geon Lee
Kyunghwan Cho
Hyeonseok Jung
Moon-sub Jin
Bumsub Ham