Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery

Type: Preprint

Publication Date: 2024-03-18

Citations: 1

DOI: https://doi.org/10.48550/arxiv.2403.11812

Abstract

We present a neural radiance field method for urban-scale semantic and building-level instance segmentation from aerial images by lifting noisy 2D labels to 3D. This is a challenging problem due to two primary reasons. Firstly, objects in urban aerial images exhibit substantial variations in size, including buildings, cars, and roads, which pose a significant challenge for accurate 2D segmentation. Secondly, the 2D labels generated by existing segmentation methods suffer from the multi-view inconsistency problem, especially in the case of aerial images, where each image captures only a small portion of the entire scene. To overcome these limitations, we first introduce a scale-adaptive semantic label fusion strategy that enhances the segmentation of objects of varying sizes by combining labels predicted from different altitudes, harnessing the novel-view synthesis capabilities of NeRF. We then introduce a novel cross-view instance label grouping strategy based on the 3D scene representation to mitigate the multi-view inconsistency problem in the 2D instance labels. Furthermore, we exploit multi-view reconstructed depth priors to improve the geometric quality of the reconstructed radiance field, resulting in enhanced segmentation results. Experiments on multiple real-world urban-scale datasets demonstrate that our approach outperforms existing methods, highlighting its effectiveness.

Locations

  • arXiv (Cornell University) - View - PDF

Similar Works

Action Title Year Authors
+ Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation 2022 Xiao Fu
Shangzhan Zhang
Tianrun Chen
Yichong Lu
Lanyun Zhu
Xiaowei Zhou
Andreas Geiger
Yiyi Liao
+ PDF Chat Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation 2022 Xiao Fu
Shangzhan Zhang
Tianrun Chen
Yichong Lu
Lanyun Zhu
Xiaowei Zhou
Andreas Geiger
Yiyi Liao
+ PDF Chat Instance Neural Radiance Field 2023 Yichen Liu
Benran Hu
Junkai Huang
Yu‐Wing Tai
Chi-Keung Tang
+ SaNet: Scale-aware Neural Network for Semantic Labelling of Multiple Spatial Resolution Aerial Images 2021 Libo Wang
Shenghui Fang
Ce Zhang
Rui Li
Chenxi Duan
Xiaoliang Meng
Peter M. Atkinson
+ Predicting Ground-Level Scene Layout from Aerial Imagery 2016 Menghua Zhai
Zachary Bessinger
Scott Workman
Nathan Jacobs
+ APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds 2023 Weijie Wei
Martin R. Oswald
Fatemeh Karimi Nejadasl
Theo Gevers
+ PDF Chat APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds 2023 Weijie Wei
Martin R. Oswald
Fatemeh Karimi Nejadasl
Theo Gevers
+ PDF Chat OmniCity: Omnipotent City Understanding with Multi-Level and Multi-View Images 2023 Weijia Li
Yawen Lai
Linning Xu
Yuanbo Xiangli
Jinhua Yu
Conghui He
Gui-Song Xia
Dahua Lin
+ OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images 2022 Weijia Li
Yawen Lai
Linning Xu
Yuanbo Xiangli
Jinhua Yu
Conghui He
Gui-Song Xia
Dahua Lin
+ PDF Chat Campus3D 2020 Xinke Li
Chongshou Li
Zekun Tong
Andrew Lim
Junsong Yuan
Yuwei Wu
Jing Tang
Raymond Huang
+ UrbanBIS: a Large-scale Benchmark for Fine-grained Urban Building Instance Segmentation 2023 Guoqing Yang
Fuyou Xue
Qi Zhang
Ke Xie
Chi‐Wing Fu
Hui Huang
+ Instance Neural Radiance Field 2023 Benran Hu
Junkai Huang
Yichen Liu
Yu‐Wing Tai
Chi-Keung Tang
+ Panoptic Lifting for 3D Scene Understanding with Neural Fields 2022 Yawar Siddiqui
Lorenzo Porzi
Samuel Rota Bulò
Norman Müller
Matthias Nießner
Angela Dai
Peter Kontschieder
+ PDF Chat Panoptic Lifting for 3D Scene Understanding with Neural Fields 2023 Yawar Siddiqui
Lorenzo Porzi
Samuel Rota Bulò
Norman Müller
Matthias Nießner
Angela Dai
Peter Kontschieder
+ Implicit Ray-Transformers for Multi-view Remote Sensing Image Segmentation 2023 Zipeng Qi
Hao Chen
Chenyang Liu
Zhenwei Shi
Zhengxia Zou
+ Large-Scale 3D Scene Classification With Multi-View Volumetric CNN 2017 Dror Aiger
Brett L. Allen
Aleksey Golovinskiy
+ PDF Chat Semantic-Aware Network for Aerial-To-Ground Image Synthesis 2021 Jinhyun Jang
Taeyong Song
Kwanghoon Sohn
+ Semantic-aware Network for Aerial-to-Ground Image Synthesis 2023 Jinhyun Jang
Taeyong Song
Kwanghoon Sohn
+ PDF Chat Implicit Ray Transformers for Multiview Remote Sensing Image Segmentation 2023 Zipeng Qi
Hao Chen
Chenyang Liu
Zhenwei Shi
Zhengxia Zou
+ PDF Chat Multi-view Remote Sensing Image Segmentation With SAM priors 2024 Zipeng Qi
Chenyang Liu
Zili Liu
Hao Chen
Yongchang Wu
Zhengxia Zou
Zhenwei Sh

Works That Cite This (0)

Action Title Year Authors

Works Cited by This (0)

Action Title Year Authors