Urban2Vec: Incorporating Street View Imagery and POIs for Multi-Modal Urban Neighborhood Embedding

Type: Article

Publication Date: 2020-04-03

Citations: 50

DOI: https://doi.org/10.1609/aaai.v34i01.5450

Abstract

Understanding intrinsic patterns and predicting spatiotemporal characteristics of cities require a comprehensive representation of urban neighborhoods. Existing works relied on either inter- or intra-region connectivities to generate neighborhood representations but failed to fully utilize the informative yet heterogeneous data within neighborhoods. In this work, we propose Urban2Vec, an unsupervised multi-modal framework which incorporates both street view imagery and point-of-interest (POI) data to learn neighborhood embeddings. Specifically, we use a convolutional neural network to extract visual features from street view images while preserving geospatial similarity. Furthermore, we model each POI as a bag-of-words containing its category, rating, and review information. Analog to document embedding in natural language processing, we establish the semantic similarity between neighborhood (“document”) and the words from its surrounding POIs in the vector space. By jointly encoding visual, textual, and geospatial information into the neighborhood representation, Urban2Vec can achieve performances better than baseline models and comparable to fully-supervised methods in downstream prediction tasks. Extensive experiments on three U.S. metropolitan areas also demonstrate the model interpretability, generalization capability, and its value in neighborhood similarity analysis.

Locations

  • Proceedings of the AAAI Conference on Artificial Intelligence - View - PDF
  • arXiv (Cornell University) - View - PDF

Similar Works

Action Title Year Authors
+ Urban2Vec: Incorporating Street View Imagery and POIs for Multi-Modal Urban Neighborhood Embedding 2020 Zhecheng Wang
Haoyuan Li
Ram Rajagopal
+ PDF Chat Urban Region Embedding via Multi-View Contrastive Prediction 2024 Zechen Li
Weiming Huang
Kai Zhao
Min Yang
Yongshun Gong
Meng Chen
+ Urban Region Embedding via Multi-View Contrastive Prediction 2023 Zechen Li
Weiming Huang
Kai Zhao
Min Yang
Yongshun Gong
Meng Chen
+ PDF Chat Region2Vec 2022 Yunlei Liang
Jiawei Zhu
Wen Ye
Song Gao
+ hood2vec: Identifying Similar Urban Areas Using Mobility Networks 2019 Xin Liu
Konstantinos Pelechrinis
Alexandros Labrinidis
+ hood2vec: Identifying Similar Urban Areas Using Mobility Networks. 2019 Xin Liu
Konstantinos Pelechrinis
Alexandros Labrinidis
+ PDF Chat Demo2Vec: Learning Region Embedding with Demographic Information 2024 Ya Wen
Yulun Zhou
+ Urban Region Profiling via A Multi-Graph Representation Learning Framework 2022 Yuyu Luo
F. Chung
Kai Chen
+ Macross: Urban Dynamics Modeling based on Metapath Guided Cross-Modal Embedding 2019 Yunan Zhang
Heting Gao
Tarek Abdelzaher
+ Region-Wise Attentive Multi-View Representation Learning for Urban Region Embeddings 2023 Weiliang Chan
Qianqian Ren
+ PDF Chat Region Embedding With Intra and Inter-View Contrastive Learning 2022 Liang Zhang
Cheng Long
Gao Cong
+ PDF Chat Multimodal Contrastive Learning of Urban Space Representations from POI Data 2024 Xinglei Wang
Tao Cheng
Stephen C.K. Law
Zichao Zeng
Lu Yin
Junyuan Liu
+ PDF Chat MuseCL: Predicting Urban Socioeconomic Indicators via Multi-Semantic Contrastive Learning 2024 Xixian Yong
Xiao Zhou
+ PDF Chat hex2vec 2021 Szymon Woźniak
Piotr Szymański
+ PDF Chat MuseCL: Predicting Urban Socioeconomic Indicators via Multi-Semantic Contrastive Learning 2024 Xixian Yong
Xiao‐Hua Zhou
+ PDF Chat Towards Effective Next POI Prediction: Spatial and Semantic Augmentation with Remote Sensing Data 2024 Jiang Nan
Haitao Yuan
Jianing Si
Minxiao Chen
Shangguang Wang
+ Unsupervised Learning of Parsimonious General-Purpose Embeddings for User and Location Modelling 2017 Jing Yang
Carsten Eickhoff
+ Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged Photos 2023 Ling Chen
Dandan Lyu
Shanshan Yu
Gencai Chen
+ Learning from #Barcelona Instagram data what Locals and Tourists post about its Neighbourhoods 2018 Raúl Gómez
Lluís Gómez
Jaume Gibert
Dìmosthenis Karatzas
+ Learning from #Barcelona Instagram data what Locals and Tourists post about its Neighbourhoods 2018 Raúl Gómez
Lluís Gómez
Jaume Gibert
Dìmosthenis Karatzas

Works That Cite This (17)

Action Title Year Authors
+ PDF Chat Knowledge-infused Contrastive Learning for Urban Imagery-based Socioeconomic Prediction 2023 Yu Liu
Xin Zhang
Jingtao Ding
Yanxin Xi
Yong Li
+ Combining discrete choice models and neural networks through embeddings: Formulation, interpretability and performance 2023 Ioanna Arkoudi
Rico Krueger
Carlos Lima Azevedo
Francisco C. Pereira
+ PDF Chat City Foundation Models for Learning General Purpose Representations from OpenStreetMap 2024 Pasquale Balsebre
Weiming Huang
Gao Cong
Yi Li
+ PDF Chat UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web 2024 Y. H. Yan
Haomin Wen
Siru Zhong
Wei Chen
Chen Haodong
Qingsong Wen
Roger Zimmermann
Yuxuan Liang
+ PDF Chat Urban Region Representation Learning with Attentive Fusion 2024 Fengze Sun
Jianzhong Qi
Yanchuan Chang
Fan Xiaoliang
Shanika Karunasekera
Egemen Tanin
+ Self-supervised learning unveils change in urban housing from street-level images 2023 Steven Stalder
Michele Volpi
Nicolas Büttner
Stephen Law
Kenneth Harttgen
Esra Süel
+ Neural Embeddings of Urban Big Data Reveal Emergent Structures in Cities 2021 Chao Fan
Yang Yang
Ali Mostafavi
+ PDF Chat Leveraging an Efficient and Semantic Location Embedding to Seek New Ports of Bike Share Services 2020 Yuan Wang
Chenwei Wang
Yinan Ling
Keita Yokoyama
Hsin-Tai Wu
Yi Fang
+ Leveraging an Efficient and Semantic Location Embedding to Seek New Ports of Bike Share Services 2020 Yuan Wang
Chenwei Wang
Yinan Ling
Keita Yokoyama
Hsin-Tai Wu
Yi Fang
+ PDF Chat GEOGRAPHIC RATEMAKING WITH SPATIAL EMBEDDINGS 2021 Christopher Blier-Wong
Hélène Cossette
Luc Lamontagne
Étienne Marceau

Works Cited by This (10)

Action Title Year Authors
+ PDF Chat Rethinking the Inception Architecture for Computer Vision 2016 Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jon Shlens
Zbigniew Wojna
+ PDF Chat Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States 2017 Timnit Gebru
Jonathan Krause
Yilun Wang
Duyun Chen
Jia Deng
E Aiden
Li Fei-Fei
+ Learning from Discovering: An unsupervised approach to Geographical Knowledge Discovery using street level and street network images. 2019 Stephen Law
Mateo Neira
+ PDF Chat Predicting Twitter User Socioeconomic Attributes with Network and Language Information 2018 Νικόλαος Αλέτρας
Benjamin Paul Chamberlain
+ PDF Chat Learning Deep Structure-Preserving Image-Text Embeddings 2016 Liwei Wang
Yin Li
Svetlana Lazebnik
+ PDF Chat Tile2Vec: Unsupervised Representation Learning for Spatially Distributed Data 2019 Neal Jean
Sherrie Wang
Anshul Samar
George Azzari
David B. Lobell
Stefano Ermon
+ PDF Chat Deep multimodal embedding: Manipulating novel objects with point-clouds, language and trajectories 2017 Jaeyong Sung
Ian Lenz
Ashutosh Saxena
+ Using Social Media to Measure Labor Market Flows 2014 Dolan Antenucci
Michael Cafarella
Margaret C. Levenstein
Christopher Ré
Matthew D. Shapiro
+ Distributed Representations of Words and Phrases and their Compositionality 2013 Tomáš Mikolov
Ilya Sutskever
Kai Chen
Greg S. Corrado
Jeffrey Dean
+ node2vec: Scalable Feature Learning for Networks 2016 Aditya Grover
Jure Leskovec