Urban2Vec: Incorporating Street View Imagery and POIs for Multi-Modal Urban Neighborhood Embedding

Zhecheng Wang, Haoyuan Li, Ram Rajagopal

Type: Article

Publication Date: 2020-04-03

Citations: 50

DOI: https://doi.org/10.1609/aaai.v34i01.5450

Abstract

Understanding intrinsic patterns and predicting spatiotemporal characteristics of cities require a comprehensive representation of urban neighborhoods. Existing works relied on either inter- or intra-region connectivities to generate neighborhood representations but failed to fully utilize the informative yet heterogeneous data within neighborhoods. In this work, we propose Urban2Vec, an unsupervised multi-modal framework which incorporates both street view imagery and point-of-interest (POI) data to learn neighborhood embeddings. Specifically, we use a convolutional neural network to extract visual features from street view images while preserving geospatial similarity. Furthermore, we model each POI as a bag-of-words containing its category, rating, and review information. Analog to document embedding in natural language processing, we establish the semantic similarity between neighborhood (“document”) and the words from its surrounding POIs in the vector space. By jointly encoding visual, textual, and geospatial information into the neighborhood representation, Urban2Vec can achieve performances better than baseline models and comparable to fully-supervised methods in downstream prediction tasks. Extensive experiments on three U.S. metropolitan areas also demonstrate the model interpretability, generalization capability, and its value in neighborhood similarity analysis.

Locations

Proceedings of the AAAI Conference on Artificial Intelligence - View - PDF
arXiv (Cornell University) - View - PDF

Similar Works

Action	Title	Year	Authors
+	Urban2Vec: Incorporating Street View Imagery and POIs for Multi-Modal Urban Neighborhood Embedding	2020	Zhecheng Wang Haoyuan Li Ram Rajagopal
+ PDF Chat	Urban Region Embedding via Multi-View Contrastive Prediction	2024	Zechen Li Weiming Huang Kai Zhao Min Yang Yongshun Gong Meng Chen
+	Urban Region Embedding via Multi-View Contrastive Prediction	2023	Zechen Li Weiming Huang Kai Zhao Min Yang Yongshun Gong Meng Chen
+ PDF Chat	Region2Vec	2022	Yunlei Liang Jiawei Zhu Wen Ye Song Gao
+	hood2vec: Identifying Similar Urban Areas Using Mobility Networks	2019	Xin Liu Konstantinos Pelechrinis Alexandros Labrinidis
+	hood2vec: Identifying Similar Urban Areas Using Mobility Networks.	2019	Xin Liu Konstantinos Pelechrinis Alexandros Labrinidis
+ PDF Chat	Demo2Vec: Learning Region Embedding with Demographic Information	2024	Ya Wen Yulun Zhou
+	Urban Region Profiling via A Multi-Graph Representation Learning Framework	2022	Yuyu Luo F. Chung Kai Chen
+	Macross: Urban Dynamics Modeling based on Metapath Guided Cross-Modal Embedding	2019	Yunan Zhang Heting Gao Tarek Abdelzaher
+	Region-Wise Attentive Multi-View Representation Learning for Urban Region Embeddings	2023	Weiliang Chan Qianqian Ren
+ PDF Chat	Region Embedding With Intra and Inter-View Contrastive Learning	2022	Liang Zhang Cheng Long Gao Cong
+ PDF Chat	Multimodal Contrastive Learning of Urban Space Representations from POI Data	2024	Xinglei Wang Tao Cheng Stephen C.K. Law Zichao Zeng Lu Yin Junyuan Liu
+ PDF Chat	MuseCL: Predicting Urban Socioeconomic Indicators via Multi-Semantic Contrastive Learning	2024	Xixian Yong Xiao Zhou
+ PDF Chat	hex2vec	2021	Szymon Woźniak Piotr Szymański
+ PDF Chat	MuseCL: Predicting Urban Socioeconomic Indicators via Multi-Semantic Contrastive Learning	2024	Xixian Yong Xiao‐Hua Zhou
+ PDF Chat	Towards Effective Next POI Prediction: Spatial and Semantic Augmentation with Remote Sensing Data	2024	Jiang Nan Haitao Yuan Jianing Si Minxiao Chen Shangguang Wang
+	Unsupervised Learning of Parsimonious General-Purpose Embeddings for User and Location Modelling	2017	Jing Yang Carsten Eickhoff
+	Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged Photos	2023	Ling Chen Dandan Lyu Shanshan Yu Gencai Chen
+	Learning from #Barcelona Instagram data what Locals and Tourists post about its Neighbourhoods	2018	Raúl Gómez Lluís Gómez Jaume Gibert Dìmosthenis Karatzas
+	Learning from #Barcelona Instagram data what Locals and Tourists post about its Neighbourhoods	2018	Raúl Gómez Lluís Gómez Jaume Gibert Dìmosthenis Karatzas

Works That Cite This (17)

Action	Title	Year	Authors
+ PDF Chat	Knowledge-infused Contrastive Learning for Urban Imagery-based Socioeconomic Prediction	2023	Yu Liu Xin Zhang Jingtao Ding Yanxin Xi Yong Li
+	Combining discrete choice models and neural networks through embeddings: Formulation, interpretability and performance	2023	Ioanna Arkoudi Rico Krueger Carlos Lima Azevedo Francisco C. Pereira
+ PDF Chat	City Foundation Models for Learning General Purpose Representations from OpenStreetMap	2024	Pasquale Balsebre Weiming Huang Gao Cong Yi Li
+ PDF Chat	UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web	2024	Y. H. Yan Haomin Wen Siru Zhong Wei Chen Chen Haodong Qingsong Wen Roger Zimmermann Yuxuan Liang
+ PDF Chat	Urban Region Representation Learning with Attentive Fusion	2024	Fengze Sun Jianzhong Qi Yanchuan Chang Fan Xiaoliang Shanika Karunasekera Egemen Tanin
+	Self-supervised learning unveils change in urban housing from street-level images	2023	Steven Stalder Michele Volpi Nicolas Büttner Stephen Law Kenneth Harttgen Esra Süel
+	Neural Embeddings of Urban Big Data Reveal Emergent Structures in Cities	2021	Chao Fan Yang Yang Ali Mostafavi
+ PDF Chat	Leveraging an Efficient and Semantic Location Embedding to Seek New Ports of Bike Share Services	2020	Yuan Wang Chenwei Wang Yinan Ling Keita Yokoyama Hsin-Tai Wu Yi Fang
+	Leveraging an Efficient and Semantic Location Embedding to Seek New Ports of Bike Share Services	2020	Yuan Wang Chenwei Wang Yinan Ling Keita Yokoyama Hsin-Tai Wu Yi Fang
+ PDF Chat	GEOGRAPHIC RATEMAKING WITH SPATIAL EMBEDDINGS	2021	Christopher Blier-Wong Hélène Cossette Luc Lamontagne Étienne Marceau

Works Cited by This (10)

Action	Title	Year	Authors
+ PDF Chat	Rethinking the Inception Architecture for Computer Vision	2016	Christian Szegedy Vincent Vanhoucke Sergey Ioffe Jon Shlens Zbigniew Wojna
+ PDF Chat	Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States	2017	Timnit Gebru Jonathan Krause Yilun Wang Duyun Chen Jia Deng E Aiden Li Fei-Fei
+	Learning from Discovering: An unsupervised approach to Geographical Knowledge Discovery using street level and street network images.	2019	Stephen Law Mateo Neira
+ PDF Chat	Predicting Twitter User Socioeconomic Attributes with Network and Language Information	2018	Νικόλαος Αλέτρας Benjamin Paul Chamberlain
+ PDF Chat	Learning Deep Structure-Preserving Image-Text Embeddings	2016	Liwei Wang Yin Li Svetlana Lazebnik
+ PDF Chat	Tile2Vec: Unsupervised Representation Learning for Spatially Distributed Data	2019	Neal Jean Sherrie Wang Anshul Samar George Azzari David B. Lobell Stefano Ermon
+ PDF Chat	Deep multimodal embedding: Manipulating novel objects with point-clouds, language and trajectories	2017	Jaeyong Sung Ian Lenz Ashutosh Saxena
+	Using Social Media to Measure Labor Market Flows	2014	Dolan Antenucci Michael Cafarella Margaret C. Levenstein Christopher Ré Matthew D. Shapiro
+	Distributed Representations of Words and Phrases and their Compositionality	2013	Tomáš Mikolov Ilya Sutskever Kai Chen Greg S. Corrado Jeffrey Dean
+	node2vec: Scalable Feature Learning for Networks	2016	Aditya Grover Jure Leskovec