Multi-modal contrastive learning of urban space representations from POI data

扫码查看

原文链接

NETL
NSTL
Elsevier

外文摘要：Understanding and characterising urban environment is crucial for urban planning and geospatial analysis. One common approach to this process is through using point of interest (POI) data, which offers rich information about the spatial-semantic characteristics of urban spaces. Existing methods for learning urban space representations from POIs face several limitations, including reliance on predefined spatial units, ignorance of POI location information, underutilisation of POI semantic attributes, and computational inefficiencies. To address these gaps, we propose CaLLiPer (Contrastive Language-Location Pre-training), a novel approach that directly embeds continuous urban spaces into vector representations that capture the spatial and semantic characteristics of urban environment. This model leverages multimodal contrastive learning to align location embeddings with textual descriptions of POIs, bypassing the need for complex training corpus construction and negative sampling. Applying CaLLiPer to learning urban space representations in London, UK, we demonstrate 5-15% improvement in predictive performance for land use classification and socioeconomic mapping tasks compared to state-of-theart methods. Visualisations and correlation analysis of the learned representations further verify our model's ability to capture spatial variations in urban semantics with high accuracy and fine resolution. Moreover, CaLLiPer achieves reduced training time, showcasing its efficiency and scalability. Additional experiments demonstrate the robustness of our model across different spatial scales and urban context. Notably, the experiment on Singapore showed an improvement of over 20%. This work also provides a promising pathway for scalable, semantically rich urban space representation learning that can support the development of geospatial foundation models. The implementation code is available at https://github.com/xlwang233/CaLLiPer.

外文关键词：

Representation learningUrban environmentPoints of interestContrastive learningGeoAIGeospatial foundation modelsCITIES

作者：

Wang, Xinglei、Cheng, Tao、Law, Stephen、Zeng, Zichao、Yin, Lu、Liu, Junyuan

展开 >

作者单位：

University College London Department of Civil Environmental and Geomatic Engineering

University College London Department of Geography

University College London Department of Civil Environmental and Geomatic Engineering||University College London Department of Civil Environmental and Geomatic Engineering

Univ Surrey

展开 >

出版年：

2025

DOI：

10.1016/j.compenvurbsys.2025.102299

Computers，environment and urban systems

ISSN：0198-9715

年,卷(期)：2025.120(Sep.)

参考文献量72