首页|Multi-modal contrastive learning of urban space representations from POI data

Multi-modal contrastive learning of urban space representations from POI data

扫码查看
Understanding and characterising urban environment is crucial for urban planning and geospatial analysis. One common approach to this process is through using point of interest (POI) data, which offers rich information about the spatial-semantic characteristics of urban spaces. Existing methods for learning urban space representations from POIs face several limitations, including reliance on predefined spatial units, ignorance of POI location information, underutilisation of POI semantic attributes, and computational inefficiencies. To address these gaps, we propose CaLLiPer (Contrastive Language-Location Pre-training), a novel approach that directly embeds continuous urban spaces into vector representations that capture the spatial and semantic characteristics of urban environment. This model leverages multimodal contrastive learning to align location embeddings with textual descriptions of POIs, bypassing the need for complex training corpus construction and negative sampling. Applying CaLLiPer to learning urban space representations in London, UK, we demonstrate 5-15% improvement in predictive performance for land use classification and socioeconomic mapping tasks compared to state-of-theart methods. Visualisations and correlation analysis of the learned representations further verify our model's ability to capture spatial variations in urban semantics with high accuracy and fine resolution. Moreover, CaLLiPer achieves reduced training time, showcasing its efficiency and scalability. Additional experiments demonstrate the robustness of our model across different spatial scales and urban context. Notably, the experiment on Singapore showed an improvement of over 20%. This work also provides a promising pathway for scalable, semantically rich urban space representation learning that can support the development of geospatial foundation models. The implementation code is available at https://github.com/xlwang233/CaLLiPer.

Representation learningUrban environmentPoints of interestContrastive learningGeoAIGeospatial foundation modelsCITIES

Wang, Xinglei、Cheng, Tao、Law, Stephen、Zeng, Zichao、Yin, Lu、Liu, Junyuan

展开 >

University College London Department of Civil Environmental and Geomatic Engineering

University College London Department of Geography

University College London Department of Civil Environmental and Geomatic Engineering||University College London Department of Civil Environmental and Geomatic Engineering

Univ Surrey

展开 >

2025

Computers,environment and urban systems

Computers,environment and urban systems

ISSN:0198-9715
年,卷(期):2025.120(Sep.)
  • 72