首页|Multi-modal contrastive learning of urban space representations from POI data
Multi-modal contrastive learning of urban space representations from POI data
扫码查看
点击上方二维码区域,可以放大扫码查看
原文链接
NETL
NSTL
Elsevier
Understanding and characterising urban environment is crucial for urban planning and geospatial analysis. One common approach to this process is through using point of interest (POI) data, which offers rich information about the spatial-semantic characteristics of urban spaces. Existing methods for learning urban space representations from POIs face several limitations, including reliance on predefined spatial units, ignorance of POI location information, underutilisation of POI semantic attributes, and computational inefficiencies. To address these gaps, we propose CaLLiPer (Contrastive Language-Location Pre-training), a novel approach that directly embeds continuous urban spaces into vector representations that capture the spatial and semantic characteristics of urban environment. This model leverages multimodal contrastive learning to align location embeddings with textual descriptions of POIs, bypassing the need for complex training corpus construction and negative sampling. Applying CaLLiPer to learning urban space representations in London, UK, we demonstrate 5-15% improvement in predictive performance for land use classification and socioeconomic mapping tasks compared to state-of-theart methods. Visualisations and correlation analysis of the learned representations further verify our model's ability to capture spatial variations in urban semantics with high accuracy and fine resolution. Moreover, CaLLiPer achieves reduced training time, showcasing its efficiency and scalability. Additional experiments demonstrate the robustness of our model across different spatial scales and urban context. Notably, the experiment on Singapore showed an improvement of over 20%. This work also provides a promising pathway for scalable, semantically rich urban space representation learning that can support the development of geospatial foundation models. The implementation code is available at https://github.com/xlwang233/CaLLiPer.
Representation learningUrban environmentPoints of interestContrastive learningGeoAIGeospatial foundation modelsCITIES
University College London Department of Civil Environmental and Geomatic Engineering
University College London Department of Geography
University College London Department of Civil Environmental and Geomatic Engineering||University College London Department of Civil Environmental and Geomatic Engineering