A method for identifying urban functional areas considering the correlation of multimodal data feature:A case study of Wuhan
Urban functional zones play an important role in urbanization,and modeling and analyzing them are vital for planning urban land layout and optimizing land use patterns.However,traditional research often relies on historical planning data and administrative units,hindering the accurate representation of modern urban information and functional structure.Multimodal data,a frontier in geographic information science,offers advantages like large volume,strong present situation,multimodal,low cost,covering detailed and comprehensive urban spatial-temporal information.Despite these benefits,challenges like low utilization and significant loss of associated features persist.This paper addresses these challenges by proposing a method for identifying urban functional areas that considers the correlation of multimodal data features.This paper takes Wuhan city center area as an example and constructs an urban functional area recognition framework based on street view image data and interest point data.First,the study area is divided into 500×500m grid cells,and methods such as FCN image semantic segmentation,Place2Vec natural language model,IDW spatial interpolation,and others are used to extract multidimensional urban functional features.Validation is done through methods like POI richness,cluster analysis,DC coefficient curves,and spatial distribution of scene elements.Second,nuclear typical correlation analysis is used to extract correlation features of multimodal data and analyze the relationships between different shared spatial dimensions and correlation.Finally,a random forest functional area identification model is constructed,incorporating built environment features,socio-economic semantic features,splicing features,and correlation features.The results are analyzed using methods such as confusion matrices,feature importance score calculations,and real maps.In the experiment of extracting multi-dimensional urban functional features in this paper,the clustering results and POI richness are compared and analyzed with the real map,and found to be basically consistent with the actual situation.through the DC coefficient curves and the spatial distribution of scene elements,it can be seen that the spatial interpolation results in this paper are good,and the results of the semantic segmentation basically conform to the actual situation.In the experiment of extraction and analysis of associated features,through the visual display of related variables,it can be found that there are obvious differences before and after the fusion of the two kinds of data.Through the study of shared spatial dimensions and correlation,30 dimensions are finally chosen as the shared feature spatial dimension.In the functional area identification model experiment,through the comparative analysis of multiple models constructed by different features,it can be clearly found that multimodal data has a driving effect on the improvement of identification accuracy compared with single data.On this basis,the input of associated features further improves the recognition accuracy by 3.4%and achieves the highest score in the feature importance score calculation.In the confusion matrix of the recognition results,it is found that the accuracies of the five site types are all above 70%.The full exploitation and use of multimodal data are of great significance for the modeling and analysis of urban functional zones.In this paper,we constructed an urban functional area identification method based on the KCCA data fusion method,which takes into account the feature association of multimodal data.The main conclusions from the experimental analysis are as follows:(1)The semantic features extracted in this paper can accurately express urban functions.The results of multi-method validation experiments fully prove the validity of the socio-economic semantic feature extraction method in this paper.It also shows the feasibility of extracting socio-economic semantic features using the Place2vec model,and its potential semantics have some correlation with the urban functions.In addition to verifying the validity of the semantic feature extraction method of the urban scene in this paper,it is found that in the distribution of the elements,the semantic features of human-induced and human-induced scenes account for the highest proportion of the six elements.Among the six highest elements,the proportion of man-made features and natural features is balanced,which also shows the importance of natural features in the development of urbanization.(2)The KCCA data fusion method can improve the performance of the urban functional area identification framework.The results show that data splicing can improve the recognition accuracy compared with single feature,while multi-feature data fusion,i.e.,the method in this paper,can further improve the recognition accuracy compared with data splicing.This fully proves that the KCCA data fusion method has better performance than pure data splicing and also has a better ability to express the functional characteristics of the city.In addition,the data samples used in this paper have the problem of uneven distribution,but the recognition accuracy of the five land use types is more consistent,indicating that the framework of this paper has enough adaptability to the problem of uneven distribution of data samples.
urban functionmultimodal data fusionfeature correlationkernel typical correlation analysisstreet view imagePOI