Research Progress of Construction of an Association Network of Open Scientific Data Catalogues for the Earth Surface System
With the rise of a new generation of information technologies, such as earth observation, IoT monitoring, the Internet, and 5G, and the deepening of the concept of open data sharing, openly shared earth surface system data on the web has grown explosively, and open big data on the earth surface system has taken shape. Rapid discovery, mining, and utilization of massive open scientific data of the earth surface system (hereinafter referred to as "surface system") is a new development trend and frontier research direction in surface system scientific data sharing in the era of big data. Open scientific data of the surface system is organizationally decentralized, multi-source, heterogeneous, multi-modal, and multi-type, and usually exists in the form of thematic sharing websites, data services, metadata, and journal papers (especially data papers). Developing open data mining methods adapted to these different modalities and analyzing the sharing quality of the data are key scientific issues in making full use of these data. An association network takes metadata Uniform Resource Identifiers (URIs) as nodes, the semantic relationships between metadata as edges, and the strength of association between nodes as edge values, and thereby provides powerful support for semantic interconnection and knowledge discovery of open scientific data of the surface system. This paper investigates and analyzes the current development status, basic features, and construction technology of open scientific data association networks of the surface system. We select typical association networks and related literature, both domestic and international, as the research objects. Based on nine selected mainstream association networks and more than 200 related publications, we make a comparative analysis of the basic features of the association networks and their construction technology. In terms of basic features, the data sources, degree of automation, and updating methods of the association networks are analyzed; in terms of construction technology, the selection of association indexes is introduced, and the methods of extracting, representing, and calculating the features of open scientific data of the surface system are discussed. Finally, recommendations for the future construction of surface system association networks are put forward, including constructing a high-quality, full-coverage ontology of surface system open scientific data, considering the complex "time-space-content" relationships of geoscientific knowledge and reasoning over them, establishing methods for multi-language surface system open data association networks, and enhancing the effectiveness of applications of the surface system open scientific data association network.
Keywords: earth surface system; association network; data catalog; characteristic calculation; metadata; evaluation of the level of sharing; data ontology; associated indicators
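
The abstract describes an association network as metadata URIs for nodes, semantic relationships for edges, and association strength for edge values. The following is a minimal Python sketch of that structure, not the method of any surveyed system. The example URIs, the relation name "sharesKeywords", and the use of Jaccard similarity over keyword sets as the association strength are all illustrative assumptions.

```python
from dataclasses import dataclass, field
from itertools import combinations


@dataclass
class AssociationNetwork:
    """Undirected weighted graph: nodes are metadata URIs,
    edges carry a semantic relation label and an association strength."""
    adjacency: dict = field(default_factory=dict)

    def add_node(self, uri):
        self.adjacency.setdefault(uri, {})

    def add_edge(self, uri_a, uri_b, relation, strength):
        self.add_node(uri_a)
        self.add_node(uri_b)
        self.adjacency[uri_a][uri_b] = (relation, strength)
        self.adjacency[uri_b][uri_a] = (relation, strength)

    def neighbors(self, uri, min_strength=0.0):
        """Return (neighbor URI, relation, strength), strongest first."""
        items = self.adjacency.get(uri, {}).items()
        ranked = sorted(items, key=lambda kv: kv[1][1], reverse=True)
        return [(n, rel, s) for n, (rel, s) in ranked if s >= min_strength]


def jaccard(a, b):
    """Assumed association index: keyword overlap between two records."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def build_network(records, relation="sharesKeywords"):
    """Link every pair of records whose keyword sets overlap."""
    net = AssociationNetwork()
    for uri in records:
        net.add_node(uri)
    for (uri_a, kw_a), (uri_b, kw_b) in combinations(records.items(), 2):
        strength = jaccard(kw_a, kw_b)
        if strength > 0:
            net.add_edge(uri_a, uri_b, relation, strength)
    return net


# Hypothetical metadata records: URI -> keyword set.
records = {
    "https://example.org/meta/1": {"soil", "moisture", "tibet"},
    "https://example.org/meta/2": {"soil", "temperature", "tibet"},
    "https://example.org/meta/3": {"glacier", "inventory"},
}
net = build_network(records)
print(net.neighbors("https://example.org/meta/1"))
```

In a real catalogue network, the keyword-overlap index would be replaced by whichever association indexes and feature calculations the network adopts, for example spatial, temporal, or content similarity.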