首页|基于大数据空间标记的惯常环境技术定义

基于大数据空间标记的惯常环境技术定义

扫码查看
旅游是在非惯常环境的活动,但非惯常环境要由惯常环境来定义.无论学术上还是技术上,惯常环境自身也都缺少相对统一和明确的定义,使得我国旅游统计执行错乱频生,各类负面舆情不断.文章探讨了惯常环境技术定义的国际实践、原则和推荐的表达,认为惯常环境是由以居所为中心和以职学地为中心的两类惯常空间的合集,一个以不等圆组成的不受行政区划限定的不规则区域.研究还发现:1)为减少位置噪声点干扰使得簇质点偏移,对两个惯常空间进行具有噪声的基于密度的空间聚类时扫描半径宜限定在1km以内;2)以居所为中心的惯常空间不超过1个,以职学地为中心的惯常空间个数小于等于2,根据位置点衰减情况判断,前者空间半径以40 km为宜,后者空间半径以2 km~3 km更为合理;3)无需用全样本位置数据进行标记,通过代表性用户出游率或抵达率扩样实现总体推算;4)不能标记惯常环境的用户,可假定其出游率或抵达率与能标记的用户相同,即符合同一性假定.该研究可为大数据的旅游统计规范化应用提供技术参考,为基于大数据的旅游流研究夯实了基础.
Understanding the Usual Environment in Tourism:A Technical Definition Based on Big Data Space Marking
The term"tourism"refers to various forms of activities that take place in an unusual environment.This"unusual"environment needs to be defined in terms of its opposite,i.e.,the"usual"environment.However,the lack of a relatively homogeneous and unambiguous description of the usual environment,both academically and technically,has led to frequent and repeated mistakes in the implementation of tourism statistics in China.As a result,there has been a continual flood of negative public opinion and a variety of controversies relating to the concept of tourism.This study attempts to address the aforementioned issues through the following procedure.First,we present a review of the international practices,principles,and recommended expressions of the"usual environment"in a technical context.Second,we identify an individual's usual environment as an ensemble of two distinct types of usual spaces,namely the direct vicinity of a person's residential address and the region surrounding a person's place of employment or education.Geographically speaking,the usual environment is an irregular area made up of uneven circles without the limitation of administrative subdivisions.Third,based on the labeling of big data,we employ several spatial clustering algorithms to label the usual environment,and apply the method of inversion and expansion sampling for the monitoring of tourism flows.Finally,we present a preliminary determination of the feasible radius for the two types of usual spaces by comparing the operational parameters of different scanning radii in the density-based spatial clustering of applications with noise(DBSCAN)algorithm.The findings reveal that the scanning radius should be restricted to less than 1 km for the optimal DBSCAN clustering of two usual spaces,as this will minimize the positional noise interference that leads to a mean shift.Moreover,there is typically no more than a single usual residential space,and the number of usual locations relating to a person's place of employment or education is generally only one or two.Based on the attenuation of location points,the usual environment for a place of residence has a maximum radius of 40 km,whereas that for a place of employment or education has a maximum radius of 2 km~3 km.An inference about the statistical population is reached by expanding the sample space to include representative user travelling rates or arrival rates,rather than labeling with a full sample of location data.In addition,it can be assumed that users who are unable to identify their usual environments have the same travelling or arrival rates as those who are able to specify their usual locations,which is consistent with the assumption of homogeneity.The findings of this study serve as a reference for the standardized and consistent application of big data in tourism statistics,and reinforce the basis for big data-based research on tourism flows.Several significant policy and practical implications can be determined from these findings.

usual environmenttechnical definitionspace marking

马仪亮、宋彦亭

展开 >

中国旅游研究院,北京 100005

农业农村部规划设计研究院,北京 100125

惯常环境 技术定义 空间标记

科技创新2030重大项目

2021ZD0111403

2024

旅游学刊
北京联合大学旅游学院

旅游学刊

CSTPCDCSSCICHSSCD北大核心
影响因子:2.013
ISSN:1002-5006
年,卷(期):2024.39(5)
  • 18