首页|改进关联规则算法在自然资源云中的应用研究

改进关联规则算法在自然资源云中的应用研究

扫码查看
针对自然资源信息管理分散、网络安全防御能力弱,以及难以追踪溯源威胁攻击行为等问题,本研究在自然资源云中建立了一套安全防护体系,用以整合网络安全资源,强化网络安全态势感知能力,做到攻击敏捷预测、快速回溯.安全防护体系工作效能的提升,核心在于其安全组件检测引擎模块中关联规则算法的改进.首先,在数据采集阶段,通过预处理将威胁告警数据转换为可供机器处理的标准数据格式;其次,在矩阵计算阶段,使用MapReduce分布式计算框架提升频繁项集的处理效率;最后,以Apriori算法为蓝本,通过单次扫描锁定频繁k 项集范围、矩阵向量内积运算、减少冗余候选项集生成三项措施进行算法改进.实验仿真表明:在处理同样容量网络安全多源数据集合,并在相同维度的关联规则矩阵下,本算法处理效率较经典Apriori算法提升 3 倍以上;随着输入数据集合瞬时容量的逐渐扩增,本算法的时间复杂度稳定,并为增量挖掘算法的一半以下.研究成果可以实现自然资源部网络安全防护工作从传统的"被动挨打"转向"主动防御"的新局面.
Application of improved association rule algorithm in Natural Resource Cloud
According to the"Overall Informatization Construction Plan of the Ministry of Natural Resources"issued by the Ministry of Natural Resources,there is a need to enhance security protection measures for the external network of natural resources.This includes further improving the protection and construction of security management centers,secure computing environments,secure communication networks,secure area boundaries,and enhancing capabilities related to trusted verification,data security,active defense,security detection,notification and early warning,and emergency response.A security protection system has been established in the Natural Resources Cloud to integrate network security resources and enhance network security situational awareness capabilities.This addresses issues such as decentralized management of security resources,weak network security defense capabilities,and challenges in tracking and tracing threat attacks by the Ministry of Natural Resources.The goal is to achieve agile attack prediction and fast backtracking.To improve the work efficiency of the security protection system,the association rule algorithm in its security component detection engine module is enhanced.The improved algorithm initially converts threat alarm data into a standard machine-processable format during the data collection stage.Secondly,in the matrix calculation phase,the MapReduce distributed computing framework is used to improve the processing efficiency of frequent itemsets.Finally,three measures were taken to improve the algorithm based on the Apriori algorithm,including locking the range of frequent k-term sets in a single scan,matrix vector inner product operation,and reducing the generation of redundant candidate sets.Following the algorithm improvement,it is encapsulated in the algorithm engine component of the Natural Resource Cloud detection engine module,further enhancing the security protection capability of the Natural Resources Department.Experimental simulations indicate that the improved algorithm enhances processing efficiency by over three times compared to the classic Apriori algorithm when dealing with multi-source datasets with the same capacity network security and under the same dimension of association rule matrix.Compared to the classic Apriori algorithm,this algorithm unifies the format of data elements through data preprocessing during the data collection stage,reduces processing time using the MapReduce processing framework,and the dataset has been reduced through distributed parallel processing architecture and cloud computing.Compared to the incremental mining algorithm,this algorithm further shortens the time to process frequent k-item sets through three improvement measures.Although the incremental mining algorithm adopts the MapReduce processing framework,it frequently scans the global transaction matrix without optimizing the transaction matrix operation method.Its time complexity is still more than twice that of the algorithm proposed in this paper,which still leads to a high execution time of the algorithm.Therefore,the algorithm proposed in this paper demonstrates superior processing performance.In conclusion,the application of improved algorithms has achieved a new situation in the network security protection work of the Ministry of Natural Resources,transitioning from the traditional"passive attack"to"active defense".

Natural Resource Cloudassociation rulesMapReducefrequent itemsetsApriorinetwork security

李佳临、邬阳、魏奇、赵雯雯、李芳芳、陈卉

展开 >

自然资源部信息中心,北京 100812

自然资源云 关联规则 MapReduce 频繁项集 Apriori 网络安全

自然资源信息化运行维护项目

121101000000180042

2024

地理信息世界
中国地理信息产业协会 黑龙江测绘地理信息局

地理信息世界

CSTPCD
影响因子:0.826
ISSN:1672-1586
年,卷(期):2024.31(1)
  • 25