首页|基于约束轨迹聚类的事件日志批量修复方法

基于约束轨迹聚类的事件日志批量修复方法

扫码查看
企业业务运行过程中会产生大量的事件日志,事件日志是业务过程挖掘、监控和优化的基础和保障.然而,原始的事件日志由于缺乏结构及过于灵活导致难以直接应用于过程挖掘,对事件日志进行修复势在必行.现有日志修复方法需要结合过程模型逐条检查轨迹,并对各类异常行为采用不同策略进行修复,导致修复效率低下、适用性不强.针对上述问题,利用轨迹聚类方法,结合文本相似度指标,提出一种基于约束轨迹聚类的批量日志修复方法.该方法通过对轨迹聚类的每个步骤施加约束条件,使得单个簇包含作为簇中心的拟合轨迹以及与该拟合轨迹相似的异常轨迹,且中心轨迹即为异常轨迹的修复结果.该方法不但无需分析异常行为,直接获得修复后的拟合轨迹,而且实现了对于异常轨迹的批量修复.实验表明,该方法在脱离过程模型并保证高修复准确率的前提下,能够在噪音过滤之后,有效且高效地对事件日志进行批量修复.
Batch repair of event logs based on constrained trace clustering
A large amount of event logs are generated during the operation of the enterprise business,which are the foundation and guarantee for the mining,monitoring and optimization of business process.However,original event logs are so less structured and more flexible that it is difficult to apply them to process mining directly.Hence,it is imperative to repair event logs.The existing log repair methods need to align the traces one by one with the process model,and different kinds of deviation behaviors should be repaired using different means,which lead to low repair efficiency and weak applicability.To resolve the above-mentioned problems,a batch log repair method based on con-strained trace clustering was proposed which combined trace clustering methods and text similarity metrics.By im-posing constraints on each procedure of trace clustering,one single cluster included the fitting trace as the cluster center and the unfitting traces similar to the fitting trace,and the central trace was considered as the repair re-sult.This method could not only directly obtain the repaired fitting traces without analyzing the deviations,but also realized the batch repair of the unfitting traces.Experiment results showed that the proposed method could filter the noise and then repair the event logs in batch,without process models and ensuring high repair accuracy.

trace clusteringtext similaritylog repairevent lognoise filtering

田银花、李昕燃、武于皓、韩咚、杜玉越、王路

展开 >

山东科技大学智能装备学院,山东 泰安 271000

山东科技大学继续教育学院,山东 泰安 271000

山东科技大学计算机科学与工程学院,山东 青岛 266590

轨迹聚类 文本相似度 日志修复 事件日志 噪音过滤

国家自然科学基金资助项目教育部人文社会科学研究青年基金资助项目教育部人文社会科学研究青年基金资助项目山东省自然科学基金资助项目山东省自然科学基金资助项目山东省重点研发计划(软科学)资助项目山东省习近平新时代中国特色社会主义思想研究中心山东科技大学山东数字经济研究基地资助项目

7210113721YJCZH15020YJCZH159ZR2021MF117ZR2022QF0202022RKY02009SDSZJD202314

2024

计算机集成制造系统
中国兵器工业集团第210研究所

计算机集成制造系统

CSTPCD北大核心
影响因子:1.092
ISSN:1006-5911
年,卷(期):2024.30(8)
  • 1