吉林电力2024,Vol.52Issue(1) :53-56.

基于数据不均衡的CART决策树用电异常检测方法

CART Decision Tree Method Based on Unbalanced Data for Abnormal Electricity Consumption Detection

鞠默欣 周雨馨 唐伟宁 于欢 宋昊燃 倪鹏翔 戚意彬 谢蓓欣
吉林电力2024,Vol.52Issue(1) :53-56.

基于数据不均衡的CART决策树用电异常检测方法

CART Decision Tree Method Based on Unbalanced Data for Abnormal Electricity Consumption Detection

鞠默欣 1周雨馨 1唐伟宁 1于欢 2宋昊燃 1倪鹏翔 1戚意彬 1谢蓓欣1
扫码查看

作者信息

  • 1. 国网吉林省电力有限公司营销服务中心,长春 130062
  • 2. 国网长春供电公司,长春 130021
  • 折叠

摘要

针对现有的用户用电行为数据量庞大,且正负样本不均衡从而导致用电异常检测准确率低的问题,通过合成少数类过采样技术算法增加样本数据,构建正负样本相对均衡的数据集.然后,通过分类回归树决策树模型对重平衡后的样本数据集进行特征提取,从而获得分类结果并对其进行可视化,通过五折交叉验证对该算法进行仿真实现,分析模型的可靠性及稳定性.结果表明,提出的方法提高了用户用电行为异常检测准确率,有效消除了数据不均衡对异常检测准确率的影响.

Abstract

In response to the problem of a large amount of existing user electricity consumption behavior data and imbalanced positive and negative samples,resulting in low accuracy in electricity anomaly detection,this article uses the synthetic minority oversampling technique algorithm to add negative sample data and construct a dataset with relatively balanced positive and negative samples.Then,the classification and regression tree decision tree model is used to extract features from the rebalanced sample dataset,obtaining classification results and visualizing them.The algorithm is simulated and implemented through five fold cross validation to analyze the reliability and stability of the model.The experiment shows that the method proposed in this article effectively achieves abnormal detection of user electricity consumption behavior and improves recognition accuracy.

关键词

数据不均衡/决策树/用电异常检测

Key words

data imbalance/decision tree/abnormal electricity consumption detection

引用本文复制引用

出版年

2024
吉林电力
吉林省电机工程学会,吉林省电力有限公司电力科学研究院

吉林电力

影响因子:0.338
ISSN:1009-5306
参考文献量11
段落导航相关论文