针对变压器故障识别方法在处理不均衡故障数据时存在较大偏差的问题,构建了一种基于改进轻量级梯度提升机的混合集成模型,用以变压器故障识别.首先,提出一种结合梯度调和损失函数和交叉熵损失函数的改进轻量级梯度提升机(gradient harmonizing mechanism loss and cross entropy loss improved light gradient boosting machine,GCLightGBM),提升模型对数据集中少数样本的关注度.然后,针对GCLightGBM中参数特异性取值影响模型识别能力的问题,提出一种基于GCLightGBM的混合集成模型,进一步提高其准确率的同时,确保模型对现实多变不均衡数据集依然保持良好的准确率.实验结果表明,GCLightGBM可有效解决少数类样本准确率低的问题,整体准确率高达0.911.且针对其他多变不均衡数据集,基于GCLightGBM混合集成模型故障识别方法平均准确率高达0.988.
Transformer Fault Identification Method Based on Improved LightGBM Hybrid Integration Model
There is a significant bias when imbalanced fault data are dealt with by transformer fault identification methods.To solve this problem,we have built a hybrid ensemble model based on an improved lightweight gradient boosting ma-chine.First,we propose a gradient harmonizing mechanism loss and cross entropy loss improved light gradient boosting machine(GCLightGBM).This approach enhances the model's attention to minority samples in the dataset.Then,to ad-dress the issue of parameter specificity affecting the model's recognition capability in GCLightGBM,we propose a hybrid ensemble model based on GCLightGBM.This model further improves accuracy while ensuring good performance on re-al-world varied and imbalanced datasets.The experimental results show that GCLightGBM can effectively solve the problem of low accuracy of minority samples,and the overall accuracy is as high as 0.911.In addition,for other variable and unbalanced datasets,the fault identification method based on GCLightGBM hybrid ensemble model has an average accuracy of 0.988.
transformerunbalanced samplesfault identificationgradient harmonic loss functionStacking integration framework