大数据2024,Vol.10Issue(5) :109-122.DOI:10.11959/j.issn.2096-0271.2024060

一种融合注意力机制的CNN-BiGRU磁盘故障预测方法研究

Research on a CNN-BiGRU disk fault prediction method integrating attention mechanism

王艳 刘亚东 皮婵娟 施君豪
大数据2024,Vol.10Issue(5) :109-122.DOI:10.11959/j.issn.2096-0271.2024060

一种融合注意力机制的CNN-BiGRU磁盘故障预测方法研究

Research on a CNN-BiGRU disk fault prediction method integrating attention mechanism

王艳 1刘亚东 1皮婵娟 1施君豪1
扫码查看

作者信息

  • 1. 华东交通大学软件学院,江西 南昌 330013
  • 折叠

摘要

磁盘作为重要的存储介质,一旦出现故障很可能会导致存储数据丢失,给个人及企业带来难以估量的损失.现有磁盘故障预测模型存在不能很好地平衡磁盘数据样本、未充分利用磁盘数据的时序特性等问题.以Backblaze云存储公司公布的真实磁盘数据为研究对象,提出了一种融合注意力机制的卷积神经网络(CNN)和双向门控循环单元(BiGRU)网络的磁盘故障预测模型.在数据预处理方面,采用负采样与焦点损失函数来平衡正负样本,利用CNN进行特征提取,并结合BiGRU网络来有效地处理时序数据.通过融合注意力机制,能够让模型快速地捕捉更多关键特征信息,将筛选出的特征与数据输入模型进行训练.通过对比其他故障预测模型,本文提出的模型在精确率等4个评价指标上均有1%~7%的性能提升,为提高磁盘存储的可靠性提供了有力的支撑.

Abstract

Disk,as a crucial storage medium,can result in significant data loss if it malfunctions,causing immeasurable losses for individuals and businesses.Existing models for predicting disk failures have problems such as imbalanced disk data samples and underutilization of the temporal characteristics of the data.In this study,we focused on real disk data provided by the Backblaze cloud storage company and proposed a disk failure prediction model that combines a convolutional neural network(CNN)with a bidirectional gated recurrent unit(BiGRU)network,incorporating an attention mechanism.In terms of data preprocessing,we employed negative sampling and a focal loss function to balance positive and negative samples.Subsequently,we utilized CNN for feature extraction and combined it with BiGRU to effectively handle temporal data.The integration of an attention mechanism enables the model to quickly capture more critical feature informations.The selected features were then trained with the input data into the model.Compared to other fault prediction models,the proposed model in this paper demonstrates a performance improvement of 1%to 7%on four evaluation indicators,such as precision.This provides a robust support for enhancing disk storage reliability.

关键词

注意力机制/磁盘故障预测/双向门控循环单元/卷积神经网络/焦点损失函数

Key words

attention mechanism/disk failure prediction/convolutional neural network/bidirectional gated recurrent unit/focal loss function

引用本文复制引用

基金项目

国家自然科学基金项目(61962020)

上海市智能信息处理重点实验室开放基金项目(IIPL201910)

江西省教育厅项目(GJJ2200640)

出版年

2024
大数据
人民邮电出版社

大数据

CSTPCD
ISSN:2096-0271
段落导航相关论文