太赫兹科学与电子信息学报2024,Vol.22Issue(9) :952-958.DOI:10.11805/TKYDA2024034

基于Q?learning的变电站无线传感器网络路由算法

Q-learning based routing algorithm for substation wireless sensor networks

赵锴 沙杰 丛尤嘉
太赫兹科学与电子信息学报2024,Vol.22Issue(9) :952-958.DOI:10.11805/TKYDA2024034

基于Q?learning的变电站无线传感器网络路由算法

Q-learning based routing algorithm for substation wireless sensor networks

赵锴 1沙杰 1丛尤嘉1
扫码查看

作者信息

  • 1. 国网上海市电力公司 嘉定供电公司,上海 201800
  • 折叠

摘要

电力系统中的无线传感器网络(WSN)可以对工作中设备的状态和环境数据进行实时感知采集,是一种推动智能电网发展的重要技术.针对变电站场景中WSN的网络存活时间、传输时延、传输丢包率上的特殊要求,提出了一种基于强化学习的WSN路由方案.将数据包在WSN的发送过程抽象为一个马尔科夫决策过程(MDP),根据优化目标合理设置奖励,并给出了基于Q-learning的最优路由求解方法.仿真结果与数值分析表明,所提方案在网络存活时间、传输时延、丢包率等方面的性能均优于基准方案.

Abstract

Wireless Sensor Network(WSN)in the power system can sense and collect the status of the working equipment and environmental data in real time,which is an important technology to promote the development of smart grid.Aiming at the special requirements of network survival time,transmission delay,and transmission packet loss rate of WSN in substation scenarios,a WSN routing scheme based on reinforcement learning is proposed.The sending process of packets in WSN is abstracted as a Markov Decision Process(MDP),the rewards are reasonably set according to the optimization objective,and the optimal routing solution method based on Q-learning is given.Simulation results and numerical analysis show that the proposed scheme outperforms the benchmark scheme in terms of network survival time,transmission delay,and packet loss rate.

关键词

变电站无线传感网/路由策略/马尔科夫决策过程/Q-learning算法/网络性能优化

Key words

Wireless Sensor Networks for substations/routing policy/Markov Decision Process(MDP)/Q-learning/network performance optimization

引用本文复制引用

基金项目

国网上海市电力公司科技项目(B30931230003)

出版年

2024
太赫兹科学与电子信息学报
中国工程物理研究院电子工程研究所

太赫兹科学与电子信息学报

CSTPCD
影响因子:0.407
ISSN:2095-4980
参考文献量4
段落导航相关论文