首页|一种基于Q-learning强化学习的导向性处理器安全性模糊测试方案

一种基于Q-learning强化学习的导向性处理器安全性模糊测试方案

扫码查看
针对处理器安全性模糊测试在进行细粒度变异时遗传算法存在一定的盲目性,易使生成的测试用例触发相同类型漏洞的问题,提出了一种基于Q-learning强化学习的导向性处理器安全性模糊测试方案.通过测试用例的状态值和所触发的漏洞类型对应的权值构造奖励函数,使用强化学习指导生成具有针对性和导向性的测试用例,快速地触发不同类型的漏洞.在Hikey970平台上的实验验证了基于ARMv8的测试用例生成框架的有效性,并且相较于传统使用遗传算法作为反馈的策略,本文方案在相同时间内生成有效测试用例的的数量多19.15%,发现漏洞类型的数量多80.00%.
A guided processor security fuzz testing scheme based on Q-learning reinforcement learning
A guided processor security fuzz testing scheme based on Q-learning reinforcement learning was proposed to address the issue of blindness in genetic algorithms during fine-grained mutations for processor security fuzz testing,which often leads to test cases triggering the same type of vulnerability.By constructing a reward function using the state values of test cases and the weights corresponding to the types of triggered vulnerabilities,reinforcement learning was adopted to guide the generation of targeted and directional test cases,quickly triggering a variety of vulnerabilities.Experiments on the Hikey970 platform verified the effectiveness of the ARM v8-based test case generation framework.Compared with the traditional strategy using genetic algorithms as feedback,this scheme generates 19.15%more effective test cases and identifies 80.00%more types of vulnerabilities within the same time frame.

processor vulnerability detectionfuzz testingQ-learning reinforcement learningARMv8branch prediction vulnerability

崔云凯、张伟

展开 >

北京信息科技大学计算机学院,北京 102206

北京市未来区块链与隐私计算高精尖创新中心,北京 102206

国家经济安全预警工程北京实验室,北京 102206

处理器漏洞检测 模糊测试 Q-learning强化学习 ARMv8 分支预测类漏洞

2024

北京信息科技大学学报(自然科学版)
北京信息科技大学

北京信息科技大学学报(自然科学版)

影响因子:0.363
ISSN:1674-6864
年,卷(期):2024.39(4)