A Domain-Analysis-Based Method for Serial-Parallel Consistency of Structural Linear Static Software

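Serial-parallel consistency of computed results is a necessary condition for the results of parallel CAE software to be trustworthy. However, software development often introduces serial-parallel inconsistency defects. These defects take many forms, their symptoms are coupled with one another, and they are scattered across large volumes of code, which makes serial-parallel consistency a challenge for CAE software. Taking the serial-parallel consistency requirement of structural linear static software as its starting point, this paper addresses the coarse granularity, poor accuracy, high cost, and lack of systematic coverage that the existing "expert knowledge" and "fault location" approaches show when applied to CAE software. It introduces domain analysis, combines it with expert knowledge and dataflow state comparison, and proposes a serial-parallel consistency method suited to structural linear statics, achieving fine-grained, accurate, low-cost, and systematic identification and repair of serial-parallel inconsistency defects in structural linear static software. Tools were built on this method, and the method and tools were applied to SSTA: 8 serial-parallel inconsistency defects were identified and fixed, so that SSTA passes the serial-parallel consistency check on more than 90 real models with strictly identical serial and parallel results. The method and tools also reduced the time needed to locate such a defect from more than two person-days on average to several person-hours.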
Domain Analysis Based Approach to Obtain Identical Results on Varying Number of Processors for Structural Linear Static Software
Obtaining identical results on varying numbers of processors is a prerequisite for the reliability of parallel CAE software. However, the development of parallel CAE software often introduces various types of faults that cause non-identical results. These faults interact with one another to produce the final non-identical results, and they are concealed at many levels of CAE software comprising large amounts of code. This makes it challenging to obtain identical simulation results on varying numbers of processors. When applied to parallel CAE software, traditional approaches such as expert knowledge and fault location suffer from coarse granularity, poor accuracy, high cost, or a lack of systematic coverage. To address this issue, we propose an approach that combines domain analysis with expert knowledge and dataflow state comparison to obtain identical results on varying numbers of processors for structural linear static software. The approach identifies and fixes faults that cause non-identical results in structural linear static software at fine granularity, with high accuracy and at low cost. Based on this approach, we develop a corresponding tool and apply both to SSTA, a structural linear static software package, identifying and fixing eight faults. As a result, SSTA obtains strictly identical results on varying numbers of processors for more than ninety real simulation models, and the time required to locate a fault drops from more than two person-days on average to several person-hours.
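The abstract names dataflow state comparison as one ingredient of the approach but does not describe how it is carried out. As a rough illustration only, the sketch below records named intermediate states of a toy computation for a serial run and a two-partition run, then reports the first state that differs bitwise. Everything in it is an assumption made for illustration: the names `naive_sum`, `run`, and `first_divergence`, the toy load-summation stages, and the choice of floating-point reduction order as the source of the mismatch. Reduction order is a well-known general cause of such discrepancies, but the abstract does not say which faults were found in SSTA, and this is not the authors' tool.

```python
import math
import struct

def bits(x: float) -> bytes:
    """Exact IEEE-754 byte pattern, so the comparison is bitwise, not approximate."""
    return struct.pack("<d", x)

def naive_sum(values):
    """Left-to-right accumulation, written out so the rounding order is explicit."""
    acc = 0.0
    for v in values:
        acc += v
    return acc

def run(loads, nparts):
    """Toy computation (hypothetical): record the state after each stage.
    nparts mimics the process count; partial sums are combined in rank order."""
    states = [("loads", list(loads))]
    chunk = math.ceil(len(loads) / nparts)
    partial = [naive_sum(loads[i:i + chunk]) for i in range(0, len(loads), chunk)]
    total = naive_sum(partial)
    states.append(("total_load", [total]))
    states.append(("scaled_load", [0.5 * total]))  # downstream stage inherits any mismatch
    return states

def first_divergence(serial_states, parallel_states):
    """Return the name of the first recorded stage whose values differ bitwise, else None."""
    for (name_s, vals_s), (name_p, vals_p) in zip(serial_states, parallel_states):
        assert name_s == name_p, "both runs must record the same stages in the same order"
        if len(vals_s) != len(vals_p):
            return name_s
        if any(bits(a) != bits(b) for a, b in zip(vals_s, vals_p)):
            return name_s
    return None

loads = [1e16, 1.0, -1e16, 1.0]
serial = run(loads, nparts=1)
parallel = run(loads, nparts=2)
print(first_divergence(serial, parallel))  # prints "total_load"
```

Reporting the earliest diverging stage, rather than only comparing final outputs, is what narrows the search: here the inputs match and the mismatch first appears in the reduction, so the later `scaled_load` difference is only a consequence.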

Obtain identical results on varying number of processors; Structural linear static software; Parallelization of serial codes; Domain analysis; Fault-location

TANG Dehong (唐德泓), YANG Hao (杨浩), WEN Longfei (文龙飞), XU Zhengqiu (徐正秋)

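CAEP Software Center for High Performance Numerical Simulation (中物院高性能数值模拟软件中心), Beijing 100088

Institute of Applied Physics and Computational Mathematics (北京应用物理与计算数学研究所), Beijing 100088

Sichuan Zhongrui Information Technology Co., Ltd. (四川中锐信息技术有限公司), Chengdu 610041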

Serial-parallel consistency; Structural linear static software; Parallelization of serial codes; Domain analysis; Fault location

2024

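Computer Science (计算机科学)
Chongqing Southwest Information Co., Ltd. (formerly the Southwest Information Center of the Ministry of Science and Technology)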

Indexed in: CSTPCD; Peking University Core Journals (北大核心)
Impact factor: 0.944
ISSN:1002-137X
Year, volume (issue): 2024, 51(9)