首页|Improving the performance of software fault localization with effective coverage data reduction techniques

Improving the performance of software fault localization with effective coverage data reduction techniques

扫码查看
Fault localization (FL) techniques are widely used to identify the exact location of faulty statement in programs. Three common FL families are SBFL, MBFL, and deep learning-based FL, respectively. Before running any FL methods, coverage data is usually considered as input of FL stage. Therefore, coverage data plays an important role in FL field. On the other hand, if coverage data can be reduced effectively, the performance of FL will be greatly improved. In past studies, filtering out fault-irrelevant statements based on solely failed test cases, the traditional principal component analysis (PCA), and revised PCA techniques were applied to minimize coverage data. However, these approaches have a great opportunity to remove the actual faulty statement, especially in multiple fault localization (MFL). Tracing their root causes does not reflect the actual status of each statement. In this paper, we propose two approaches to improve the situations of deleted faulty statements. For the first approach, called Revised PCA with Ensemble Weight Integration (RPCA-EWI), it updates the contribution value of each statement based on revised PCA and incorporate the results of different combinations of failed and passed test cases. For the second approach, called Revised PCA with Important List Checking (RPCA-ILC), we establish a list of the top N% important statements by using the results of different test case combinations. If the deleted statement appears within this list, preserve it in reduced coverage data. Otherwise, it discards directly. We selected three Linux open-source codes (Gzip, Grep, and Sed) with 4 fault injections to validate the correctness. From the analysis of various perspectives, experimental results show that there is a significant improvement in shortening execution time of the FL process, and also can alleviate the situations for removed faulty statements compared to PCA and the revised PCA methods.

Revised PCAMultiple fault localizationEnsemble weightSoftware debugging

Chih-Chiang Fang、Chin-Yu Huang、Shou-Yu Lee、Yao-Hsien Tseng、C.W. Chu

展开 >

Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan

Department of Computer Science, Tunghai University, Taichung, Taiwan

Department of Post-Baccalaureate Medicine, National Chung Hsing University, Taichung, Taiwan||Division of Endocrinology and Metabolism, Department of Internal Medicine, Tungs' Taichung MetroHarbor Hospital Taichung, Taiwan

Fo Guang University, Jiaosi, Yilan County, Taiwan

展开 >

2025

The Journal of systems and software

The Journal of systems and software

ISSN:0164-1212
年,卷(期):2025.226(Aug.)
  • 44