首页|全基因组甲基化测序比对软件在植物数据分析中的性能评估

全基因组甲基化测序比对软件在植物数据分析中的性能评估

扫码查看
DNA甲基化作为一种表观遗传修饰,在植物的生长、发育和逆境反应中发挥着至关重要的作用.全基因组重亚硫酸盐测序(whole genome bisulfite sequencing,WGBS)被认为是DNA甲基化研究的"金标准",并且广泛应用于动植物的功能基因组研究.虽然目前针对WGBS已经开发了数种分析工具,但还没有全面评估这些工具在植物基因组中的分析效率.本研究通过分析拟南芥(Arabidopsis thaliana)、水稻(Oryza sativa)和大豆(Glycine max)3种主要作物的DNA甲基化数据,对DNA甲基化的 4 个分析工具中常用的 6 种比对方法(BSMAP、Bismark-bwt2-e2e、Bismark-his2、Abismal、BSSeeker2-bwt2-e2e 和 BS-Seeker2-bwt2-local)从运行效率、内存资源利用、比对质量和甲基化位点识别等方面进行了全面的性能评估.结果显示,与另外5种甲基化比对方法相比,虽然BSMAP运存较大,但是在运行速度上表现出较高的效率,尤其在处理大规模基因组数据时具有明显的优势.此外,BSMAP在比对质量和甲基化位点识别方面也展现出较高水平,保证了结果的准确性和可靠性.值得注意的是,研究认为在选择比对工具时,还需要根据实际计算资源、研究需求以及对运行速度和内存消耗的权衡来做出合适的决策.这项研究为从事植物领域DNA甲基化研究的科研人员提供了有益的分析指导,有助于提高研究效率和结果的可信度,对深入分析植物生物学中的DNA甲基化具有重要的科学意义.
Performance Evaluation of Whole Genome Bisulfite Sequencing Alignment Soft-ware for Plant Data Analysis
DNA methylation,as an epigenetic modification,plays a crucial role in the growth,development,and stress responses of plants.Whole genome bisulfite sequencing(WGBS)is considered as the"gold standard"for DNA methylation studies and is widely applied in the functional genomics research of both animals and plants.Although several analysis tools have been developed for WGBS,there has yet to be a comprehensive assessment of their analysis efficiency in plant data.This study conducted a thorough performance evaluation of six commonly used alignment methods(BSMAP,Bismark-bwt2-e2e,Bismark-his2,Abismal,BSSeeker2-bwt2-e2e,and BSSeeker2-bwt2-local)within four DNA methylation analysis tools.The evaluation encompassed aspects such as runtime efficiency,memory resource utilization,alignment quality,and identification of methylation sites by analyzing DNA methylation data from three major crops:Arabidopsis thaliana,Oryza sativa and Glycine max.The results indicate that although BSMAP requires larger memory requirements,it exhibits higher efficiency in terms of running speed,particularly when dealing with large-scale genomic data.Fur-thermore,BSMAP demonstrates high levels of alignment quality and methylated site identification,ensuring the accuracy and relia-bility of the results.The study emphasizes the need for researchers to make informed decisions when selecting alignment tools based on practical computational resources,research requirements,and the trade-off between running speed and memory consumption.This research provides valuable analytical guidance for scientists engaged in DNA methylation studies in the field of plants,contribu-ting to improve research efficiency and result reliability.It holds significant scientific importance for a deeper analysis of DNA methy-lation in plant biology.

DNA methylationWhole genome bisulfite sequencingSequence alignmentPlant

徐子健、江祉依、王海峰、谢亮

展开 >

广西大学农学院,亚热带农业生物资源保护与利用国家重点实验室,广西甘蔗生物学重点实验室,南宁,530004

DNA甲基化 全基因组甲基化测序 序列比对 植物

国家自然科学基金广西自然科学基金重点项目

321601422022JJD130001

2024

基因组学与应用生物学
广西大学

基因组学与应用生物学

CSTPCD北大核心
影响因子:1.108
ISSN:1674-568X
年,卷(期):2024.43(6)