南方农业学报 (Journal of Southern Agriculture) 2023, Vol. 54, Issue 9: 2544-2551. DOI: 10.3969/j.issn.2095-1191.2023.09.005

Distribution characteristics of microsatellites in the genome of Mastacembelus armatus

谢佳燕 1, 唐晟 1, 孙赫英 2

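Author information

  • 1. School of Life Science and Technology, Wuhan Polytechnic University, Wuhan, Hubei 430023
  • 2. Ecological Environment Monitoring and Scientific Research Center, Yangtze River Basin Ecological Environment Supervision and Administration Bureau, Ministry of Ecology and Environment, Wuhan, Hubei 430019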

Abstract

[Objective] To identify and analyze the distribution patterns and characteristics of microsatellite sequences in the genome of Mastacembelus armatus, providing a theoretical basis for germplasm resource evaluation and marker-assisted breeding of M. armatus.

[Method] Based on genomic sequence data of M. armatus, microsatellite loci were mined with MISA, and their distribution patterns and composition characteristics in the genome were analyzed using bioinformatics methods.

[Result] A total of 19,020 microsatellites were identified in the M. armatus genome, of which 16,025 were perfect microsatellites. Chromosome 2 carried the most microsatellites and chromosome 24 the fewest, accounting for 10.26% and 1.94% of the total, respectively. Mononucleotide repeats were the most abundant microsatellite type and hexanucleotide repeats the least abundant. Among the perfect microsatellites, the four most abundant motif types were A/T, AC/GT, G/C and AG/CT, accounting for 37.82%, 29.57%, 6.88% and 6.77% of the total number of microsatellites, respectively. Motif repeat numbers ranged from 5 to 104, occurring mainly at 5-6 and 10-11 repeats. Microsatellite length ranged from 10 to 192 bp, with 92.46% of microsatellites falling within 10-36 bp; the most frequent length was 12 bp, accounting for 15.22% of the total. Pearson correlation analysis showed an extremely significant negative correlation between microsatellite length and the number of microsatellites (P<0.01). GO functional annotation showed that microsatellite-associated genes were mainly enriched in three categories (biological process, molecular function and cellular component), and that nucleus was the most significantly enriched term.

[Conclusion] Microsatellites screened from the genome data of M. armatus have high availability and high polymorphism potential, and can provide genetic data for the efficient development of microsatellite molecular markers.
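The mining step described in the abstract can be illustrated with a minimal Python sketch. It is not the MISA implementation and does not reproduce the study's settings: the `MIN_REPEATS` thresholds and the function names `canonical_motif` and `find_perfect_ssrs` are hypothetical, chosen only for illustration. The sketch scans a sequence for perfect repeats with a regular-expression backreference and groups each motif with its rotations and reverse complement, which is the convention behind class labels such as A/T and AC/GT.

```python
import re
from collections import Counter

# Hypothetical minimum repeat counts (motif length -> minimum repeats).
# Illustrative assumptions only; the abstract does not state the thresholds used.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical_motif(motif: str) -> str:
    """Return the smallest string among all rotations of the motif
    and of its reverse complement, so that e.g. AC, CA, GT and TG
    all map to the same class."""
    revcomp = motif.translate(_COMPLEMENT)[::-1]
    candidates = []
    for m in (motif, revcomp):
        candidates.extend(m[i:] + m[:i] for i in range(len(m)))
    return min(candidates)


def find_perfect_ssrs(seq: str):
    """Return sorted (start, motif, repeats) tuples for perfect repeats.

    Simplified: each motif length is scanned independently and
    compound or interrupted microsatellites are not handled.
    """
    seq = seq.upper()
    hits = []
    for k in sorted(MIN_REPEATS):
        n = MIN_REPEATS[k]
        # One motif of length k followed by at least n - 1 further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA"),
            # since the shorter motif length already covers them.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


def count_motif_classes(seq: str) -> Counter:
    """Count perfect microsatellites per canonical motif class."""
    return Counter(canonical_motif(motif)
                   for _, motif, _ in find_perfect_ssrs(seq))
```

Calling `count_motif_classes` on each chromosome sequence would give per-chromosome tallies by motif class, and multiplying motif length by repeat count gives the length in bp of each locus. Real analyses should rely on MISA itself, which also reports compound microsatellites.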

Key words

Mastacembelus armatus/genome/microsatellite/chromosome/functional annotation

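Funding

National Natural Science Foundation of China (32101410)

Ministry of Education Industry-University Cooperative Education Project (202102102111)

Scientific Research Program of the Hubei Provincial Department of Education (B2017076)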

Publication year

2023
南方农业学报 (Journal of Southern Agriculture)
Guangxi Academy of Agricultural Sciences

Indexed in: CSTPCD, CSCD, Peking University Core Journals
Impact factor: 0.83
ISSN: 2095-1191
Number of references: 19