首页|二代基因组测序鉴别狮头鹅拷贝数变异及其与体重体尺关联

二代基因组测序鉴别狮头鹅拷贝数变异及其与体重体尺关联

扫码查看
[背景]许多研究报道拷贝数变异(copy number variation,CNV)是一种长度在 50 bp至 5 Mb之间的缺失或插入,可以影响基因的表达,从而影响动物的生长发育特征,与畜禽重要经济性状有紧密的关联,是一种重要分子遗传标记之一。狮头鹅是世界体型最大鹅种之一,原产地为广东饶平,为广东卤鹅的原材料。但是,至今还没有关于狮头鹅CNV与体重体尺的全基因组关联研究报道。[目的]通过二代基因组测序数据鉴别狮头鹅的 CNV 和拷贝数变异区域(copy number variation region,CNVR)在基因组上分布情况,通过 CNV 与体重体尺性状的关联分析,挖掘显著影响体重体尺的 CNV 及候选基因,为狮头鹅后续的分子育种研究提供参考。[方法]试验共收集了来自汕头市白沙禽畜原种研究所的 111 只狮头鹅,其中公鹅 20 只,母鹅 91 只。所有鹅均采用统一标准饲养管理。对 111 只鹅进行体重体尺测定,体尺性状包括体斜长、胸深、胸宽等 9 个指标。本试验对 111 只鹅进行体重体尺测定和二代基因组测序(5×)。测序数据利用SOAPnuke进行质控,软件Speedseq中的 BWA模块进行序列比对,采用Speedseq中的LUMPY和CNVnator模块检测结构变异(structure variation,SV),从SV中筛选CNV。本试验用软件SVtools对CNV进行基因分型,然后采用单标记混合模型开展分型CNV与体重体尺的关联分析。采用染色体显著性水平(即 0。05/染色体CNV数目)作为定义与性状显著关联CNV的阈值,对显著CNV位点及上下游 50 kb进行基因注释,找到影响狮头鹅体重体尺关联的候选基因。用R包CNVrd2 对物理距离小于 1 Mb的染色体水平显著CNV和染色体水平显著SNP做连锁不平衡(linkage disequilibrium,LD)分析。[结果]对于 111 只狮头鹅,共检测出 99 158 个CNV,其中缺失型 94 560 个,重复型 4 598 个,CNV平均长度 11 858 bp,大部分(74。06%)CNV长度位于 50-1 000 bp区间。CNVR共 5 225 个,包括缺失型 5 029 个,重复型110 个和混合型 86 个,CNVR平均长度为 7 136 bp,大部分(81。03%)CNVR长度位于 50-1 000 bp区间。功能注释发现 46。92%CNVR位于基因间区域,10。30%位于基因上游,9。35%位于基因下游。准确进行基因分型的 CNV有 6 217 个,通过 10 个体重体尺性状与这些CNV关联分析,共检测 55 个染色体显著性水平的CNV位点,注释到 45 个候选基因。在45 个候选基因中,发现SETD2、UBR7、G2E3等 10 个基因同时影响两个及两个以上性状。染色体水平显著CNV独立于染色体水平显著SNP影响体重体尺性状(r2<0。02)。[结论]通过二代基因组测序首次报道狮头鹅基因组CNV和CNVR分布及CNV和体重体尺关联的情况。本试验共发现影响体重体尺的 45 个候选基因,其中 11 个已被报道与畜禽生长信号通路有关,分别是SETD2、UBR7、ASB1和HDAC4参与肌肉的增殖、分化和代谢;G2E3、P3C2B、NOVA1和PDE1B参与脂肪生成和肥胖;ILKAP与调节生长因子有关;KIF1B参与骨代谢;ZFP37参与糖原代谢。这些为后续狮头鹅生长性能的分子遗传机制解析和分子标记挖掘奠定基础。
Identification of Copy Number Variation and Its Association with Body Weight and Size of Lion-Head Geese by Next-Generation Sequencing
[Background]Many previous studies have reported that copy number variation(CNV)is a kind of deletion or duplication with the length of 50 bp-5 Mb,which can affect the expression of genes.It is closely associated with economically important traits of livestock,which is one kind of promising molecular markers.Lion-head goose is one of the largest goose species in the world.It is originated in Raoping,Guangdong Province and is the raw material for Guangdong marinated geese.So far,there has no genome-wide association study on investigating the relationship between CNV and body weight and size in lion-head geese.[Objective]This study identified the CNV and CNV region(CNVR)of lion-head geese by using the second-generation genome sequencing data,and then detected CNV and candidate genes significantly affecting body weight and size through the association between them,which could provide the valuable reference information for molecular breeding of lion-head geese.[Method]A total of 111 lion-head geese were collected from Baisha Poultry and Livestock Origin Research Institute in Shantou,including 20 males and 91females.All geese were raised and managed under the uniform standards.The body weight and size traits of 111 geese were measured,and the body size traits included body oblique length,chest depth,chest width and so on.The next-generation genome sequencing data(5×)was generated using blood samples for these geese.SOAPnuke was used for the quality control of sequencing data.The BWA module of Speedseq was used for alignment,and the LUMPY and CNVnator modules of Speedseq were used to detect structural variations(SVs).CNV were selected from SV.The software SVtools was used to genotype CNV,and the association analysis between CNV and body weight and size traits was performed by using the single maker mixed model.CNV significantly associated with traits was screened through the chromosome significance level(0.05/number of CNV on the chromosome),and then annotated the significant CNV including their upstream and downstream 50 kb to identify candidate genes for the body weight and size of lion-head geese.The R package CNVrd2 was used to analyze the linkage disequilibrium(LD)of chromosome-significant CNV and chromosome-significant SNP with physical distance less than 1 Mb.[Result]For 111 lion-head geese,this study detected 99158 CNV including 94 560 deletions and 4 598 duplications.The average length of CNV was 11 858 bp,and most(74.06% )of them were located in the range of 50 bp-1 Kb.A total of 5 225 CNVR were detected,which contained 5 029 loss types,110 gain types,and 86 mixed types.The average length of CNVR was 7 136 bp,and the lengths of most(81.03% )of the CNVRs were 50 bp-1 Kb.Functional annotation showed that 46.92% of CNVR were located in the inter gene region,10.30% were located the upstream,and 9.35% were located the downstream.There were 6 217 CNV accurately genotyped for association analysis.By the association analysis of body weight and size traits and CNV,a total of 55 CNV exceeded the significance level of chromosomes,and then annotated 45 candidate genes based on these 55 CNV.Among these 45 candidate genes,it was found that 10 genes,such as SETD2,UBR7 and G2E3,simultaneously influenced two or more traits.Chromosome-significant CNV affected body weight and size traits independently of chromosome-significant SNP(r2<0.02).[Conclusion]This study for the first time reported the distribution of CNV and CNVR in the genome of lion-head geese as well as the association between CNV and body weight and size by using the next-generation genome sequencing data.It was found that a total of 45 candidate genes influencing the body weight and size traits,in which 11 genes were reported to be related to signal pathways of animal growth,among these 11 genes,SETD2,UBR7,ASB1 and HDAC4 were involved in muscle proliferation,differentiation and metabolism,G2E3,P3C2B,NOVA1 and PDE1B were involved in adipogenesis and obesity,ILKAP was involved in regulating growth factors,KIF1B was involved in bone metabolism,and ZFP37 was involved in glycogen metabolism.These results laid a solid foundation for analyzing molecular genetic mechanism and detecting molecular marker for the growth performance of lion-head goose.

lion-head goosebody weight and size traitsCNVcandidate gene

张力允、黄智荣、杨柳、陈俊鹏、林祯平、黄红艳、伍仲平、张续勐、田允波、黄运茂、李秀金

展开 >

仲恺农业工程学院动物科技学院/广东省水禽健康养殖科技创新平台,广州 510225

中国农业科学院深圳农业基因组研究所,广东深圳 518120

广东省汕头市白沙禽畜原种研究所,广东汕头 515821

狮头鹅 体重体尺 拷贝数变异 候选基因

广东省重点领域研发计划广东省乡村振兴战略专项种业振兴行动项目

2020B0202220032023XDY00001

2024

中国农业科学
中国农业科学院

中国农业科学

CSTPCD北大核心
影响因子:1.899
ISSN:0578-1752
年,卷(期):2024.57(14)
  • 52