内蒙古大学学报(自然科学版)2024,Vol.55Issue(5) :502-508.DOI:10.13484/j.nmgdxxbzk.20240507

六个转录因子在人类全基因组中结合位点数的估计

Estimation of the Number of Binding Sites for Six Transcription Factors in the Human Genomes

李子涵 郭庭赫 柴露 高洁 张利绒
内蒙古大学学报(自然科学版)2024,Vol.55Issue(5) :502-508.DOI:10.13484/j.nmgdxxbzk.20240507

六个转录因子在人类全基因组中结合位点数的估计

Estimation of the Number of Binding Sites for Six Transcription Factors in the Human Genomes

李子涵 1郭庭赫 1柴露 1高洁 1张利绒1
扫码查看

作者信息

  • 1. 内蒙古大学物理科学与技术学院,呼和浩特 010021
  • 折叠

摘要

转录因子是一种与DNA上特定序列相结合,进而对基因的转录和表达进行调控的蛋白质.利用理论方法对细胞或组织特异的转录因子结合位点进行预测时,负集的大小和选择往往会影响预测模型性能的评估.通过估计转录因子在人类基因组中结合位点的数量,可以准确评估预测模型的性能.因此,本文利用不同细胞系中CTCF、POLR2A、EZH2、REST、MAX、RAD21六个转录因子的ChIP-Seq数据,对转录因子在人类基因组中的结合位点数进行拟合和估计,为构建转录因子预测模型负集的选择提供参考.

Abstract

Transcription factors are proteins that bind to specific DNA sequences,thereby regu-lating the transcription and expression of genes.When using theoretical methods to predict cell line or tissue-specific transcription factor binding sites,the size and selection of the negative set often impact the performance evaluation of the prediction model.Therefore,an accurate estimation of the number of transcription factor binding sites in the human genome can help evaluate the performance of prediction models more accurately.In this study,utilizing the ChIP-Seq data of six transcription factors(CTCF,POLR2A,EZH2,REST,MAX,RAD21)in different cell lines,we performed poly-nomial fitting and estimation for the number of their binding sites in the human genomes.This results provide a reference for the selection of negative sets when constructing prediction models of transcription factor binding sites.

关键词

转录因子/结合位点数/拟合

Key words

transcription factor/number of binding site/fitting

引用本文复制引用

基金项目

国家自然科学基金项目(61962041)

出版年

2024
内蒙古大学学报(自然科学版)
内蒙古大学

内蒙古大学学报(自然科学版)

CSTPCD
影响因子:0.346
ISSN:1000-1638
段落导航相关论文