首页|深度学习在基于序列的蛋白质互作预测中的应用进展

深度学习在基于序列的蛋白质互作预测中的应用进展

扫码查看
蛋白质-蛋白质相互作用在细胞信号转导、基因表达和代谢调控等生物过程中发挥重要作用,鉴定蛋白质间的相互作用对于理解复杂生物过程至关重要.预测蛋白质间的相互作用可以为药物发现、蛋白质功能研究和设计等领域提供帮助.近年来,随着人工智能技术的蓬勃发展,深度学习技术在预测蛋白质互作领域做出巨大贡献,其中基于序列的深度学习模型通过学习蛋白质序列信息的深层特征进行互作预测.本文综述了深度学习在基于序列的蛋白质互作预测中的应用,按照算法框架和时间线对该领域进展进行分类归纳,介绍了数据处理、序列编码方法、算法架构以及模型的评估指标等内容,并分析了当前面临的问题以及未来的发展方向.随着深度学习技术的发展,预测蛋白质互作的效率大幅提高,未来需要发展泛化能力更强的预测模型,助力蛋白质互作的预测.
Advances in applications of deep learning for predicting sequence-based protein interactions
Protein-protein interactions play a crucial role in biological processes such as cell signal transduction,gene expression and metabolic regulation,and thus their identification is essential for understanding these complex biological processes.Predicting protein-protein interactions is a hot topic of great significance,which can provide assistances in areas such as drug discovery and protein function research and design as well.In recent years,with the development of artificial intelligence,machine learning technologies have been applied gradually to the prediction of protein-protein interactions,which has shown good potentials.However,when processing a large amount of protein information,traditional machine learning methods are difficult to mine the intrinsic patterns and potential features,and deep learning techniques are needed.Compared with the three-dimensional structure of proteins,sequence information is easier to obtain,and the development of high-throughput sequencing technology provides abundant protein sequence information,which greatly facilitates the development of sequence-based deep learning technologies.Sequence-based deep learning models predict protein-protein interactions by learning intrinsic patterns and features from protein sequence information,which greatly improves prediction efficiency and accuracy.In this review,we focus on progress of deep learning in predicting sequence-based protein interactions,categorize,which is summarized according to the algorithmic framework and timeline,briefly describing the construction methods of datasets and the evaluation metrics of the models,discussing in detail the sequence encoding methods and common algorithmic architectures,and demonstrating the computational models based on various types of algorithms and their features and advantages.Finally,we analyze current challenges in predicting protein-protein interactions using deep learning methods,and discuss possible solutions.With the development of deep learning technology,the efficiency of predicting protein-protein interactions has increased dramatically.As a result,there is a need to develop models with stronger generalization and more robust prediction capabilities to aid the prediction of protein-protein interactions in the future.

protein interactionsdeep learningartificial intelligencesequence encodingneural network

朱景勇、李钧翔、李旭辉、张瑾、毋文静

展开 >

浙江理工大学生命科学与医药学院,浙江 杭州 310018

嘉兴学院生物与化学工程学院,浙江 嘉兴 314000

浙江清华长三角研究院,衰老科学创新研发中心,浙江 嘉兴 341001

禾美生物科技(浙江)有限公司,浙江嘉兴 341001

浙江清华长三角研究院,浙江省应用酶学重点实验室,浙江 嘉兴 314006

展开 >

蛋白质互作 深度学习 人工智能 序列编码 神经网络

国家自然科学基金国家自然科学基金浙江省自然科学基金重点项目

3217270832102506LZ23C170002

2024

合成生物学

合成生物学

CSTPCD北大核心
ISSN:
年,卷(期):2024.5(1)
  • 127