首页|RUDEUS, a machine learning classification system to study DNA-Binding proteins
RUDEUS, a machine learning classification system to study DNA-Binding proteins
扫码查看
点击上方二维码区域,可以放大扫码查看
原文链接
NETL
NSTL
By a News Reporter-Staff News Editor at Robotics & Machine Learning Daily News Daily News – According to news reporting based on a preprint abstract, our journalists obtained the following quote sourced from biorxiv.org: “DNA-binding proteins are essential in different biological processes, including DNA replication, tran- scription, packaging, and chromatin remodelling. Exploring their characteristics and functions has become relevant in diverse scientific domains. Computational biology and bioinformatics have assisted in studying DNA-binding proteins, complementing traditional molecular biology methods. “While recent advances in machine learning have enabled the integration of predictive systems with bioinformatic approaches, there still needs to be generalizable pipelines for identifying unknown proteins as DNA-binding and assessing the specific type of DNA strand they recognize. “In this work, we introduce RUDEUS, a Python library featuring hierarchical classification models de- signed to identify DNA-binding proteins and assess the specific interaction type, whether single-stranded or double-stranded. RUDEUS has a versatile pipeline capable of training predictive models, synergizing protein language models with supervised learning algorithms, and integrating Bayesian optimization strate- gies. The trained models have high performance, achieving a precision rate of 95% for DNA-binding identification and 89% for discerning between single-stranded and double-stranded interactions. RUDEUS includes an exploration tool for evaluating unknown protein sequences, annotating them as DNA-binding, and determining the type of DNA strand they recognize. Moreover, a structural bioinformatic pipeline has been integrated into RUDEUS for validating the identified DNA strand through DNA-protein molecular docking.
BioinformaticsBiotechnologyBiotechnology - Bioinformat- icsCyborgsDNA-Binding ProteinsEmerging TechnologiesInformation TechnologyMachine LearningPeptidesPeptides and ProteinsProteins