查看更多>>摘要:This paper introduces an innovative approach to sleep stage classification, leveraging a multi-modal signal integration framework encompassing Electrooculography (EOG) and two-channel electroencephalography (EEG) data. We explore the utility of various feature extraction techniques, including Short-Time Fourier Transform (STFT), Wavelet Transform, and raw signal processing, alongside the utilization of neural networks as feature extractors. This unique combination allows us to harness the benefits of traditional feature extraction methods while capitalizing on the power of neural networks to enhance classification performance. Our comprehensive classifier evaluation encompasses a range of models, including Long Short-Term Memory (LSTM) networks and XGBoost. Remarkably, our results reveal exceptional performance with the XGBoost classifier, achieving an overall accuracy of 84.57 % and a macro-F1 score of 78.21 % on the Sleep-EDF expanded dataset, and an overall accuracy of 86.02 % and a macro-F1 score of 81.96 % on the ISRUC-Sleep dataset. Class-specific accuracies highlight its proficiency, particularly in detecting wake and N2 stages, solidifying its superiority among the classifiers tested. This amalgamation of feature sets, complemented by Principal Component Analysis (PCA) for dimensionality reduction, underscores its significance in yielding top-tier classification outcomes. The integration of traditional feature extraction methods with neural networks as feature extractors creates a robust and comprehensive system for sleep stage classification, offering the advantages of both approaches to enhance the accuracy and reliability of the results.
查看更多>>摘要:Diagnosis codes are standard code format of diseases or medical conditions. This study is aimed at assigning diagnosis codes to patients in large-scale biobanks, particularly addressing the issue of missing codes for some patients. This is crucial for downstream disease-related tasks. While recent methods primarily rely on structured biobank data for code assignment, they often overlook the valuable medical context provided by textual information in the biobanks and hierarchical structure of the disease coding system. To address this gap, we have developed CATI, a medical context-enhanced framework for diagnosis Code Assignment by integrating Textual details derived from key features and disease hIerarchy. The study is based on the UK Biobank data and considers Phecodes and ICD-10 codes as standard disease formats. We start by representing ten informative codified features using their formal names and then integrate them into CATI as text embeddings, achieved through prompt tuning on the pre-trained language model BioBERT. Recognizing the hierarchical structure of diagnosis codes, we have developed a novel convolution layer in our method that effectively propagates logits between adjacent diagnosis codes. Evaluation results demonstrate that CATI outperforms existing stateof-the-art methods in terms of both Phecodes and ICD-10 codes, boasting at least a 5.16% improvement in average AUROC for unseen disease codes and an 8.68% rise in average AUPRC for disease codes with training instances ranging in (1000,10000]. This framework contributes to the formation of well-defined cohorts for downstream studies and offers a unique perspective for addressing complex healthcare tasks by incorporating vital medical context.
Mouazen, BadrBendaouia, AhmedAbdelwahed, El HassanDe Marco, Giovanni...
1.1-1.15页
查看更多>>摘要:Multiple Sclerosis (MS) is a chronic neuroinflammatory disease of the Central Nervous System (CNS) in which the body's immune system attacks and destroys the myelin sheath that protects nerve fibers, leading to a wide range of debilitating symptoms and causing disruption of axonal signal transmission. Accurate prediction, diagnosis, monitoring and treatment (PDMT) of MS are essential to improve patient outcomes. Recent advances in neuroimaging technologies, particularly electroencephalography (EEG), combined with machine learning (ML) techniques - including Deep Learning (DL) models - offer promising avenues for enhancing MS management. This systematic review synthesizes existing research on the application of ML and DL models to EEG data for MS. It explores the methodologies used, with a focus on DL architectures such as Convolutional Neural Networks (CNNs) and hybrid models, and highlights recent advancements in ML techniques and EEG technologies that have significantly improved MS diagnosis and monitoring. The review addresses the challenges and potential biases in using ML-based EEG analysis for MS. Strategies to mitigate these challenges, including advanced preprocessing techniques, diverse training datasets, crossvalidation methods, and explainable Artificial Intelligence (AI), are discussed. Finally, the paper outlines potential future applications and trends in ML for MS management. This review underscores the transformative potential of ML-enhanced EEG analysis in improving MS management, providing insights into future research directions to overcome existing limitations and further improve clinical practice.
Chowdhury, Md Nakib HayatReaz, Mamun Bin IbneAli, Sawal Hamid MdCrespo, Maria Liz...
1.1-1.17页
查看更多>>摘要:Chronic kidney disease (CKD) poses a significant risk for diabetes patients, often leading to severe complications. Early and accurate CKD stage detection is crucial for timely intervention. However, it remains challenging due to its asymptomatic progression, the oversight of routine CKD tests during diabetes checkups, and limited access to nephrologists. This study aimed to address these challenges by developing a multiclass CKD stage prediction model for diabetes patients using longitudinal data from the Chronic Renal Insufficiency Cohort (CRIC) study. A novel iterative backward feature selection strategy was employed to determine key predictors of the CKD stage. TabNet, an attention-based deep learning architecture, was used to build classification models in complete and simplified categories. The complete model used 31 features, including complex kidney biomarkers, while the simplified model used 15 features readily available from routine checkups. The performance of TabNet was compared against traditional tree-based ensemble methods (XGBoost, random forest, AdaBoost) and a multilayer perceptron. Model-specific and model-agnostic explainable AI (XAI) techniques were applied to interpret model decisions, enhancing the transparency and clinical applicability of the proposed approach. The TabNet models demonstrated superior performance, achieving 94.06 % and 92.71 % accuracy in cross-validation for the complete and simplified models, respectively, and 91.00 % and 88.00 % accuracy on test sets. XAI analysis identified serum creatinine, cystatin C, sex, and age as the most influential factors in CKD stage classification. The proposed TabNet models offer a robust approach for early CKD severity detection in diabetes patients, potentially improving clinical decision-making and patient outcomes.
查看更多>>摘要:Large Language Models (LLMs) have shown remarkable potential in various fields. This study explores their application in solving multi-objective combinatorial optimization problems-surgery scheduling problem. Traditional multi-objective optimization algorithms, such as the Non-dominated Sorting Genetic Algorithm II (NSGA-II), often require domain expertise for designing precise operators. Here, we propose LLM-NSGA, where LLMs act as evolutionary optimizers, performing selection, crossover, and mutation operations. Results show that for 40 cases, LLMs can independently generate high-quality solutions from prompts. As problem size increases, LLM-NSGA outperformed traditional approaches like NSGA-II and MOEA/D, achieving average improvements of 5.39 %, 80 %, and 0.42 % in three objectives. While LLM-NSGA provided similar results to EoH, another LLMbased method, it outperformed EoH in overall resource allocation. Additionally, we applied LLMs for hyper-parameter optimization, comparing them with Bayesian Optimization and Ant Colony Optimization (ACO). LLMs reduced runtime by an average of 23.68 %, and their generated parameters, validated with NSGA-II, produced better surgery scheduling solutions. This demonstrates that LLMs can not only help traditional algorithms find better solutions but also optimize their parameters efficiently.
Iranzo-Sanchez, JorgeSantamaria-Jorda, JaumeMas-Molla, GerardDiaz-Munio, Goncal V. Garces...
1.1-1.18页
查看更多>>摘要:The application of large language models (LLMs) to speech translation (ST) or, in general, to machine translation (MT) has recently provided excellent results, superseding conventional encoder-decoder MT systems in the general domain. However, this is not clearly the case when LLMs as MT systems are translating medical-related materials. In this respect, the provision of multilingual training materials for oncology professionals is a goal of the EU project Interact-Europe in which this work was framed. To this end, cross-language technology adapted to the oncology domain was developed, evaluated and deployed for multilingual interspecialty medical education. More precisely, automatic speech recognition (ASR) and MT models were adapted to the oncology domain to translate English pre-recorded training videos, kindly provided by the European School of Oncology (ESO), into French, Spanish, German and Slovene. In this work, three categories of MT models adapted to the medical domain were assessed: bilingual encoder-decoder MT models trained from scratch, pre-trained large multilingual encoder-decoder MT models, and multilingual decoder-only LLMs. The experimental results underline the competitiveness in translation quality of LLMs compared to encoder-decoder MT models. Finally, the ESO speech dataset, comprising roughly 1000 videos and 745 h for the training and evaluation of ASR, MT and ST models, was publicly released for the scientific community.