融合遗传算法和图神经网络的抑郁症智能诊断

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：目的构建基于脑网络的抑郁症智能诊断模型是一项具有挑战性的任务.近年来,图神经网络(graph neu-ral network,GNN)越来越多地应用于图的分类任务中,大部分GNN研究都只是对单一空间(样本空间或者特征空间)进行建模,导致模型分类性能不够好,本文提出一种基于遗传算法(genetic algorithm,GA)和GNN的多空间融合算法来对抑郁症患者进行智能诊断.方法模型采用留一站点交叉验证来确保模型的泛化性.脑网络的构建是基于Pearson相关的功能连接方法.整个算法以遗传算法作为主要框架,其中适应度函数是基于图卷积网络(graph convolutional network,GCN)分类算法,通过搜索个体间相似性阈值来找到具有最高分辨力的GCN.GCN由两个网络串联组成,一个网络获取受试者特征空间信息,另一个提取受试者之间样本空间的信息,最后通过两级GCN的联合学习实现分类.结果所有数据来源于The REST-meta-MDD项目,一共有来自10个站点1 160个受试者功能磁共振数据纳入本实验(男434、女726).实验结果显示,本文提出的分类器准确性、精度和受试者特征曲线(receiver operating characteristic,ROC)下面积分别为64.72％、69.69％和64.58％,优于其他主流算法.结论与其他算法相比,本文提出的算法融合了传统模型和深度学习模型的优点,获得了最佳的分类性能,未来很有可能为临床抑郁症诊断提供重要依据.

外文标题：Intelligent diagnosis of depression by integrating genetic algorithm and graph neural network

外文摘要：Objective Depression is currently one of the most common neuropsychiatric disorders in the world.However,its pathophysiological mechanisms are still unclear.The diagnosis of depression in clinical practice typically depends on neuropsychological scores and treatment responses,lacking objective evaluation tools,resulting in low consistency in diag-nosis.In recent years,an increasing number of people have begun to use machine learning technology to extract imaging biomarkers for the intelligent diagnosis of depression due to the capability of functional magnetic resonance imaging to pro-vide in vivo brain function and structural information.The brain network-based model has remarkable potential as an imag-ing marker for effectively distinguishing depression from normal controls.Graph neural networks(GNNs)are highly suit-able for graph classification tasks because they directly acquire graph structure information and maintain the topological characteristics of the graph during task execution.However,most GNN studies only model a single space(sample or fea-ture space),and the aggregation of GNN information can lead to over-smooth effects,resulting in poor model classification performance.This study aims to integrate multiple feature space information and propose a multispace fusion algorithm for the intelligent diagnosis of depression patients.Method Leave-one-site cross-validation(LOSCV)is used to ensure the gen-eralization of the model.The data are first preprocessed,and then a brain network is constructed using Pearson-related functional connectivity methods.The entire algorithm is mainly based on a genetic algorithm(GA),where the fitness func-tion is a classification algorithm based on a graph convolutional network(GCN).The solution space searched by GA is the similarity between the subject networks.The main steps of GA are as follows:1)Set the search range of the solution space[0.05,0.7];2)Generate an initial population;3)Based on LOSCV,GCN is used to classify the data,with the F1 value as the target value of the fitness function,and the threshold with the best fitness is finally retained(representing A*);4)Generate new populations through selection,crossover,and mutation operations(representing A);5)Compare A with A*.If the fitness value of A is better than A*,then A replaces A*;6)Determine whether the number of iterations for updating the population has reached the preset value.If not,then proceed to step 3 and continue executing the algorithm;if the threshold is reached,then the algorithm ends.The GA has a chromosome length of 8 bits and a threshold of 20 iterations.This paper aims to determine the similarity threshold between individuals with the highest classification capability in the population network.The GCN module comprises two networks connected in series:one mainly obtains information regard-ing the feature space of the brain network of a single subject,while the other network takes the subject as a node in the net-work.All subjects form a network to extract information from the sample space.The classification of a certain subject can be achieved through the joint learning of two levels of GCN.The two-level GCN architecture mainly includes f-GCN and p-GCN,and the basic ideas for constructing each architecture are as follows:f-GCN is a potential information representa-tion for learning the connectivity relationships of each brain region and transforming it into a highly efficient information rep-resentation for each brain network.F-GCN uses GCN to learn the embedding representation of a single brain region and then uses Eigenpooling to embed all brain region nodes into a single supernode to represent the information representation of the entire brain network.Eigenpooling is a pooling method in graph convolution neural network(GCN),which uses the eigenvectors of the Laplacian matrix to represent the information of nodes,transforms the original graph nodes into coordi-nates in the feature space,and associates each node with a specific number of high-energy eigenvectors,which are deter-mined by the eigenvalues of the Laplacian matrix.The feature vector represents the position of a node in the feature space,and its corresponding feature values indicate node importance.P-GCN constructs a topological structure based on the rela-tionship between subject brain networks and the representation of graph information acquired by f-GCN.The graph convolu-tional kernel aggregates the information representations of adjacent node entities of the subject and further reduces the dimensionality of the node information representation through graph pooling to generate the current supernode information representation.In this case,the hypernode represents the information of the entity as a whole.The graph information of the entire subject can be accurately represented through this super node,and the parameters of the f-GCN and p-GCN can be jointly updated through backpropagation to improve recognition accuracy.A scaled exponential similarity kernel is used for p-GCN to determine the similarity between samples.Result All data came from the REST-meta-MDD project,and a total of 1160 functional magnetic resonance imaging data from 10 sites(male 434,female 726)were included in this experiment.The experiment is a comparison of four representative algorithms of different types.The algorithm achieved the highest accuracy of 64.27％,which is 4.47％higher than the second-place support vector machine(SVM).Based on the Brain-NetCNN method,the accuracy is only 56.69％,demonstrating the worst classification performance.The accuracy of the Graphormer is 57.43％,and the hierarchical GCN also adopts the fusion of two networks,resulting in a classification accu-racy of 58.28％.The sample similarity threshold also impacts the final result,with an interval of 0.4-0.5 during identifi-cation of the optimal solution.Conclusion The intelligent diagnosis framework for depression based on GA and GCN pro-posed in this article combines the advantages of traditional and deep learning models.The results show that the proposed algorithm is not only superior to traditional machine learning algorithms(such as SVM),but also better than several main-stream GCN algorithms,with good generalization.This algorithm is likely to provide important information for clinical depression diagnosis in the future.

外文关键词：

major disorder depressiongraph convolutional network(GCN)intelligent diagnosisfusion algorithmindi-vidual similarity

作者：

龙丹、章梦达、应仁辉、陈丰农、邵岚、谢璩、罗聪

展开 >

作者单位：

浙江省肿瘤医院中国科学院杭州医学研究所,杭州 310022

杭州电子科技大学自动化学院,杭州 310018

关键词：

抑郁症图卷积网络(GCN) 智能诊断融合算法个体相似性

出版年：

2024

DOI：

10.11834/jig.230337

中国图象图形学报

中国科学院遥感应用研究所,中国图象图形学学会 ,北京应用物理与计算数学研究所

中国图象图形学报

CSTPCD北大核心

影响因子：1.111

ISSN：1006-8961

年,卷(期)：2024.29(11)