Science China Information Sciences (English edition), 2024, Vol. 67, Issue 6: 210-225. DOI: 10.1007/s11432-023-3897-2

A novel graph oversampling framework for node classification in class-imbalanced graphs

Riting XIA (1), Chunxu ZHANG (2), Yan ZHANG (3), Xueyan LIU (2), Bo YANG (2)

Author information

  • 1. Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China; College of Artificial Intelligence, Jilin University, Changchun 130012, China
  • 2. Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China; College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • 3. College of Communication Engineering, Jilin University, Changchun 130012, China

Abstract

Graph neural networks (GNNs) are a promising method for analyzing graphs. Most existing GNNs assume balanced classes and therefore handle class-imbalanced graphs poorly. Oversampling is effective in alleviating class imbalance. However, most graph oversampling methods generate synthetic minority nodes and their edges after applying GNNs. They overlook a problem: because the GNN aggregates neighbor information before oversampling, the representations of both the original and the synthetic minority nodes are dominated by majority nodes. In this paper, we propose a novel graph oversampling framework, termed distribution alignment-based oversampling for node classification in class-imbalanced graphs (Graph-DAO). Our framework generates synthetic minority nodes before the GNN to avoid the dominance of majority nodes caused by message passing. Additionally, we introduce a distribution alignment method based on the sum-product network to learn more information about minority nodes. To the best of our knowledge, this is the first work to use the sum-product network to address the class-imbalance problem in node classification. Extensive experiments on four real-world datasets show that our method achieves the best results on the node classification task for class-imbalanced graphs.
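To make the idea of oversampling before message passing concrete, the following is a minimal sketch that interpolates new minority nodes directly in raw feature space, SMOTE-style, so that no neighbor aggregation has touched the features yet. It is an illustration under stated assumptions and not the Graph-DAO procedure: the function name `oversample_minority`, the nearest-neighbor interpolation rule, and the toy data are all hypothetical, and the sketch omits both edge generation for the new nodes and the sum-product-network distribution alignment described in the abstract.

```python
import numpy as np


def oversample_minority(features, labels, minority_class, n_new, seed=0):
    """SMOTE-style interpolation on raw node features (hypothetical helper).

    Runs before any GNN layer, so the synthetic features are not mixed
    with majority-class neighbors by message passing.
    """
    rng = np.random.default_rng(seed)
    minority = features[labels == minority_class]
    if len(minority) < 2:
        raise ValueError("need at least two minority nodes to interpolate")

    synthetic = np.empty((n_new, features.shape[1]), dtype=float)
    for i in range(n_new):
        # Pick a random minority node as the anchor.
        a = rng.integers(len(minority))
        # Find its nearest other minority node in feature space.
        dists = np.linalg.norm(minority - minority[a], axis=1)
        dists[a] = np.inf
        b = int(np.argmin(dists))
        # Place the synthetic node on the segment between the two.
        lam = rng.random()
        synthetic[i] = minority[a] + lam * (minority[b] - minority[a])

    new_features = np.vstack([features, synthetic])
    new_labels = np.concatenate([labels, np.full(n_new, minority_class)])
    return new_features, new_labels


# Toy data (assumed): two minority nodes (class 1), four majority nodes (class 0).
X = np.array([[0.0, 0.0], [1.0, 0.0],
              [5.0, 5.0], [6.0, 5.0], [5.0, 6.0], [6.0, 6.0]])
y = np.array([1, 1, 0, 0, 0, 0])
X_aug, y_aug = oversample_minority(X, y, minority_class=1, n_new=2)
```

After this step the class counts are balanced at four nodes each, and a GNN would be trained on the augmented node set once edges for the synthetic nodes have been chosen by some separate rule.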

Key words

graph neural network / class-imbalanced graphs / sum-product network / oversampling / node classification


Funding

National Key R&D Program of China (2021ZD0112500)

National Natural Science Foundation of China (U22A2098)

National Natural Science Foundation of China (62172185)

National Natural Science Foundation of China (62202200)

National Natural Science Foundation of China (62206105)

Fundamental Research Funds for the Central Universities, JLU()

Publication year

2024
Journal: Science China Information Sciences (English edition)
Chinese Academy of Sciences
Indexed in: CSTPCD, EI
Impact factor: 0.715
ISSN: 1674-733X