Efficient hypergraph neural network for large-scale data

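High-order correlations are widespread in the real world, for example in social, biological, and transportation networks, and modeling and optimizing them is important for studying network properties and predicting how networks evolve. The hypergraph is a flexible data structure that models high-order correlations naturally. In recent years, with the development of deep learning, hypergraph neural networks built on hypergraph modeling have been widely applied to representation learning over high-order correlations. However, existing hypergraph neural networks are all based on the transductive learning paradigm: they perform well on small-scale hypergraph datasets but are difficult to apply to large-scale data, which limits their range of application. This paper first analyzes the challenges that existing hypergraph neural network methods face on large-scale data, and then proposes an efficient hypergraph neural network (EHGNN) for large-scale data. To address the high space and time complexity of existing methods, EHGNN designs a hypergraph sampling module and a computational acceleration module based on single-stage hypergraph convolution, respectively. Together they reduce the space and time overhead of hypergraph neural networks, make them applicable to large-scale hypergraph data, and significantly improve scalability. Experimental results on four real-world hypergraph datasets verify the effectiveness and efficiency of EHGNN.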
Efficient hypergraph neural network on million-level data
High-order correlations are ubiquitous in real-world networks such as social, biological, and transportation networks, and modeling and optimizing them is important for studying these networks. The hypergraph, a flexible and scalable structure, models high-order correlations naturally. With the development of deep learning, hypergraph neural networks (HGNNs) have been widely used for high-order correlation modeling and optimization. Although existing HGNNs perform well on small-scale datasets, they cannot be applied to large-scale data because the transductive learning paradigm they rely on incurs a prohibitive space cost. This paper first analyzes why HGNNs are unable to handle large-scale data. It then presents the efficient hypergraph neural network (EHGNN) for million-level data. EHGNN introduces a hypergraph sampling module and a computational acceleration module based on single-stage hypergraph convolution, reducing both the time and space cost of HGNNs. Experimental results on four real-world hypergraph datasets demonstrate the effectiveness and efficiency of the proposed EHGNN.
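The abstract does not state EHGNN's formulas, so the following is only a rough illustration of the two ideas it names. It assumes the commonly used normalized hypergraph convolution, Dv^{-1/2} H W De^{-1} H^T Dv^{-1/2} X Θ, evaluated in a single pass without materializing a vertex-by-vertex matrix, together with a plain uniform vertex sampler. The function names `hypergraph_conv` and `sample_subhypergraph` are hypothetical, and neither function should be read as the paper's actual sampling module or single-stage convolution.

```python
import numpy as np

def hypergraph_conv(X, H, Theta, edge_w=None):
    """Generic normalized hypergraph convolution (an assumption, not EHGNN itself).

    X: (n_vertices, d_in) features, H: (n_vertices, n_edges) 0/1 incidence matrix,
    Theta: (d_in, d_out) weights, edge_w: optional (n_edges,) hyperedge weights.
    """
    H = np.asarray(H, dtype=float)
    w = np.ones(H.shape[1]) if edge_w is None else np.asarray(edge_w, dtype=float)
    dv = H @ w            # weighted vertex degrees
    de = H.sum(axis=0)    # number of vertices in each hyperedge
    # Guard against isolated vertices and empty hyperedges.
    dv_inv_sqrt = np.zeros_like(dv)
    dv_inv_sqrt[dv > 0] = dv[dv > 0] ** -0.5
    de_inv = np.zeros_like(de)
    de_inv[de > 0] = 1.0 / de[de > 0]
    # Vertex -> hyperedge aggregation, then hyperedge -> vertex aggregation.
    # Only (n_edges, d_in) and (n_vertices, d_in) intermediates are created.
    edge_feat = (H.T @ (dv_inv_sqrt[:, None] * X)) * (w * de_inv)[:, None]
    vert_feat = dv_inv_sqrt[:, None] * (H @ edge_feat)
    return vert_feat @ Theta

def sample_subhypergraph(H, batch_size, rng):
    """Uniformly sample vertices and keep the hyperedges that touch them (illustrative)."""
    n = H.shape[0]
    idx = np.sort(rng.choice(n, size=min(batch_size, n), replace=False))
    H_sub = H[idx]
    keep = H_sub.sum(axis=0) > 0   # drop hyperedges with no sampled vertex
    return idx, H_sub[:, keep]
```

With a sampler of this kind, each training step would call `hypergraph_conv` on `X[idx]` and the returned sub-incidence matrix, so memory would scale with the batch size rather than with the full hypergraph.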

hypergraph computation; hypergraph neural network; high-order correlation; large-scale data; vertex classification

Shuyi Ji, Yuxuan Wei, Qionghai Dai, Yue Gao

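School of Software, Tsinghua University, Beijing 100084

Beijing Laboratory of Brain and Cognitive Intelligence, Beijing 100084

Beijing National Research Center for Information Science and Technology, Beijing 100084

Institute for Brain and Cognitive Sciences, Tsinghua University, Beijing 100084

Department of Automation, Tsinghua University, Beijing 100084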

hypergraph computation; hypergraph neural network; high-order correlation; large-scale data; node classification

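National Natural Science Foundation of China; National Natural Science Foundation of China; Tsinghua University Initiative Scientific Research Program; Beijing Natural Science Foundation; Zhejiang Lab Open Fund

62021002; 62088102; 20227020007; 4222025; 2021KG0AB05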

2024

Science in China, Series F (中国科学F辑)
Chinese Academy of Sciences; National Natural Science Foundation of China

Indexed in: CSTPCD; Peking University Core Journals
Impact factor: 1.438
ISSN: 1674-5973
Year, volume (issue): 2024, 54(4)