欢迎访问中国科学院大学学报,今天是

中国科学院大学学报

• •    

基于直积图结构挖掘的图相似度计算方法*

张龙跃, 赵彤   

  1. 中国科学院大学数学科学学院,北京 100049
  • 收稿日期:2025-12-04 修回日期:2026-04-15 发布日期:2026-04-21
  • 通讯作者: E-mail: zhaotong@ucas.ac.cn
  • 基金资助:
    *国家自然科学基金 (12271504,T2341006)和教育部学科先导突破项目(JYB2025XDXM612)资助

Graph similarity computation method based on structure mining of direct product graphs

ZHANG Longyue, ZHAO Tong   

  1. School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2025-12-04 Revised:2026-04-15 Published:2026-04-21

摘要: 图相似度计算是一项基础而富有挑战性的任务,其通过量化图数据的内在结构与关联模式来揭示它们功能或行为上的相似性。该任务在生物信息学、软件工程及社交网络分析等领域扮演着核心角色,广泛应用于分子结构比对、代码抄袭检测及社群发现。尽管基于图神经网络(graph neural network,GNN)的方法有效近似了图编辑距离(graph edit distance,GED)等经典度量,但它们往往难以捕捉细粒度的拓扑关系。这种局限性限制了图嵌入向量的表征能力,导致模型无法充分反映图与图之间的结构差异。本文提出了一种基于直积图结构挖掘的图相似度计算方法(DSM-GSC)。该方法核心是构建直积图,以显式建模图对间所有潜在的节点配对关系。针对传统 GNN 嵌入表征能力不足的问题,引入了一个结构学习模块,旨在从直积图庞大的搜索空间中,甄别并提取出能表征关键拓扑对应的子结构,并利用这些子结构指导节点对齐与相似性的联合优化。这一机制使得模型突破了对全局嵌入的单一依赖,实现了基于关键结构模式的精准对齐。在多个基准数据集上的实验表明,该方法展现出了独特的优势,证实本文提出的图相似度计算新方法是可行且有效的。

关键词: 图数据, 图神经网络, 图相似度计算, 图结构学习

Abstract: Graph similarity computation is a fundamental yet challenging task that reveals functional or behavioral similarities by quantifying the intrinsic structures and relational patterns of graph data. It plays a pivotal role in applications such as molecular structure alignment in bioinformatics, code plagiarism detection in software engineering, and community detection in social network analysis. Although methods based on Graph Neural Networks (GNN) offer effective approximations for classical metrics like Graph Edit Distance (GED), they often struggle to capture fine-grained topological relationships. This limitation constrains the representational capacity of learned graph embeddings, rendering them unable to adequately reflect structural discrepancies between graphs. To address this, this paper proposes a Graph Similarity Computation method based on Direct Structure Mining (DSM-GSC). At its core, the method constructs a product graph to explicitly model all potential node pairing relationships between two graphs. To overcome the limited representational capacity of traditional GNN embeddings, we introduce a structural learning module designed to identify and extract discriminative substructures—representing key topological correspondences—from the vast search space of the product graph. These structural patterns are subsequently utilized to guide the joint optimization of node alignment and similarity computation. This mechanism enables the model to break the reliance on global embeddings alone, achieving precise alignment driven by key structural patterns. Extensive experiments on multiple benchmark datasets demonstrate the unique advantages of the proposed method, verifying the feasibility and effectiveness of this new paradigm for graph similarity computation.

Key words: graph data, graph neural networks, graph similarity computation, graph structure learning

中图分类号: