研究动态
Articles below are published ahead of final publication in an issue. Please cite articles in the following format: authors, (year), title, journal, DOI.

用于识别新癌症基因及其致病生物学机制的多视图表示学习。

Multiview representation learning for identification of novel cancer genes and their causative biological mechanisms.

发表日期:2024 Jul 25
作者: Jianye Yang, Haitao Fu, Feiyang Xue, Menglu Li, Yuyang Wu, Zhanhui Yu, Haohui Luo, Jing Gong, Xiaohui Niu, Wen Zhang
来源: BRIEFINGS IN BIOINFORMATICS

摘要:

肿瘤发生源于癌症基因的功能障碍,通过各种机制导致细胞增殖不受控制。建立完整的癌症基因目录将使精准肿瘤学成为可能。尽管基于图神经网络(GNN)的现有方法可以有效识别癌症基因,但它们在有效整合来自多个视图的数据和解释预测结果方面存在不足。为了解决这些缺点,提出了一种可解释的表示学习框架 IMVRL-GCN,用于从多视图数据中捕获共享和特定的表示,为癌症基因的识别提供重要的见解。实验结果表明,IMVRL-GCN 优于最先进的癌症基因识别方法和多个基线。此外,IMVRL-GCN 用于识别总共 74 个高置信度的新型癌症基因,多视图数据分析强调了共享、突变特异性和结构特异性表征在区分独特癌症基因中的关键作用。对它们区分能力背后机制的探索表明,共享表征与基因功能密切相关,而突变特异性和结构特异性表征分别与诱变倾向和功能协同相关。最后,我们对这些候选药物的深入分析为个体化治疗提供了潜在的见解:阿法替尼可以抵消许多突变驱动的风险,而靶向与癌症基因 SRC 的相互作用是减轻 NR3C1、RXRA、HNF4A、和 SP1。© 作者 2024 年。由牛津大学出版社出版。
Tumorigenesis arises from the dysfunction of cancer genes, leading to uncontrolled cell proliferation through various mechanisms. Establishing a complete cancer gene catalogue will make precision oncology possible. Although existing methods based on graph neural networks (GNN) are effective in identifying cancer genes, they fall short in effectively integrating data from multiple views and interpreting predictive outcomes. To address these shortcomings, an interpretable representation learning framework IMVRL-GCN is proposed to capture both shared and specific representations from multiview data, offering significant insights into the identification of cancer genes. Experimental results demonstrate that IMVRL-GCN outperforms state-of-the-art cancer gene identification methods and several baselines. Furthermore, IMVRL-GCN is employed to identify a total of 74 high-confidence novel cancer genes, and multiview data analysis highlights the pivotal roles of shared, mutation-specific, and structure-specific representations in discriminating distinctive cancer genes. Exploration of the mechanisms behind their discriminative capabilities suggests that shared representations are strongly associated with gene functions, while mutation-specific and structure-specific representations are linked to mutagenic propensity and functional synergy, respectively. Finally, our in-depth analyses of these candidates suggest potential insights for individualized treatments: afatinib could counteract many mutation-driven risks, and targeting interactions with cancer gene SRC is a reasonable strategy to mitigate interaction-induced risks for NR3C1, RXRA, HNF4A, and SP1.© The Author(s) 2024. Published by Oxford University Press.