Research Updates

QIGTD: identifying critical genes in the evolution of lung adenocarcinoma with tensor decomposition.

Published: 2024 Sep 04
Authors: Bolin Chen, Jinlei Zhang, Ci Shao, Jun Bian, Ruiming Kang, Xuequn Shang
Source: BioData Mining

Abstract:

Identifying critical genes is important for understanding the pathogenesis of complex diseases. Traditional studies typically compare changes in biomolecules between normal and disease samples, or detect important vertices in a single static biomolecular network, and so often overlook the dynamic changes that occur between different disease stages. However, investigating temporal changes in biomolecular networks and identifying critical genes are both essential for understanding the occurrence and development of diseases. This study proposes a novel method called Quantifying Importance of Genes with Tensor Decomposition (QIGTD). It first constructs a time series network by integrating both intra- and inter-temporal network information, preserving connections between networks at adjacent stages according to their local similarities. A tensor is employed to describe the connections of this time series network, and a 3-order tensor decomposition method is proposed to capture both the topological information of each network snapshot and the time series characteristics of the whole network. QIGTD is also a learning-free and efficient method that can be applied to datasets with a small number of samples. The effectiveness of QIGTD was evaluated on lung adenocarcinoma (LUAD) datasets, with three state-of-the-art methods, T-degree, T-closeness, and T-betweenness, employed as benchmarks. Numerical experimental results demonstrate that QIGTD outperforms these methods in terms of both precision and mAP.
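As a rough illustration of the tensor representation described above, the sketch below stacks per-stage adjacency matrices into a 3-order tensor and scores nodes with a rank-1 CP (CANDECOMP/PARAFAC) approximation computed by alternating power-method updates. This is not the QIGTD algorithm itself, whose details the abstract does not give: the function names, the toy edge lists, and the choice of a rank-1 CP model are all assumptions made for illustration, and the inter-temporal connections between adjacent stages are not modeled here.

```python
import numpy as np


def stack_snapshots(edge_lists, n_nodes):
    """Stack undirected snapshot graphs into a tensor X[i, j, t] (hypothetical helper)."""
    X = np.zeros((n_nodes, n_nodes, len(edge_lists)))
    for t, edges in enumerate(edge_lists):
        for i, j in edges:
            X[i, j, t] = X[j, i, t] = 1.0
    return X


def rank1_cp(X, n_iter=500, tol=1e-12, seed=0):
    """Rank-1 CP approximation X ~ lam * (a x b x c) via alternating updates.

    Assumes X is non-negative and has at least one non-zero entry.
    This is a generic technique, swapped in for illustration only.
    """
    rng = np.random.default_rng(seed)
    # Strictly positive starting vectors keep every iterate non-negative.
    b = rng.random(X.shape[1]) + 0.1
    c = rng.random(X.shape[2]) + 0.1
    b /= np.linalg.norm(b)
    c /= np.linalg.norm(c)
    lam = 0.0
    for _ in range(n_iter):
        # Update each factor in turn while holding the other two fixed.
        a = np.einsum("ijk,j,k->i", X, b, c)
        a /= np.linalg.norm(a)
        b = np.einsum("ijk,i,k->j", X, a, c)
        b /= np.linalg.norm(b)
        c = np.einsum("ijk,i,j->k", X, a, b)
        new_lam = np.linalg.norm(c)
        c /= new_lam
        converged = abs(new_lam - lam) < tol
        lam = new_lam
        if converged:
            break
    return lam, a, b, c


# Toy data (made up): node 0 is a hub whose neighbourhood grows across three stages.
snapshots = [
    [(0, 1), (0, 2)],
    [(0, 1), (0, 2), (0, 3), (1, 2)],
    [(0, 1), (0, 2), (0, 3), (0, 4), (1, 2)],
]
X = stack_snapshots(snapshots, n_nodes=5)
lam, a, b, c = rank1_cp(X)

# Assumed scoring rule: average the two node-mode factors.
scores = (a + b) / 2
ranking = np.argsort(-scores)
print("node ranking:", ranking)
print("stage weights:", c)
```

In this sketch `a` and `b` weight nodes and `c` weights stages, so one decomposition yields both a node ranking and a per-stage profile, which loosely mirrors the idea of capturing snapshot topology and time series characteristics together.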
Notably, of the top 50 genes, 29 have been verified to be highly related to LUAD according to the DisGeNET database, and 36 are significantly enriched in LUAD-related Gene Ontology (GO) terms, including nuclear division, mitotic nuclear division, chromosome segregation, organelle fission, and mitotic sister chromatid segregation. In conclusion, QIGTD effectively captures the temporal changes in gene networks and identifies critical genes. It provides a valuable tool for studying temporal dynamics in biological networks and can aid in understanding the underlying mechanisms of diseases such as LUAD. © 2024. The Author(s).
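To make the evaluation metrics concrete: 29 verified genes among the top 50 corresponds to a precision at 50 of 29/50 = 0.58. The sketch below shows one common way to compute precision at k and average precision for a ranked gene list. The gene labels are placeholders rather than results from the paper, and normalizing average precision by the number of relevant genes found in the ranking is an assumption, since the abstract does not state which mAP definition the authors use.

```python
def precision_at_k(ranked, relevant, k):
    """Fraction of the top-k ranked items that are in the relevant set."""
    top = ranked[:k]
    return sum(1 for gene in top if gene in relevant) / k


def average_precision(ranked, relevant):
    """Mean of precision values taken at each rank where a relevant item appears.

    Assumption: normalized by the number of relevant items found in `ranked`.
    """
    hits = 0
    total = 0.0
    for rank, gene in enumerate(ranked, start=1):
        if gene in relevant:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


# Placeholder labels, not genes reported in the study.
ranked_genes = ["g1", "g2", "g3", "g4", "g5"]
known_disease_genes = {"g1", "g3", "g4"}

p_at_5 = precision_at_k(ranked_genes, known_disease_genes, k=5)
ap = average_precision(ranked_genes, known_disease_genes)
print(p_at_5, ap)
```

Mean average precision (mAP) would then be the mean of such average precision values over several rankings or reference sets.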