研究动态
Articles below are published ahead of final publication in an issue. Please cite articles in the following format: authors, (year), title, journal, DOI.

Cox 回归和广义 Cox 回归模型与机器学习在预测弥漫性大 B 细胞淋巴瘤儿童生存方面的比较。

Comparison of Cox regression and generalized Cox regression models to machine learning in predicting survival of children with diffuse large B-cell lymphoma.

发表日期:2024 Jul 31
作者: Jia-Jia Qin, Xiao-Xiao Zhu, Xi Chen, Wei Sang, Ying-Liang Jin
来源: Disease Models & Mechanisms

摘要:

全球儿童弥漫性大 B 细胞淋巴瘤 (DLBCL) 的发病率正在增加。由于儿童免疫系统不成熟,DLBCL的预后与成人有较大差异。我们的目标是利用多中心大型回顾性分析对该疾病的预后进行研究。为了进行回顾性分析,我们从监测、流行病学和最终结果(SEER)数据库中检索了数据,该数据库包括 836 名 18 岁以下的 DLBCL 患者,他们在 22 岁时接受治疗。 2000年至2019年中心机构的患者按7:3的比例随机分为模型组和验证组。使用Cox逐步回归、广义Cox回归和极限梯度提升(XGBoost)来筛选所有变量。选定的预后变量用于通过 Cox 逐步回归构建列线图。使用 XGBoost 对变量的重要性进行排名。通过C指数、受试者工作特征(ROC)曲线下面积(AUC)、敏感性和特异性来评估模型的预测性能。使用校准曲线评估模型的一致性。通过决策曲线分析(DCA)验证了模型的临床实用性。ROC曲线显示,除非比例风险和非对数线性(NPHNLL)模型外,所有模型的AUC值均在0.7以上,表明准确性较高。校准曲线和DCA进一步证实了强大的预测性能和临床实用性。在本研究中,我们通过将XGBoost与Cox和广义Cox回归模型相结合,成功构建了机器学习模型。这种综合方法从多个维度准确预测DLBCL儿童的预后。这些发现为准确的临床预后预测提供了科学依据。2024转化癌症研究。版权所有。
The incidence of diffuse large B-cell lymphoma (DLBCL) in children is increasing globally. Due to the immature immune system in children, the prognosis of DLBCL is quite different from that of adults. We aim to use the multicenter large retrospective analysis for prognosis study of the disease.For our retrospective analysis, we retrieved data from the Surveillance, Epidemiology and End Results (SEER) database that included 836 DLBCL patients under 18 years old who were treated at 22 central institutions between 2000 and 2019. The patients were randomly divided into a modeling group and a validation group based on the ratio of 7:3. Cox stepwise regression, generalized Cox regression and eXtreme Gradient Boosting (XGBoost) were used to screen all variables. The selected prognostic variables were used to construct a nomogram through Cox stepwise regression. The importance of variables was ranked using XGBoost. The predictive performance of the model was assessed by using C-index, area under the curve (AUC) of receiver operating characteristic (ROC) curve, sensitivity and specificity. The consistency of the model was evaluated by using a calibration curve. The clinical practicality of the model was verified through decision curve analysis (DCA).ROC curve demonstrated that all models except the non-proportional hazards and non-log linearity (NPHNLL) model, achieved AUC values above 0.7, indicating high accuracy. The calibration curve and DCA further confirmed strong predictive performance and clinical practicability.In this study, we successfully constructed a machine learning model by combining XGBoost with Cox and generalized Cox regression models. This integrated approach accurately predicts the prognosis of children with DLBCL from multiple dimensions. These findings provide a scientific basis for accurate clinical prognosis prediction.2024 Translational Cancer Research. All rights reserved.