Machine Learning Algorithm to Explore Patients With Heterogeneous Treatment Effects of Clinically Significant CMV Infection and Non-Relapse Mortality After HSCT

利用机器学习算法探索造血干细胞移植后临床显著性巨细胞病毒感染和非复发死亡率患者的异质性治疗效果。

阅读：2

作者：Toya,Takashi,Nakajima,Yujiro,Hara,Konan,Kaito,Satoshi,Nishida,Tetsuya,Uchida,Naoyuki,Shingai,Naoki,Takeda,Wataru,Ozawa,Yukiyasu,Tanaka,Masatsugu,Yoshihara,Satoshi,Katayama,Yuta,Eto,Tetsuya,Sawa,Masashi,Ota,Shuichi,Ohigashi,Hiroyuki,Takada,Satoru,Kataoka,Keisuke,Kanda,Junya,Fukuda,Takahiro,Ogata,Masao,Taguchi,Ayumi,Atsuta,Yoshiko

期刊：	eJHaem	影响因子：	1.200
时间：	2025	起止号：	2025 Aug;6(4):e70117
doi：	10.1002/jha2.70117	研究方向：	细胞生物学、炎症/感染、微生物学、发育与干细胞
疾病类型：	巨细胞病毒感染	细胞类型：	干细胞

Abstract

INTRODUCTION: Clinically significant cytomegalovirus infection (csCMVi) and non-relapse mortality (NRM) remain serious concerns after allogeneic hematopoietic stem cell transplantation (HSCT), but subpopulations with heterogeneous treatment effects (HTEs) is unclear. Although machine learning (ML) algorithms have recently been applied to HSCT, the methodology has not been well elucidated. METHODS: We developed a ML algorithm which combined weighting procedures and left-truncated and right-censored trees based on classification and regression tree algorithms to fit survival data with time-varying covariates and competing risks comprehensively. The Japanese large-scale registry data were applied to the algorithm to explore subpopulations with HTEs of csCMVi and NRM after HSCT. Its performance was evaluated by comparing their c-indices with those of the conventional Fine-Gray model. RESULTS: A total of 10,480 patients were divided into training (75%) and test (25%) cohorts; the training cohort was used to develop the ML model. Using the model, patient CMV-seropositivity, patient age, and acute graft-versus-host disease were identified as important predictors of csCMVi. In addition, the patients were successfully classified by the estimated cumulative incidence of csCMVi, which varied from 22.7% at 0.5 year to 82.7%. This model also depicts interpretable survival trees in various settings. Similarly, the patients can be also classified based on the estimated 3-year NRM, which varied from 8.0% to 48.5%. C-indices of the ML and the Fine-Gray model using the test cohort showed comparable performance. CONCLUSION: A reliable, explainable, and interpretable ML model was developed to explore subpopulations with HTEs of csCMVi and NRM after HSCT. Trial Registration: The authors have confirmed clinical trial registration is not needed for this submission.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。