Machine Learning Prediction of Progression in Forced Expiratory Volume in 1 Second in the COPDGene® Study

Authors: Adel Boueiz, Zhonghui Xu, Yale Chang, Aria Masoomi, Andrew Gregory, Sharon M Lutz, Dandi Qiao, James D Crapo, Jennifer G Dy, Edwin K Silverman, Peter J Castaldi

Journal:	Chronic Obstructive Pulmonary Diseases: Journal of the COPD Foundation	Impact factor:	2.000
Year:	2022	Citation:	2022 Jul 29;9(3):349-365
DOI:	10.15326/jcopdf.2021.0275	Target:	COPD
Disease type:	Chronic obstructive pulmonary disease

Abstract

BACKGROUND: The heterogeneous nature of chronic obstructive pulmonary disease (COPD) complicates the identification of predictors of disease progression. We aimed to improve the prediction of disease progression in COPD by using machine learning and incorporating a rich dataset of phenotypic features.

METHODS: We included 4496 smokers with available data from their enrollment and 5-year follow-up visits in the COPD Genetic Epidemiology (COPDGene®) study. We constructed linear regression (LR) and supervised random forest models to predict 5-year progression in forced expiratory volume in 1 second (FEV1) from 46 baseline features. Using cross-validation, we randomly partitioned participants into training and testing samples. We also validated the results using the COPDGene 10-year follow-up visit.

RESULTS: Predicting the change in FEV1 over time is more challenging than simply predicting the future absolute FEV1 level. For random forest, R-squared was 0.15 and the area under the receiver operating characteristic (ROC) curve for predicting participants in the top quartile of observed progression was 0.71 in the testing sample; the corresponding values in the validation sample were 0.10 and 0.70. Random forest performed slightly better than LR. Accuracy was best for participants in Global initiative for chronic Obstructive Lung Disease (GOLD) grades 1-2, and accurate prediction was harder to achieve in advanced stages of the disease. Predictive variables differed in their relative importance, and their importance also varied by GOLD grade.

CONCLUSION: Random forest, along with deep phenotyping, predicts FEV1 progression with reasonable accuracy, although there is substantial room for improvement in future models. This prediction model facilitates the identification of smokers at increased risk for rapid disease progression. Such findings may be useful in the selection of patient populations for targeted clinical trials.
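The two performance measures reported in the Results, R-squared and the ROC area for identifying participants in the top quartile of observed progression, can be illustrated with a minimal sketch. This is not the authors' code: the function names, the synthetic data, and the convention that a larger value means faster FEV1 decline are all assumptions made for illustration, and the "predicted" values are a stand-in for the output of any fitted model.

```python
import random
from statistics import mean


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    obs_mean = mean(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - obs_mean) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def top_quartile_labels(observed_decline):
    """Return 1 for values in the top quartile of observed decline, else 0."""
    ranked = sorted(observed_decline, reverse=True)
    # Value of the last participant who still falls in the top quarter.
    cutoff = ranked[max(len(ranked) // 4, 1) - 1]
    return [1 if d >= cutoff else 0 for d in observed_decline]


def roc_auc(labels, scores):
    """AUC as the probability that a positive case outscores a negative one.

    Ties count as 0.5 (Mann-Whitney formulation). Quadratic in sample
    size, which is fine for a small illustration.
    """
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Synthetic, hypothetical data: 5-year FEV1 decline in mL,
# where a larger number means faster progression (an assumption).
random.seed(0)
n = 400
signal = [random.gauss(200, 60) for _ in range(n)]
observed = [s + random.gauss(0, 120) for s in signal]
# Stand-in for model predictions: a shrunken version of the true signal.
predicted = [200 + 0.3 * (s - 200) for s in signal]

labels = top_quartile_labels(observed)
r2 = r_squared(observed, predicted)
auc = roc_auc(labels, predicted)
print(f"R-squared: {r2:.2f}, top-quartile AUC: {auc:.2f}")
```

The sketch shows why the two measures can diverge: R-squared penalizes every error in the predicted magnitude of change, whereas the ROC area depends only on how well the predictions rank rapid progressors above everyone else.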