Machine-learning based prediction for high health care utilizers using a multi-institution registry

利用多机构注册库,通过机器学习预测医疗资源利用率高的患者。

阅读:1

Abstract

BACKGROUND: Rising healthcare costs worldwide have spurred interest in using machine learning to pinpoint major healthcare utilizers for population health efforts. Previous studies have focused on individuals who impose the highest financial burden, such as targeting high-need high-cost utilizers, a small group with limited potential for cost reduction. Therefore, we developed models to predict future healthcare utilization across various thresholds. METHODS: We leveraged data from a multi-institutional diabetes database to develop binary classification models capable of predicting healthcare utilization in the following year. The models predicted six different outcomes: patients with an annual total length of stay of ≥ 7, ≥14, and ≥30 days, and emergency department (ED) attendance of ≥ 3, ≥5, and ≥10 visits. To mitigate class imbalance, we applied random and synthetic minority oversampling techniques and compared against models trained without oversampling. Models were trained with 2019 data and tested on unseen data from 2020 and 2021. RESULTS: Models trained with random oversampling, including logistic regression, multivariate adaptive regression splines, boosted trees, and multilayer perceptron, consistently demonstrated high AUC (>0.80) and sensitivity (>0.60) across both training-validation and test datasets. Addressing class imbalance was found to be crucial, as models without oversampling displayed satisfactory AUC (>0.80) but significantly lower sensitivity (<0.10). Key predictors identified included age, number of ED visits in the present year, chronic kidney disease stage, inpatient bed days in the present year, and mean HbA1c levels. CONCLUSIONS: Our machine learning models successfully predict high service level utilization with strong performance, offering valuable tools for policymakers and health planners to develop targeted health initiatives based on patient utilization patterns. KEY MESSAGES: • Machine learning models can be leveraged to predict future healthcare utilization and facilitate interventional program development. • Annual total length of stay and total emergency department visits are useful service level indicators.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。