Machine Learning-Based Non-Invasive Prediction of Metabolic Dysfunction-Associated Steatohepatitis in Obese Patients: A Retrospective Study

基于机器学习的肥胖患者代谢功能障碍相关性脂肪性肝炎的非侵入性预测:一项回顾性研究

阅读:1

Abstract

Objectives: We aimed to develop and validate machine learning (ML) models that integrate clinical and laboratory data for the non-invasive prediction of metabolic dysfunction-associated steatohepatitis (MASH) in an obese population. Methods: In this retrospective study, clinical and laboratory data were collected from obese patients undergoing bariatric surgery. The cohort was divided using stratified random sampling, and optimal features were selected with SHapley Additive exPlanations (SHAP). Various ML models, including K-nearest neighbors, linear support vector machine, radial basis function support vector machine, Gaussian process, random forest, multilayer perceptron, adaptive boosting, and naïve Bayes, were developed through cross-validation and hyperparameter tuning. Diagnostic performance was assessed via the area under the curve (AUC) in both training and validation sets. Results: A total of 558 patients were analyzed, with 390 in the training set and 168 in the validation set. In the training cohort, the median age was 35 years, the median body mass index (BMI) was 39.8 kg/m(2), 39.0% were male, 37.9% had diabetes mellitus, and 62.8% were diagnosed with MASH. The validation cohort had a median age of 34.1 years, a median BMI of 42.5 kg/m(2), 41.7% male, 32.7% with diabetes, and 39.9% with MASH. Among the models, the random forest achieved the highest performance among the models with AUC values of 0.94 in the training set and 0.88 in the validation set. The Gaussian process model attained an AUC of 0.97 in the training cohort but 0.79 in the validation cohort, while the other models achieved AUC values ranging from 0.63 to 0.88 in the training cohort and 0.62 to 0.75 in the validation set. Conclusions: ML models, particularly the random forest, effectively predict MASH using readily available data, offering a promising non-invasive alternative to conventional serological scoring. Prospective studies and external validations are needed to further establish clinical utility.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。