Transformer-based multimodal model for estimation of appendicular lean mass using incomplete chest radiographs and electronic health record

基于Transformer的多模态模型,利用不完整的胸部X光片和电子健康记录估算四肢瘦体重

阅读:2

Abstract

BACKGROUND: Sarcopenia is a muscle disease that increases the risk of falls, fractures, and mortality. Appendicular lean mass (ALM) assessment is central to its diagnosis, but standard methods like dual-energy X-ray absorptiometry (DXA) have accessibility and cost issues. Previous artificial intelligence (AI) studies for sarcopenia assessment have been limited to single modalities and have not adequately addressed missing data modalities. This study aimed to develop and validate a multimodal AI model using frontal and lateral chest radiographs and electronic health record (EHR) data to estimate ALM and detect low muscle mass, and to investigate the robustness of a transformer-based algorithm to missing modalities. METHODS: This model development and validation study adhered to TRIPOD guidelines. The derivation cohort included 3,295 observations from 1,524 participants (mean age 61.9 years, 30% female). External validation was performed on an independent cohort of 3,771 observations from 1,976 participants (mean age 58.4 years, 59% female). Our multimodal model uses a transformer-based TabPFN to predict ALM from 75 features, which integrate 18 features extracted from each of the frontal and lateral chest radiographs by TorchXray with 39 features from EHR data (demographics and blood tests). Model performance for ALM estimation was evaluated using the root mean square error (RMSE), and mean absolute error (MAE), and Pearson correlation coefficient. Estimation of appendicular lean mass performance was assessed using the area under the receiver operating characteristic curve (AUROC). RESULTS: The prevalence of low muscle mass was 30% in the derivation set and 29% in the external validation set. The multimodal model achieved high accuracy in ALM estimation, with an RMSE of 1.23 kg, MAE of 0.96 kg, and correlation coefficient of 0.958 in the internal test set. In external validation, the model yielded an RMSE of 1.87 kg, and MAE of 1.41 kg, and correlation of 0.930. For estimation of appendicular lean mass in the external validation set, the model yielded an AUROC of 0.825 for males and 0.730 for females. Sensitivity analyses demonstrated the model’s robustness to missing modalities, maintaining stable prediction errors in participants missing part of the EHR (RMSE: 2.00 kg), frontal (RMSE: 1.99 kg), or lateral (RMSE: 2.08 kg) chest radiographs. CONCLUSIONS: Our transformer-based multimodal AI model accurately estimates ALM and detects low muscle mass from routinely collected clinical data, outperforming unimodal approaches. The model demonstrated robustness even with missing data modalities, supporting its potential utility as a screening tool in clinical settings where data completeness varies. This approach has the potential to serve as an accessible screening tool for low muscle mass, especially in settings where DXA is not readily available. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12967-026-08079-0.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。