Quantifying lumbar paraspinal intramuscular fat: Accuracy and reliability of automated thresholding models

量化腰椎旁肌内脂肪:自动阈值模型的准确性和可靠性

阅读:1

Abstract

BACKGROUND: The reported level of lumbar paraspinal intramuscular fat (IMF) in people with low back pain (LBP) varies considerably across studies using conventional T(1)- and T(2)-weighted magnetic resonance imaging (MRI) sequences. This may be due to the different thresholding models employed to quantify IMF. In this study we investigated the accuracy and reliability of established (two-component) and novel (three-component) thresholding models to measure lumbar paraspinal IMF from T(2)-weighted MRI. METHODS: In this cross-sectional study, we included MRI scans from 30 people with LBP (50% female; mean (SD) age: 46.3 (15.0) years). Gaussian mixture modelling (GMM) and K-means clustering were used to quantify IMF bilaterally from the lumbar multifidus, erector spinae, and psoas major using two and three-component thresholding approaches (GMM(2C); K-means(2C); GMM(3C); and K-means(3C)). Dixon fat-water MRI was used as the reference for IMF. Accuracy was measured using Bland-Altman analyses, and reliability was measured using ICC(3,1). The mean absolute error between thresholding models was compared using repeated-measures ANOVA and post-hoc paired sample t-tests (α = 0.05). RESULTS: We found poor reliability for K-means(2C) (ICC(3,1) ≤ 0.38), moderate to good reliability for K-means(3C) (ICC(3,1) ≥ 0.68), moderate reliability for GMM(2C) (ICC(3,1) ≥ 0.63) and good reliability for GMM(3C) (ICC(3,1) ≥ 0.77). The GMM (p < .001) and three-component models (p < .001) had smaller mean absolute errors than K-means and two-component models, respectively. None of the investigated models adequately quantified IMF for psoas major (ICC(3,1) ≤ 0.01). CONCLUSIONS: The performance of automated thresholding models is strongly dependent on the choice of algorithms, number of components, and muscle assessed. Compared to Dixon MRI, the GMM performed better than K-means and three-component performed better than two-component models for quantifying lumbar multifidus and erector spinae IMF. None of the investigated models accurately quantified IMF for psoas major. Future research is needed to investigate the performance of thresholding models in a more heterogeneous clinical dataset and across different sites and vendors.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。