Diagnostic accuracy of ChatGPT for 12-lead ECG-based localisation of ventricular ectopic foci prior to catheter ablation

ChatGPT在导管消融术前基于12导联心电图定位室性异位灶的诊断准确性

阅读:1

Abstract

BACKGROUND: Precise pre-procedural localisation of ventricular ectopic (VE) foci shortens mapping time, reduces fluoroscopy, and improves ablation success. Large language models such as ChatGPT offer instant, free-text clinical support; however, their accuracy in ECG-based VE localisation is unknown. METHODS: In this single-centre pilot study, we assessed the diagnostic accuracy of ChatGPT in 50 consecutive adults (average age: 43 ± 14 years; 58% women) scheduled for first-time VE ablation. ChatGPT served as the index test, and invasive electroanatomical mapping during the ablation served as the reference standard. A blinded electrophysiologist converted each index 12-lead ECG into a structured textual description of QRS morphology. ChatGPT-4o (temperature 0.2) was then tasked with assigning one of five anatomical origins (RVOT, LVOT, papillary muscle, fascicular, and epicardial). Predictions were compared with electro-anatomical mapping during catheter ablation, and agreement was measured using Cohen's κ (κ). RESULTS: Electro-anatomical mapping identified 30 RVOT, 11 LVOT, 4 papillary, 1 fascicular, and 4 epicardial foci. ChatGPT correctly localised 17/50 cases (34%), yielding an overall Cohen's κ of -0.02 (95% CI -0.18 to 0.14). Sensitivity/specificity was 40%/55% for the RVOT and 36%/62% for the LVOT; no fascicular or epicardial origins were correctly predicted. The performance of ChatGPT did not differ based on the presence of structural heart disease (p = 0.43). The duration of the procedure and the acute ablation success rate (96%) were unaffected by the accuracy of ChatGPT. CONCLUSION: Freetext querying of ChatGPT failed to provide clinically meaningful VE localisation, performing no better than chance and markedly below published ECG-based algorithms. This likely reflects the model's lack of domain-specific training and its reliance on purely text-based reasoning without direct access to ECG signals. Current general-purpose language models should not be relied upon for procedural planning in VE ablation; future work must integrate multimodal training and domain-specific optimisation before LLMs can augment electrophysiology practice.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。