Explainable Patient-Level Cognitive Impairment Screening via Temporal, Semantic, and Psycholinguistic Multimodal AI

基于时间、语义和心理语言学多模态人工智能的可解释患者级认知障碍筛查

阅读：1

作者：Abdullah,Fatima,Zulaikha,Ruiz,Miguel Jesús Torres,Espinosa-Sosa,Osvaldo,Sánchez-Mejorada,Carlos Guzmán,Téllez,Rolando Quintero,Rodríguez,José Luis Oropeza,Sidorov,Grigori

期刊：	Journal of Intelligence	影响因子：	3.400
时间：	2026	起止号：	2026 Apr 15;14(4)
doi：	10.3390/jintelligence14040066	研究方向：	神经科学

Abstract

Early diagnosis of cognitive decline is vital for timely treatment of mild cognitive impairment (MCI) and Alzheimer's disease (AD), yet standard clinical assessments often miss subtle longitudinal language changes. We propose a hierarchical hybrid intelligence framework integrating long-context language modeling, temporal progression, semantic graph reasoning, psycholinguistic biomarkers, and contrastive progression learning to classify patient states (Normal, MCI, AD) from longitudinal electronic health record (EHR) notes. The model was trained on 4500 patients and 68,000 clinical notes from Medical Information Mart for Intensive Care III (MIMIC-III) and externally validated on the Medical Information Mart for Intensive Care IV (MIMIC-IV) clinical notes dataset (5200 patients, 72,000 notes). Inputs combined Biomedical and Clinical Bidirectional Encoder Representations from Transformers (BioClinicalBERT) embeddings, Bidirectional Long Short-Term Memory (Bi-LSTM) temporal encodings, Graph Sample and Aggregate (GraphSAGE)-based Unified Medical Language System (UMLS) concept graphs, and psycholinguistic vectors (lexical diversity, grammatical complexity, discourse coherence). On the MIMIC-III hold-out set, the model achieved 99.999% accuracy, a macro F1-score of 0.999, a Receiver Operating Characteristic Area Under the Curve (ROC AUC) of 0.999, and a temporal stability variance of 0.0008. Monte Carlo cross-validation (10,000 folds) yielded 99.997±0.003% accuracy and 0.999±0.001 macro F1. Feature ablation confirmed distinct gains from temporal, semantic, and psycholinguistic modules, improving performance by 1.1% over text-only baselines. Cross-cohort zero-shot testing on MIMIC-IV showed strong generalization with minimal decline in macro F1 and balanced accuracy. Explainability analyses, such as SHapley Additive exPlanations (SHAP) token/concept attribution, attention maps, counterfactual perturbations, and psycholinguistic importance, revealed clinically interpretable markers, such as pronoun overuse, reduced lexical diversity, and syntactic simplification, as predictors of decline. Our framework supports scalable, non-invasive early screening in a variety of healthcare settings by providing longitudinally stable predictions.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。