Abstract
BACKGROUND: This study aimed to identify risk factors and develop statistical models to predict cerebral amyloid angiopathy (CAA). METHODS: Associations between demographic, cognition, cardiovascular, and AD-related neuropathology and CAA were analyzed using data from three longitudinal cohorts of aging and dementia. Logistic regression with LASSO was used for feature selection. Predictive performance was assessed using ROC-AUC and decision curve analysis (DCA). Predictor importance was quantified using Shapley Variable Importance Cloud (ShapleyVIC), which provides a robust estimate of individual feature contribution in prediction. RESULTS: Stratified analyses showed that the strength of association between episodic memory or tau pathology and CAA was greater in males, while the amyloid pathology-CAA association was stronger in females. Among APOE ε4 carriers, the amyloid/tau pathology-CAA associations were pronounced. Episodic memory and amyloid/tau pathology were identified as key factors in our predictive model. DCA demonstrated the model’s clinical utility, and SHAP values confirmed the importance of individual features. CONCLUSION: We identified sex- and APOE-specific risk factors for CAA and developed models to support CAA risk stratification. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13195-025-01948-8.