Abstract
OBJECTIVE: This study aims to develop and validate an interpretable deep learning framework that leverages self-supervised time reversal (TR) pretraining to identify consistent, biologically plausible functional network biomarkers across multiple neurological and psychiatric disorders. METHODS: We pretrained a hierarchical LSTM model using a TR pretext task on the Human Connectome Project (HCP) dataset. The pretrained weights were transferred to downstream classification tasks on five clinical datasets (FBIRN, BSNIP, ADNI, OASIS, and ABIDE) spanning schizophrenia, Alzheimer's disease, and autism spectrum disorder. After fine-tuning, we extracted latent features and employed a logistic regression probing analysis to decode class-specific functional network contributions. Models trained from scratch without pretraining served as a baseline. Statistical tests (one-sample and two-sample t-tests) were performed on the latent features to assess their discriminative power and consistency. RESULTS: TR pretraining consistently improved classification performance in four out of five datasets, with AUC gains of up to 5.3%, particularly in data-scarce settings. Probing analyses revealed biologically meaningful and consistent patterns: schizophrenia was associated with reduced auditory network activity, Alzheimer's with disrupted default mode and cerebellar networks, and autism with sensorimotor anomalies. TR-pretrained models produced more statistically significant latent features and demonstrated higher consistency across datasets (e.g., Pearson correlation = 0.9003 for schizophrenia probing vs. -0.67 for non-pretrained). In contrast, non-pretrained models showed unstable performance and inconsistent feature importance. CONCLUSIONS: Time Reversal pretraining enhances both the performance and interpretability of deep learning models for fMRI classification. By enabling more stable and biologically plausible representations, TR pretraining supports clinically relevant insights into disorder-specific network disruptions. This study demonstrates the utility of interpretable self-supervised models in neuroimaging, offering a promising step toward transparent and trustworthy AI applications in psychiatry.