Abstract
Background: Emotion recognition using electroencephalography (EEG) offers a non-invasive means of measuring brain responses to affective stimuli. However, since EEG signals can vary significantly between subjects, developing a deep learning model capable of accurately predicting emotions is challenging. Methods: To address that challenge, this study proposes a deep learning approach that fuses EEG features with demographic information, specifically age, sex, and nationality, using an attention-based mechanism that learns to weigh each modality during classification. The method was evaluated using three benchmark datasets: SEED, SEED-FRA, and SEED-GER, which include EEG recordings of 31 subjects of different demographic backgrounds. Results: We compared a baseline model trained solely on the EEG-derived features against an extended model that fused the subjects' EEG and demographic information. Including demographic information improved the performance, achieving 80.2%, 80.5%, and 88.8% for negative, neutral, and positive classes. The attention weights also revealed different contributions of EEG and demographic inputs, suggesting that the model learns to adapt based on subjects' demographic information. Conclusions: These findings support integrating demographic data to enhance the performance and fairness of subject-independent EEG-based emotion recognition models.