Abstract
Background/Objectives: Wound infections cause significant morbidity, yet current diagnostics rely on time-consuming microbial culture. Volatile organic compounds (VOCs) from bacterial metabolism offer potential for early diagnosis. This study aimed to validate the volatile metabolites profiled by gas chromatography-ion mobility spectrometry (GC-IMS) combined with machine learning for rapid identification of wound infections and certain bacterial infections. Methods: Headspace of clinical wound samples were analyzed using GC-IMS. Volatile metabolite profiles were compared between infected and non-infected groups and between Escherichia coli (E. coli)-positive and negative samples. Partial least squares discriminant analysis (PLS-DA) and Mann-Whitney U test were used for preliminary screening with variable importance in projection (VIP) > 1 and p-value < 0.05. Three machine learning algorithms, namely support vector machine (SVM), logistic regression (LR), and random forest (RF), were trained on the selected features for classification, using 5-fold cross-validation with 10 repeated runs. Model performance was assessed using key evaluation metrics, including accuracy, sensitivity, specificity, the area under the curve (AUC) and feature importance ranking to identify the most relevant biomarkers. Results: A total of 19 volatile metabolites associated with clinical wound samples were identified. The RF model achieved 90.15% sensitivity and 0.91 AUC for bacterial infection detection. For E. coli identification, LR reached 85.35% sensitivity and 0.89 AUC. Potential volatile metabolic biomarkers including elevated 3-methyl-1-butanol, 2-methyl-1-butanol, and ethyl hexanoate for identifying bacterial infection were selected through the cross-validation results of the three algorithms. Conclusions: Untargeted metabolomics by GC-IMS effectively captures infection-specific volatile metabolic signatures in complex wound samples. Integration with machine learning enables rapid, high-accuracy diagnosis of bacterial infections and E. coli identification at point of care. This approach addresses clinical metabolomics translational challenges by providing a portable and cost-effective method, potentially reducing antibiotic misuse through more timely and targeted therapy.