Abstract
BACKGROUND: Enterobacter cloacae complex (ECC) is an important nosocomial pathogen and consists of multiple similar species. The taxonomy of ECC has been consecutively updated, adding to its identification difficulty. METHODS: A total of 92 ECC strains isolated from bloodstream infections during 2015–2020 were collected from a tertiary hospital in China. All the strains were identified by Vitek 2 Compact and Vitek MS and then subjected to whole genome sequencing (WGS) for average nucleotide identity (ANI) analysis. Surfaced-enhanced Raman spectroscopy (SERS) with a set of machine learning algorithms was applied in identifying species within ECC. RESULTS: Seven species were identified through ANI, including 28 E. hormaechei subsp. steigerwaltii, 17 E. hormaechei subsp. xiangfangensis, 12 E. cloacae, 11 each of E. hormaechei subsp. hoffmannii and E. bugandensis, seven E. kobei and six E. roggenkampii. The Vitek 2 compact indistinguishably identified all the strains as ECC and Vitek MS correctly identified one strain of E. kobei while achieving ambiguous results for all the other isolates. SERS combined with XGBoost model achieved 96.9% accuracy with an area under the ROC curve value of 0.998 in the identification of ECC. CONCLUSION: SERS coupled with machine learning algorithms holds a promising potential to acquire early prediction of ECC, outperforming the capabilities of other methods. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12866-026-04769-3.