Extracting True Virus SERS Spectra and Augmenting Data for Improved Virus Classification and Quantification

Abstract

Surface-enhanced Raman spectroscopy (SERS) is a transformative tool for infectious disease diagnostics, offering rapid and sensitive species identification. However, background spectra in biological samples complicate analyte peak detection, raise the limit of detection, and hinder data augmentation. To address these challenges, we developed a deep learning framework that uses dual neural networks to extract true virus SERS spectra and estimate concentration coefficients in water for 12 different respiratory viruses. The extracted spectra closely resembled those measured at the highest viral concentration, validating their accuracy. Using these spectra and the derived concentration coefficients, we augmented spectral data sets across varying virus concentrations in water. XGBoost models trained on these augmented data sets achieved an overall classification and concentration prediction accuracy of 92.3%, with a coefficient of determination (R²) > 0.95. The extracted spectra and coefficients were also used to augment data sets in saliva backgrounds. When tested against real virus-in-saliva spectra, XGBoost models trained on the augmented spectra achieved 91.9% accuracy in classification and concentration prediction, with R² > 0.9, demonstrating the robustness of the approach. By delivering clean, uncontaminated spectra, this methodology can significantly improve species identification, differentiation, and quantification, and advance SERS-based detection and diagnostics.
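
The augmentation step described above can be illustrated with a minimal sketch. It assumes a simple linear additive model in which a measured spectrum is the extracted virus spectrum scaled by a concentration coefficient, plus a background spectrum and optional Gaussian noise. The abstract does not state the mixing model, so this is an illustrative assumption and not the authors' implementation; the function name `augment_spectra` and its parameters are hypothetical.

```python
import numpy as np


def augment_spectra(virus_spectrum, coefficient, backgrounds, noise_std=0.0, rng=None):
    """Synthesize spectra as coefficient * virus_spectrum + background (+ noise).

    ASSUMPTION: a linear additive mixing model, which the abstract does not confirm.
    virus_spectrum: 1-D array, the extracted (background-free) virus spectrum.
    coefficient:    scalar concentration coefficient for the target concentration.
    backgrounds:    2-D array (n_samples, n_wavenumbers) of background spectra,
                    e.g. measured in water or saliva.
    """
    rng = np.random.default_rng() if rng is None else rng
    virus_spectrum = np.asarray(virus_spectrum, dtype=float)
    backgrounds = np.atleast_2d(np.asarray(backgrounds, dtype=float))
    if backgrounds.shape[1] != virus_spectrum.shape[0]:
        raise ValueError("backgrounds and virus_spectrum must share the wavenumber axis")
    # Optional zero-mean Gaussian noise, one independent draw per synthetic spectrum.
    noise = rng.normal(0.0, noise_std, size=backgrounds.shape)
    # Broadcasting adds the scaled virus spectrum to every background row.
    return coefficient * virus_spectrum + backgrounds + noise


# Toy usage: one synthetic virus peak combined with two flat backgrounds.
virus = np.array([0.0, 1.0, 0.0])
bg = np.array([[1.0, 1.0, 1.0], [2.0, 2.0, 2.0]])
synthetic = augment_spectra(virus, coefficient=0.5, backgrounds=bg)
```

Under this assumed model, swapping the water backgrounds for saliva backgrounds would yield virus-in-saliva training spectra without new virus measurements, which is the kind of reuse the abstract describes.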
