Graph Neural Networks vs. Traditional QSAR: A Comprehensive Comparison for Multi-Label Molecular Odor Prediction

图神经网络与传统QSAR:多标签分子气味预测的全面比较

阅读:1

Abstract

Molecular odor prediction represents a fundamental challenge in computational chemistry with significant applications in fragrance design, food science, and chemical safety assessment. While traditional Quantitative Structure-Activity Relationship (QSAR) methods rely on hand-crafted molecular descriptors, recent advances in graph neural networks (GNNs) enable direct end-to-end learning from molecular graph structures. However, systematic comparison between these approaches for multi-label odor prediction remains limited. This study presents a comprehensive evaluation of traditional QSAR methods compared with modern GNN approaches for multi-label molecular odor prediction. Using the GoodScent dataset containing 3304 molecules with six high-frequency odor types (fruity, green, sweet, floral, woody, herbal), we systematically evaluate 23 model configurations across traditional machine learning algorithms (Random Forest, SVM, GBDT, MLP, XGBoost, LightGBM) with three feature-processing strategies and three GNN architectures (GCN, GAT, NNConv). The results demonstrate that GNN models achieve significantly superior performance, with GCN achieving the highest macro F1-score of 0.5193 compared to 0.4766 for the best traditional method (MLP with basic preprocessing), representing a 24.1% relative improvement. Critically, we discover that threshold optimization is essential for multi-label chemical classification. These findings establish GNNs as the preferred approach for molecular property prediction tasks and provide crucial insights for handling class imbalance in chemical informatics applications.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。