Mining Structural Information from Gas Chromatography-Electron-Impact Ionization-Mass Spectrometry Data for Analytical-Descriptor-Based Quantitative Structure-Activity Relationship

从气相色谱-电子轰击电离-质谱数据中挖掘结构信息,用于基于分析描述符的定量构效关系研究

阅读:1

Abstract

Recently developed quantitative structure-activity relationship (QSAR) prediction uses machine learning techniques with analytical signals from the full scan of mass spectra as input, and does not need exhaustive structural determination to assess unknown compounds. The QSAR approach assumes that a mass spectral pattern reflects the structure of a target chemical. However, the relationship between the spectrum and structure is complex, and requirement of its interpretation could restrict further development of QSAR prediction methods based on analytical signals. In this study, whether gas chromatography-electron-impact ionization-mass spectrometry (GC-EI-MS) data contain meaningful structural information that assists QSAR prediction was determined by comparing it with the traditional molecular descriptor used in QSAR prediction. Four molecular descriptors were used: ECFP6, topological descriptor in CDK, MACCS key, and PubChem fingerprint. The predictive performance of QSAR based on analytical and molecular descriptors was evaluated in terms of molecular weight, log K(o-w), boiling point, melting point, water solubility, and two oral toxicities in rats and mice. The influential variables were further investigated by comparing analytical-descriptor-based and linear regression models using simple indicators of the mass spectrum. The investigation indicated that the analytical and molecular descriptors preserved structural information differently. However, their performance was comparable. The analytical-descriptor-based approach predicted the physicochemical properties and toxicities of structurally unknown chemicals, which was beyond the scope of the molecular-descriptor-based approach. The QSAR approach based on analytical signals is valuable for evaluating unknown chemicals in many scenarios.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。