ThermoPred: AI-Enhanced Quantum Chemistry Data Set and ML Toolkit for Thermochemical Properties of API-Like Compounds and Their Degradants



Abstract

In this work, we present an open-access quantum-chemistry database of more than 14,500 API-like (active pharmaceutical ingredient) molecules and their degradation products, all optimized at the M06-2X/6-31G(d) level of theory. The data set provides a suite of thermochemical and quantum descriptors (Gibbs free energy, enthalpy, electronic energy, vibrational frequencies, and Cartesian geometries) suited to large-scale modeling. Using these data, we trained and validated three machine-learning models (XGBoost, Random Forest, and Multi-Layer Perceptron) to enable rapid, accurate prediction of Gibbs free energy and enthalpy. These models are bundled in ThermoPred, an open-source Python package that offers a scalable, computationally efficient alternative to traditional quantum-chemical calculations. All data sets, models, and source code are freely available to support reproducibility and foster community-driven development.
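The abstract lists vibrational frequencies alongside enthalpy and Gibbs free energy but does not say how the thermochemical values were derived. As a rough illustration of how these descriptors are commonly related, the sketch below applies the textbook harmonic-oscillator expressions to a list of wavenumbers and returns the vibrational zero-point energy and the vibrational contributions to enthalpy and Gibbs free energy. The function name `vibrational_terms`, the default temperature of 298.15 K, and the example wavenumbers are assumptions made for illustration. They are not taken from the ThermoPred package or its data set, and translational, rotational, and electronic terms are deliberately left out.

```python
import math

# CODATA 2018 exact constants
H_PLANCK = 6.62607015e-34    # J s
C_LIGHT_CM = 2.99792458e10   # cm / s
K_BOLTZ = 1.380649e-23       # J / K
N_AVOGADRO = 6.02214076e23   # 1 / mol


def vibrational_terms(wavenumbers_cm, temperature=298.15):
    """Harmonic-oscillator vibrational terms in kJ/mol.

    Hypothetical helper (not part of ThermoPred). Returns a tuple
    (zpe, h_vib, g_vib): the zero-point energy and the vibrational
    contributions to enthalpy and Gibbs free energy.
    """
    rt = K_BOLTZ * N_AVOGADRO * temperature  # R*T in J/mol
    zpe = thermal = g_corr = 0.0
    for nu in wavenumbers_cm:
        if nu <= 0.0:
            continue  # skip zero or imaginary modes (assumed convention)
        e_mode = H_PLANCK * C_LIGHT_CM * nu * N_AVOGADRO  # h*c*nu per mole, J/mol
        x = e_mode / rt                                   # reduced energy h*c*nu / (k*T)
        zpe += 0.5 * e_mode
        thermal += e_mode / math.expm1(x)                 # thermal vibrational energy
        g_corr += rt * math.log1p(-math.exp(-x))          # equals thermal - T*S_vib
    h_vib = zpe + thermal
    g_vib = zpe + g_corr
    return zpe / 1000.0, h_vib / 1000.0, g_vib / 1000.0


if __name__ == "__main__":
    # Illustrative wavenumbers in cm^-1 (roughly water-like), not from the data set.
    zpe, h_vib, g_vib = vibrational_terms([1595.0, 3657.0, 3756.0])
    print(f"ZPE   = {zpe:.2f} kJ/mol")
    print(f"H_vib = {h_vib:.2f} kJ/mol")
    print(f"G_vib = {g_vib:.2f} kJ/mol")
```

Because the per-mode Gibbs correction is RT ln(1 - e^(-x)), which is never positive, the vibrational Gibbs term is never above the zero-point energy, while the enthalpy term is never below it. A surrogate model of the kind the abstract describes would predict the total quantities directly rather than assembling them from frequencies in this way.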
