Confidently Uncertain: Probabilistic Machine Learning to Predict Soil Biotransformation Half-Lives

充满信心的不确定性:利用概率机器学习预测土壤生物转化半衰期

阅读:2

Abstract

Predicting environmental persistence of chemicals from molecular structure is an open challenge, yet indispensable in regulatory screenings for potentially harmful substances and to advance the development of safe-and-sustainable-by-design chemicals. Limited availability of biotransformation half-life data makes persistence prediction difficult, and models typically struggle to generalize beyond their training data. Therefore, reliable estimates of prediction confidence are key. Here, we propose a probabilistic model for the prediction of soil biotransformation half-lives. A Gaussian Process Regressor was trained on 867 mean pesticide half-lives with data uncertainty estimates. Instead of single half-life values, our model predicts well-calibrated probability distributions that can be used to calculate a compound's probability of being persistent. Although the overall model performance remains moderate, the predictions are reliable when the confidence in the prediction is high. We applied our model to pesticide transformation products with unknown half-lives, and to a database of globally marketed chemicals. We show that our model is able to identify chemicals that are known, or suspected to be, persistent in the environment. The model is available as an online app (https://pepper-app.streamlit.app/) and as a Python library (pepper-lab) to meet diverse user needs.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。