Surface Variable-Based Machine Learning for Scalable Arsenic Prediction in Undersampled Areas

基于表面变量的机器学习方法在欠采样区域中实现可扩展的砷预测

阅读:1

Abstract

In the United States, private wells are not federally regulated, and many households do not test for Arsenic (As). Chronic exposure is linked with multiple health outcomes, and risk can change sharply over short distances and with well depth. Coarse maps or sparse sampling often miss exceedances. Most existing models operate at ∼1 km resolution and use groundwater chemistry or detailed geologic logs, which limits their use in undersampled areas where improved guidance is most needed. We overcome these limitations by developing a machine learning model for Minnesota, USA, that predicts As exposure risk using only surficial variables from remote sensing and global data sets. Variables related to surface water hydrology and geomorphology are selected based on mechanistic links that control redox conditions and As mobilization. Local training was essential, and surficial geology variables that are more sensitive to local conditions were needed to maximize model accuracy. The resulting complete model was sufficiently sensitive to generate accurate and detailed risk maps and depth profiles of As concentrations above the 10 μg/L maximum contaminant level. Accuracy depended on local training data density. We identified a training data density of 0.07 wells/km(2) as a practical target for stable county-level performance. Maps of exceedance probabilities highlight priority areas for testing that are particularly important in rural communities that have received less sampling. These results support public health action by guiding where to install wells and where to test them, how much new sampling is needed, and where treatment outreach is most urgent.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。