Abstract
BACKGROUND: Air pollution is a major public health threat globally. Health studies, regulatory actions, and policy evaluations typically rely on air pollutant concentrations from single exposure models, assuming accurate estimations and ignoring related uncertainty. We developed a modeling framework, bneR, to apply the Bayesian Nonparametric Ensemble (BNE) prediction model that combines existing exposure models as inputs to provide air pollution estimates and their spatio-temporal uncertainty. METHODS: The bneR modeling framework (1) harmonizes air pollutant datasets to use standardized inputs for the BNE algorithm; (2) applies the BNE algorithm to obtain the posterior predictive distribution of pollutant concentrations; and (3) generates visualizations. We applied bneR to estimate NO(2) concentrations and characterize uncertainty levels at high spatio-temporal resolution (daily, 1 km(2)) over New York State (NYS) for 2015. We met with stakeholders and modelers to discuss bneR user-friendliness and interpretation of its estimates. RESULTS: Using bneR, we harmonized the spatial scale of four input NO(2) models (using the finer resolution, 1 km(2) for BNE estimations), applied BNE to obtain the NO(2) daily posterior predictive distribution, and visualized the results. Over NYS, the daily average NO(2) concentration was 6.0 (interquartile range, IQR: 4.6-6.8) pbb with daily average uncertainty (as SD) of 1.2 (IQR: 1.0-1.3) ppb. BNE performed well with cross-validated RMSE=2.84 ppb and R(2)=0.80. CONCLUSION: Meeting stakeholders and modelers allowed us to understand that efficient communication on how uncertainty is estimated and interpreted is a key feature for these communities to engage in using bneR and its data products.