Feature Engineering and Supervised Machine Learning to Forecast Biogas Production during Municipal Anaerobic Co-Digestion

利用特征工程和监督式机器学习预测市政厌氧共消化过程中的沼气产量

阅读:2

Abstract

Municipalities with excess anaerobic digestion capacity accept offsite wastes for co-digestion to meet sustainability goals and create more biogas. Despite the benefits inherent to co-digestion, the temporal and compositional heterogeneity of external waste streams creates operational challenges that lead to upsets or conservative co-digestion. Given the complex microbial bioprocesses occurring during anaerobic digestion, prediction and modeling of the outcomes can be challenging, and machine learning has the potential to improve understanding and control of co-digestion processes. Biogas flows are a surrogate for process health, and here, we predicted biogas production from historical data collected by a water resource recovery facility (WRRF) during normal operation. We tested a daily lab and operational data set (n = 1089 after cleaning) and a minute-by-minute supervisory control and data acquisition (SCADA) operational data set (n = 491,761 after cleaning) to determine if forecasting biogas flow for a 24 h time horizon is feasible without collecting additional data. We found that a multilayer perceptron (MLP) neural network model outperformed tree-based and multiple linear regression models. Using a high-resolution SCADA data set for the first time, we showed that MLP neural networks could predict biogas production with an adjusted coefficient of determination (R(2)) of 0.78 and a mean absolute percentage error of 13.4% on a holdout test set. Adding daily laboratory analyses to the model did not appreciably improve the prediction of biogas flows. Feature engineering was essential to an accurate prediction, and 11 of the 15 most important features in the SCADA model were calculated from raw SCADA outputs. In summary, this paper demonstrates that minute-scale SCADA information collected at a municipal co-digestion facility can forecast biogas production, as a first step toward a digital twin model, without additional data collection.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。