Dynamic ensemble deep learning with multi-source data for robust influenza forecasting in Yangzhou

基于多源数据的动态集成深度学习在扬州流感预测中的稳健应用

阅读:2

Abstract

BACKGROUND: Traditional influenza surveillance often suffers from reporting delays, hindering timely public health response. This study aimed to mitigate this limitation by developing an accurate deep learning framework for influenza prediction. METHODS: We constructed a 13-year (652-week) multi-source dataset for Yangzhou City, China, integrating influenza-like illness (ILI) surveillance data with Baidu search indices and meteorological variables. Six state-of-the-art deep learning models-GRU, Transformer, LSTM, TFT (Temporal Fusion Transformer), TCN (Temporal Convolutional Network), and N-BEATS-were systematically compared under 1-, 5-, and 9-week sliding time windows. Based on their robust performance across windows, GRU, TCN, and Transformer were selected as core base learners. Using these models, we designed several ensemble strategies, culminating in a dynamic weighted ensemble with seasonal residual adjustment (DWE + SRA). RESULTS: Multi-source data integration improved predictive accuracy relative to using surveillance data alone. Forecasting performance varied with the sliding time window: under a 1-week window, all models showed comparable accuracy, with GRU achieving the lowest RMSE and highest R² and Transformer the lowest MAE; at 5 weeks, TCN achieved the best overall performance, while Transformer, and GRU also maintained relatively good accuracy; at 9 weeks, GRU, TCN, and Transformer remained competitive, whereas TFT and N-BEATS degraded. Overall, GRU, TCN, and Transformer exhibited the most robust performance across window lengths. The proposed DWE + SRA strategy, built on these three base learners, further enhanced forecasting accuracy and stability, reducing test-set RMSE and MAE by approximately 28% and 17%, respectively, compared with the best single model (GRU), and closely tracking observed ILI dynamics during both peak and off-peak periods. CONCLUSION: This study presents a multi-source deep learning framework that effectively integrates heterogeneous data to compensate for surveillance delays. Key contributions include: (i) a systematic sliding-window comparison that reveals the temporal strengths of different architectures; and (ii) a DWE + SRA ensemble strategy that dynamically adjusts model weights and corrects seasonal biases to substantially improve prediction stability. This work provides a scalable, data-driven paradigm for localized influenza forecasting and early warning.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。