Data anomaly repair method based on fuzzy voting and multi-segment interpolation.

阅读:5
作者:Lv Yanling, Han Qingdong, Xue Shulei
Wind turbines are often situated in remote areas under harsh environmental conditions, where external noise and electromagnetic interference can corrupt the data, negatively impacting downstream tasks such as predictive alerts and diagnostics. Consequently, this paper proposes a comprehensive data processing workflow, encompassing both anomaly detection and data interpolation, to preprocess data for wind farms effectively. Firstly, an outlier detection method based on fuzzy voting theory is proposed, utilizing multiple anomaly detectors to ensure accurate detection of outliers within voluminous datasets. Secondly, a multi-segment data interpolation method based on segmented recognition is introduced. This method captures statistical features of the dataset to establish dynamic thresholds for identifying the upper limits of missing segments. For middle gaps, interpolation is performed using forward-backward LOESS, while large gaps are filled using thermal card filling based on similar trend recognition. This approach not only enhances the quality of data interpolation but also optimally balances the training time cost. Finally, the proposed method was validated using real-world wind field data. The results of the analysis demonstrate that compared to LSTM and other interpolation methods, the multi-segment interpolation approach achieved significant improvements in performance metrics, with MAE, MSRE, and RSE reduced by 24%, 7.1%, and 8.2%, respectively, indicating a notable enhancement in data quality. After completing the full data processing workflow, the wind field data showed a substantial improvement in model performance: the test set F1 score of the DLinear model increased by 3.8-19.1%, and Accuracy improved by 2.3-13.3% compared to the unprocessed data. These results highlight the enhanced precision and stability of the early warning model, along with faster convergence speeds.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。