Detecting automobile insurance fraud using a novel penalty-driven feature selection method with particle swarm optimization and machine learning classifiers

利用基于粒子群优化和机器学习分类器的新型惩罚驱动特征选择方法检测汽车保险欺诈

阅读:1

Abstract

Automobile insurance fraud poses a significant challenge for insurers, leading to substantial financial losses through fabricated claims and exaggerated damages. Traditional machine learning approaches often struggle with high-dimensional, imbalanced data and limited interpretability, reducing their practical applicability. To address these issues, we propose a penalty-driven feature selection method with particle swarm optimization, which penalizes highly correlated features to improve model generalization and maintain interpretability. The method was evaluated on the real-world "Angoss carclaims" dataset, comprising 33 features and 15,420 records, and balanced using the synthetic minority oversampling technique. Eleven machine learning classifiers, including random forest, support vector machine, K-nearest neighbors, logistic regression, decision tree, artificial neural networks, gradient boosting, adaptive boosting, categorical boosting, light gradient boosting machine, and stacking classifier were tested, including ensemble and boosting methods, with hyperparameters tuned via grid search and assessed under four threshold values (α = 0.85, 0.75, 0.65, 0.50). The Stacking Classifier achieved the most reliable performance, reaching 97.55% accuracy with a balanced F1-score of 0.9754 when the feature set was reduced to 16 at α = 0.65. These findings demonstrate that the proposed framework effectively balances predictive accuracy with interpretability, offering a practical tool for fraud detection in insurance analytics.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。