Supervised model-based polycystic ovarian syndrome detection in relation to vitamin D deficiency by exploring different feature selection techniques


Abstract

Due to urbanization and modern lifestyles, most women today are prone to Polycystic Ovarian Syndrome (PCOS), a hormonal disorder. Although its symptoms are often neglected, the disease seriously affects women's reproductive health. Early detection of PCOS helps in managing several other conditions that are closely related to it. This article studies the impact of Vitamin D3 in PCOS and non-PCOS individuals, using a tailored dataset of 1368 records and 43 attributes. The acquired dataset is first pre-processed: missing values are handled with Probabilistic Principal Component Analysis (PPCA), outliers are detected with the Interquartile Range (IQR), features are standardized with Z-scores, and classes are balanced with SMOTE. Significant features are then selected by exploring filter-based methods (Chi-Square, ANOVA), a wrapper-based method (Electric Eel Foraging Optimization Algorithm, EEFOA) and embedded methods (LASSO, XGBoost). The selected features are used to train four classifiers: Random Forest (RF), k-Nearest Neighbour (k-NN), Decision Tree (DT) and Support Vector Machine (SVM). The experimental results show that EEFOA combined with RF achieves the best accuracy, 98.8%, with an F-measure of 98.19%. Explainable Artificial Intelligence (XAI) techniques such as SHAP and LIME are then employed to show feature importance. It is observed that over 40% of PCOS patients are affected by deficiency or insufficiency of vitamin D3.
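Two of the pre-processing steps named above, IQR-based outlier handling and Z-score standardization, can be sketched with the Python standard library alone. This is a minimal illustration and not the authors' implementation: the function names, the 1.5 multiplier, the choice to clip outliers rather than drop them, and the sample values are all assumptions made for the example.

```python
import statistics

def iqr_bounds(values, k=1.5):
    """Return (lower, upper) fences; k=1.5 is the common convention (assumed here)."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

def clip_outliers(values, k=1.5):
    """Clip values outside the IQR fences (clipping is an assumption;
    the abstract only says outliers are detected with IQR)."""
    lower, upper = iqr_bounds(values, k)
    return [min(max(v, lower), upper) for v in values]

def z_scores(values):
    """Standardize to zero mean and unit (population) standard deviation."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    if sd == 0:
        return [0.0 for _ in values]  # constant column: nothing to scale
    return [(v - mean) / sd for v in values]

# Made-up illustrative measurements, not data from the study.
sample = [10, 12, 11, 13, 12, 11, 95]
clipped = clip_outliers(sample)
standardized = z_scores(clipped)
```

In this sketch the extreme value 95 is pulled back to the upper fence before standardization, so it no longer dominates the mean and standard deviation used for the Z-scores.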
