SNPs and blood inflammatory marker featured machine learning for predicting the efficacy of fluorouracil-based chemotherapy in colorectal cancer



Abstract

Responses to fluorouracil-based chemotherapy vary widely among colorectal cancer (CRC) patients, highlighting the role of pharmacogenomics in developing better predictive models. We analyzed 379 CRC patients receiving fluorouracil-based chemotherapy, collecting data on fluorouracil metabolism-related SNPs (TYMS, MTHFR, DPYD, RRM1), blood inflammatory markers, and clinical status. Six machine learning models (K-nearest neighbors, support vector machine, gradient boosting decision trees (GBDT), eXtreme Gradient Boosting (XGBoost), LightGBM, and random forest) were compared against multivariate logistic regression and a deep learning model (i.e., multilayer perceptron, MLP). Feature importance analysis highlighted seven predictors: histological grade, N and M staging, monocyte count, platelet-to-lymphocyte ratio, MTHFR rs1801131, and RRM1 rs11030918. In five-fold cross-validation, XGBoost and GBDT exhibited superior performance, with an area under the curve (AUC) of 0.88 ± 0.02. XGBoost excelled in identifying favorable prognosis (recall = 0.939). GBDT demonstrated balance in recognizing both categories, with a recall for favorable prognosis of 0.908 and a precision for unfavorable prognosis of 0.863. MLP had a similar AUC (0.87) with high precision for favorable prognosis (recall = 0.946). In external validation, the XGBoost model achieved an accuracy of 0.79. An online prognostic tool based on XGBoost was developed, integrating metabolism-related SNPs and inflammatory markers, enhancing CRC treatment precision and supporting tailored chemotherapy.
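The abstract reports per-class recall and precision along with a cross-validated AUC. The following is a minimal sketch of how such metrics are conventionally computed, not the authors' code. All labels, scores, and fold AUCs below are made-up toy values. Coding favorable prognosis as 1, using a 0.5 decision threshold, and reading "±" as a sample standard deviation across folds are assumptions.

```python
from statistics import mean, stdev


def recall(y_true, y_pred, positive):
    """Fraction of actual `positive` cases that were predicted as `positive`."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    actual = sum(1 for t in y_true if t == positive)
    return tp / actual if actual else 0.0


def precision(y_true, y_pred, positive):
    """Fraction of predicted `positive` cases that are truly `positive`."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    predicted = sum(1 for p in y_pred if p == positive)
    return tp / predicted if predicted else 0.0


def auc(y_true, scores, positive=1):
    """Chance that a random positive case scores above a random negative one
    (ties count as one half). Assumes both classes are present."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = favorable prognosis, 0 = unfavorable (assumed coding).
y_true = [1, 1, 1, 0, 0, 1, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.7, 0.2, 0.55]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

favorable_recall = recall(y_true, y_pred, positive=1)
unfavorable_precision = precision(y_true, y_pred, positive=0)
toy_auc = auc(y_true, scores)

# Hypothetical per-fold AUCs, summarized as mean and sample standard deviation.
fold_aucs = [0.80, 0.85, 0.90]
auc_mean, auc_sd = mean(fold_aucs), stdev(fold_aucs)

print(f"recall (favorable) = {favorable_recall:.3f}")
print(f"precision (unfavorable) = {unfavorable_precision:.3f}")
print(f"AUC = {toy_auc:.3f}; folds: {auc_mean:.2f} ± {auc_sd:.2f}")
```

Reporting recall for one class and precision for the other, as the abstract does for GBDT, shows how well a model captures patients with favorable prognosis and how reliable its unfavorable predictions are.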
