Construction of the prediction model and analysis of key winning factors in world women's volleyball using gradient boosting decision tree

基于梯度提升决策树的世界女子排球冠军预测模型构建与关键获胜因素分析

阅读:1

Abstract

This study aims to analyze the key factors contributing to victories in world women's volleyball matches and predict match win rates using machine learning algorithms. Initially, Grey Relational Analysis (GRA) was employed to analyze the fundamental match data of the top six teams over three major world tournaments during the 2020 Olympic cycle (a total of 142 matches, 505 sets, and 27 metrics). The 27 metrics were used as subsequences, and the set win rate served as the parent sequence to identify metrics with a high contribution to match victories. Subsequently, the Gradient Boosting Decision Tree (GBDT) algorithm was utilized to construct a prediction model for match win rates, using the selected metrics as input features and set win rates as output features. The input metrics were ranked by their contribution to determine the most influential factors on match victories. The results indicate that spike scoring rate, blocking height, excellent defense rate, serve scoring rate, block scoring rate, proportion of serve scores, and proportion of block scores significantly impact match victories. Among these, spike scoring rate and blocking height are decisive, with feature importance values of 0.45 and 0.3, respectively. The constructed GBDT model demonstrated good predictive performance, capable of predicting match win rates. The model parameters are as follows: learning rate (learning-rate) of 0.1, number of trees (n-estimators) of 150, and maximum depth of the tree model (max-depth) of 2. The model's accuracy metrics on the test set are: MSE = 0.002, MAE = 0.0322, R(2) = 0.8497, and MAPE = 4.77%. The average relative error of the model validation is 5.30%, with R(2) = 0.743. This study not only identifies the key factors contributing to victories in world women's volleyball but also demonstrates the innovative application of combining Grey Relational Analysis with the Gradient Boosting Decision Tree algorithm in the volleyball domain. The findings provide data-driven insights to support coaches in training design and in-game decision-making, highlighting important practical value.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。