Multivariable machine learning prediction of risky alcohol use in contemporary youth

Abstract

BACKGROUND AND AIMS: Risky alcohol use in young adulthood is a significant public health concern. Understanding the predictors of risky drinking during this period is essential for prevention. This study aimed to measure the predictive accuracy of ensemble machine learning and identify the most important predictors of risky alcohol use in early adulthood.

DESIGN AND SETTING: Secondary analysis of the Longitudinal Study of Australian Children, an Australian national longitudinal cohort study.

PARTICIPANTS: A total of 4983 children, aged 4-5 years in 2004 (Wave 1), followed up for eight waves (to age 18/19 in 2018).

MEASUREMENTS: Risky alcohol use was measured at age 18 and defined as more than 10 standard drinks per week, as per Australian national guidelines. Predictors from multiple domains (sociodemographic, adolescent substance use, adolescent mental health and behaviours, parental mental health and substance use, school factors, peer influences, parenting practices and parental stress) were included, measured from Wave 1 to 7. The SuperLearner package in R was used to test a series of models [regularised regression (LASSO, ridge and elastic net), random forest and kernel support vector machine (SVM)] using nested 10-fold cross-validation to identify the overall predictive ability of the model (measured by area under the curve; AUC) and the most important predictors of risky alcohol use across childhood and adolescence. Predictor importance was derived by normalising algorithm-specific scores per fold, weighting them by SuperLearner coefficients and aggregating across folds to rank predictors by mean weighted importance on a scale of 0 to 1 (higher scores indicating greater importance).

FINDINGS: The ensemble model showed good prediction on the test set, with an AUC of 0.792, a slight improvement over any single algorithm (AUC = 0.783 for the best performing individual algorithm). The most important predictors were weekly drinking at the previous wave (mean weighted importance 0.999), lifetime cannabis use (0.446), lifetime parent financial stress (0.420), identifying as female (0.365), identifying as male (0.344; compared with a reference category of gender diverse), lifetime attention deficit hyperactivity disorder (0.248), pre-natal alcohol exposure (0.248), housing insecurity (0.243), religious involvement (0.238) and parent alcohol use problems (0.215).

CONCLUSIONS: An ensemble learning approach appears to have good predictive ability of risky alcohol use among a contemporary cohort of young Australians. It underscores the complex interplay of individual, familial and social factors occurring across childhood and adolescence that influences risky alcohol use in early adulthood.
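The predictor-importance procedure described under MEASUREMENTS (normalise algorithm-specific scores per fold, weight by ensemble coefficients, average across folds) can be illustrated with a minimal Python sketch. This is not the authors' R code. The abstract does not say how scores were normalised, so dividing each algorithm's absolute scores by their per-fold maximum is an assumption here, as is the assumption that the ensemble coefficients sum to 1 within a fold. The function name `weighted_importance`, the predictor names and all numbers are hypothetical toy values, not study data.

```python
from statistics import mean


def weighted_importance(fold_scores, fold_weights):
    """Rank predictors by mean weighted importance across folds.

    fold_scores:  list over folds of {algorithm: {predictor: raw_score}}
    fold_weights: list over folds of {algorithm: ensemble_coefficient}
    """
    per_fold = []
    for scores, weights in zip(fold_scores, fold_weights):
        combined = {}
        for algo, raw in scores.items():
            # Assumed normalisation: scale absolute scores to [0, 1] by the
            # largest score this algorithm produced in this fold.
            top = max(abs(v) for v in raw.values()) or 1.0
            for predictor, value in raw.items():
                contribution = weights[algo] * abs(value) / top
                combined[predictor] = combined.get(predictor, 0.0) + contribution
        per_fold.append(combined)

    # Average the weighted scores over folds, then sort from most to least important.
    averaged = {p: mean(fold[p] for fold in per_fold) for p in per_fold[0]}
    return dict(sorted(averaged.items(), key=lambda item: item[1], reverse=True))


# Toy input: two folds, two algorithms, three hypothetical predictors.
fold_scores = [
    {"lasso": {"prior_drinking": 2.0, "cannabis": 1.0, "stress": 0.0},
     "forest": {"prior_drinking": 10.0, "cannabis": 5.0, "stress": 5.0}},
    {"lasso": {"prior_drinking": 4.0, "cannabis": 0.0, "stress": 2.0},
     "forest": {"prior_drinking": 1.0, "cannabis": 1.0, "stress": 0.0}},
]
fold_weights = [
    {"lasso": 0.5, "forest": 0.5},
    {"lasso": 1.0, "forest": 0.0},
]

ranking = weighted_importance(fold_scores, fold_weights)
for predictor, score in ranking.items():
    print(f"{predictor}: {score:.3f}")
```

Because each normalised score lies in [0, 1] and the coefficients are assumed to sum to 1 per fold, the aggregated value also lies in [0, 1], which matches the scale reported in the abstract. A predictor reaches a value near 1 only if every algorithm with non-zero weight ranks it at or near the top in every fold.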
