Exploration of the Relationships between Men's Healthy Life Expectancy in Japan and Regional Variables by Integrating Statistical Learning Methods

运用统计学习方法探索日本男性健康预期寿命与区域变量之间的关系

阅读:1

Abstract

A quantitative understanding of the relationship between comprehensive health levels, such as healthy life expectancy and their related factors, through a highly explanatory model is important in both health research and health policy making. In this study, we developed a regression model that combines multiple linear regression and a random forest model, exploring the relationship between men's healthy life expectancy in Japan and regional variables from open sources at the city level as an illustrative case. Optimization of node-splitting in each decision tree was based on the total mean-squared error of multiple regression models in binary-split child nodes. Variations of standardized partial regression coefficients for each city were obtained as the ensemble of multiple trees and visualized on scatter plots. By considering them, interaction terms with piecewise linear functions were exploratorily introduced into a final multiple regression model. The plots showed that the relationship between the healthy life expectancy and the explanatory variables could differ depending on the cities' characteristics. The procedure implemented here was suggested as a useful exploratory method for flexibly implementing interactions in multiple regression models while maintaining interpretability.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。