Abstract
BACKGROUND: Tuberculosis (TB) is a major global health problem, and the pathogenesis of TB is determined by multiple variables. The complicated relationship between geographic determinants and incidence rates is poorly understood, and multicollinearity and spatial heterogeneity were not considered when exploring this relationship. METHODS: In this study, the factors influencing the incidence of TB in China were investigated, considering spatial heterogeneity, to develop a multidimensional TB indicator system that incorporates geographic factors. A comprehensive linear-nonlinear two-stage feature screening model was developed to identify key factors contributing to TB. The ordinary least squares model was constructed at the national scale using these key indicators to understand the macro-relationships between TB incidence rates and key indicators. A geographically weighted regression (GWR) model was constructed at a provincial scale, and a multiscale geographically weighted regression (MGWR) model was developed to conduct an in-depth comparative analysis of the fitting effects of the GWR and MGWR models on the TB incidence rates. The goal of this study is to investigate the impact of the GWR and MGWR models on TB incidence. The adjustable bandwidth mechanism of the MGWR model was compared with the fixed bandwidth mechanism of the GWR model to determine the best model for geographical analysis of TB incidence. RESULTS: The MGWR model had the best fit (R(2) = 0.942; AICc = 57.060) for TB incidence and provided unique bandwidths for important variables to improve model geographic analysis. The analysis of geographic components using the MGWR model revealed that the fitting coefficients of mean height, topographic relief, and average annual precipitation were spatially heterogeneous. CONCLUSION: These results provide the theoretical foundation for developing TB prevention and control measures.