Abstract
Aboveground biomass (AGB) is a key indicator of vegetation productivity and terrestrial carbon stocks; therefore, robust AGB estimation is critical for assessing ecosystem services and carbon cycle research. Previous studies have largely focused on forest and cropland ecosystems. In contrast, roadside vegetation along highways and other linear transport corridors remains comparatively underexplored despite its potentially important role as a carbon sink. Here, we integrate field-measured AGB samples with GF-2 high-resolution satellite imagery to evaluate the suitability of multiple remote-sensing predictors and machine-learning algorithms for estimating AGB in highway roadside vegetation. Six remote-sensing variables were used as predictors, including four vegetation indices (Normalized Difference Vegetation Index (NDVI), Perpendicular Vegetation Index (PVI), Enhanced Vegetation Index (EVI), and Modified Soil-Adjusted Vegetation Index (MSAVI) and two-band ratios (B342 and B12/34). Five regression models-multiple linear regression (MLR), partial least squares regression (PLSR), random forest (RF), support vector regression (SVR), and extreme gradient boosting (XGBoost)-were developed and systematically compared under both single-variable and multi-variable scenarios. Model performance was evaluated using five-fold cross-validation, with the coefficient of determination (R(2)) and the root mean square error (RMSE) as metrics of evaluation. The results indicate that the RF model under the multi-variable scenario achieved the best overall performance, with a training R(2) of 0.83 and a testing RMSE of 0.84 kg·m(-2), substantially outperforming the other linear and non-linear models. The optimal RF model was further applied to GF-2 imagery to produce a spatially explicit AGB map for a 32 km highway segment and a 30 m roadside buffer on both sides, yielding an estimated total aboveground biomass of 566.97 t for the corridor. These findings demonstrate that combining high-resolution remote sensing with machine-learning approaches can effectively improve AGB estimation for linear roadside vegetation systems, providing technical support for ecological monitoring, roadside greening management, and carbon accounting for transport infrastructure.