Abstract
In multi-resolution remote sensing imagery, roads typically exhibit sparse, elongated, and structurally complex morphological characteristics, posing formidable connectivity modeling challenges for semantic segmentation models. Existing approaches predominantly focus on pixel-level accuracy, often neglecting the topological integrity of road networks, which leads to frequent discontinuities and omissions in predicted results. To address this, this paper proposes an end-to-end road extraction framework equipped with multi-receptive field modeling and structural connectivity preservation capabilities. The model incorporates a multi-receptive-field module to capture road patterns across varying spatial scales, a connectivity-aware decoding mechanism to strengthen structural coherence, and a topology-aware loss that explicitly guides the restoration of continuous road networks during training. On the DeepGlobe-Road dataset, TopoRF-Net achieves OA 98.57%, IoU 69.76%, F1-score 82.18%, Precision 85.50%, and Recall 79.12%; on the Massachusetts dataset, TopoRF-Net similarly achieved outstanding results: OA 96.65%, IoU 59.68%, F1-score 74.75%, Precision 77.98%, and Recall 71.77%. These results conclusively demonstrate that the proposed method significantly outperforms existing approaches in both precision and connectivity metrics, whilst exhibiting favorable parameter efficiency and inference performance.