Abstract
This work introduces Uni-Removal, an innovative two-stage framework that effectively addresses the critical challenge of domain adaptation in unified image restoration. Contemporary approaches often face significant performance degradation when transitioning from synthetic training environments to complex real-world scenarios due to the substantial domain discrepancy. Our proposed solution establishes a comprehensive pipeline that systematically bridges this gap through dual-phase representation learning. In the first stage, we implement a structured multi-teacher knowledge distillation mechanism that enables a unified student architecture to assimilate and integrate specialized expertise from multiple pre-trained degradation-specific networks. This knowledge transfer is rigorously regularized by our novel Instance-Grained Contrastive Learning (IGCL) objective, which explicitly enforces representation consistency across both feature hierarchies and image spaces. The second stage introduces a groundbreaking output distribution calibration methodology that employs Cluster-Grained Contrastive Learning (CGCL) to adversarially align the restored outputs with authentic real-world image characteristics, effectively embedding the student model within the natural image manifold without requiring paired supervision. Comprehensive experimental validation demonstrates Uni-Removal's superior performance across multiple real-world degradation tasks including dehazing, deraining, and deblurring, where it consistently surpasses existing state-of-the-art methods. The framework's exceptional generalization capability is further evidenced by its competitive denoising performance on the SIDD benchmark and, more significantly, by delivering a substantial 4.36 mAP improvement in downstream object detection tasks, unequivocally establishing its practical utility as a robust pre-processing component for advanced computer vision systems.