Underwater image dehazing using a hybrid GAN with bottleneck attention and improved Retinex-based optimization

基于混合生成对抗网络(GAN)的水下图像去雾算法,结合瓶颈注意力机制和改进的基于Retinex的优化方法

阅读:1

Abstract

Autonomous underwater vehicles (AUVs) are essential for marine exploration, monitoring, and surveillance, especially in hazardous or inaccessible environments for human divers. Underwater imaging systems frequently face considerable difficulties in detecting and tracking objects due to image degradation resulting from light scattering, colour distortion, and haze. Conventional enhancement methods-like histogram equalisation and gamma correction-struggle with non-uniform illumination and frequently do not maintain critical structural details and perceptual quality. To address these limitations, this work proposes a novel framework for underwater image dehazing and enhancement that incorporates three essential components: a generative adversarial network (GAN), a bottleneck attention module (BAM), and an enhanced Retinex-based contrast enhancement technique. The GAN acquires the intricate correspondence between deteriorated and high-quality underwater images, facilitating the restoration of fine textures and the attenuation of noise. The BAM selectively amplifies spatial and channel-specific features, thereby augmenting the network's capacity to preserve natural hues and intricate details. The modified Retinex algorithm adaptively distinguishes between illumination and reflectance components, facilitating context-sensitive contrast enhancement across various lighting conditions. This integrated architecture facilitates collaborative learning among generative modelling, attention-driven feature refinement, and physics-based enhancement. The proposed method undergoes thorough evaluation on the underwater Image enhancement benchmark (UIEB) dataset, which consists of 890 authentic underwater images. This study presents exceptional quantitative performance across various evaluation metrics: a UIQM score of 3.71 (indicating image quality), a PSNR of 28.4 dB (reflecting signal fidelity), an SSIM of 0.88 (representing structural similarity), and a perceptual LPIPS score of 0.082. The low LPIPS score underscores the perceptual realism of the enhanced images, correlating effectively with human visual preferences. These results distinctly surpass current classical and learning-based enhancement methods, demonstrating the efficacy and resilience of this approach for practical underwater vision applications.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。