基于残差注意力和半监督学习的图像去雾算法

doi:10.16180/j.cnki.issn1007-7820.2023.09.008

摘要/Abstract

摘要：

基于训练合成图像的去雾算法往往不能在真实图像数据集上取得较好效果。针对泛化能力不理想等问题,文中提出了一种基于残差注意力机制的半监督学习网络用于单幅图像去雾算法。其主干网络由编码器和解码器构成,通过使用堆叠的残差注意力模块调整不同尺度的特征权重,赋予重要特征更多权重。局部残差学习选择绕过薄雾区域,使模型关注有效信息。文中训练分为有监督学习和无监督学习两个分支,分别输入合成数据和真实数据,其中使用暗通道损失和全变分损失来约束无监督分支。实验结果表明,文中所提算法在合成数据集和真实数据集上均取得了较好的结果,图像的平均处理时间仅为0.01 s,在去雾效果和处理时间上实现了平衡。

关键词: 图像去雾, 编码解码结构, 半监督框架, 注意力机制, 残差连接, SOS增强策略, 暗通道损失, SSIM损失

Abstract:

Dehazing algorithms based on training synthetic images cannot achieve satisfactory results on real image data sets. In view of these problems of unsatisfactory generalization ability, this study proposes a semi-supervised learning network based on residual attention mechanism for single image dehaze. The backbone network of the proposed model consists of an encoder and a decoder. Using stacked residual attention modules, the feature weights of different scales are adjusted to give more weight to important features. Local residuals choose to bypass hazy regions so that the model can focus on valid information. The training in this study is divided into two branches: supervised learning and unsupervised learning, which input synthetic data and real data respectively. Dark channel loss and total variational loss are used to constrain the unsupervised branches. The results show that the proposed algorithm obtains ideal results on both synthetic data sets and real data sets, and the average processing time of images is only 0.01 s, achieving a balance between dehazing effect and processing time.

Key words: image dehazing, encoding and decoding structure, semi-supervisory framework, attention mechanism, residual connection, SOS enhancement strategy, dark channel loss, SSIM loss

中图分类号:

TP391.41

孙曦,于莲芝. 基于残差注意力和半监督学习的图像去雾算法[J]. 电子科技, 2023, 36(9): 50-57.

SUN Xi,YU Lianzhi. Image Dehazing Algorithm Based on Residual Attention and Semi-Supervised Learning[J]. Electronic Science and Technology, 2023, 36(9): 50-57.

图/表 11

图1

图2

图3

图4

图5

表1

表2

表3

图6

图7

表4

参考文献 19

[1]	He L Y, Zhao J Z, Zheng N N, et al. Hardy variation framework for restoration of weather degraded images[J]. Mathematical Problems in Engineering, 2015(7):1-11.
[2]	He K, Sun J, Fellow, et al. Single image haze removal using dark channel prior[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2011, 33(12):2341-2353.
[3]	陆欢. 基于暗原色优化算法的去雾研究[J]. 电子科技, 2020, 33(4):61-64.
	Lu Huan. Research on fog removal based on dark primary color optimization algorithms[J]. Electronic Science and Technology, 2020, 33(4):61-64.
[4]	Zhu Q, Mai J, Shao L. A fast single image haze removal algorithm using color attenuation prior[J]. IEEE Transactions on Image Processing, 2015, 24(11):3522-3533. doi: 10.1109/TIP.2015.2446191 pmid: 26099141
[5]	Berman D, Avidan S. Non-local image dehazing[C]. Las Vegas: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016:47-56.
[6]	Cai B, Xu X, Jia K, et al. DehazeNet:An end-to-end system for single image haze removal[J]. IEEE Transactions on Image Processing, 2016, 25(11):5187-5198. doi: 10.1109/TIP.2016.2598681
[7]	孙红, 赵迎志. 基于多尺度梯度的轻量级生成对抗网络[J]. 电子科技, 2023, 36(7):32-38.
	Sun Hong, Zhao Yingzhi. Lightweight generative adversarial networks based on multi-scale gradient[J]. Electronic Science and Techology, 2023, 36(7):32-38.
[8]	Li B, Peng X, Wang Z, et al. Aod-net: All-in-one dehazing network[C]. Venice: Proceedings of the IEEE International Conference on Computer Vision, 2017:121-131.
[9]	Engin D, Genç A, Kemal Ekenel H. Cycle-dehaze: Enhanced cyclegan for single image dehazing[C]. Salt Lake City: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018:37-46.
[10]	Chen D, He M, Fan Q, et al. Gated context aggregation network for image dehazing and deraining[C]. Waikoloa: Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019:98-106.
[11]	Li L, Dong Y, Ren W, et al. Semi-supervised image dehazing[J]. IEEE Transactions on Image Processing, 2020, 29(11):2766-2779. doi: 10.1109/TIP.83
[12]	Shao Y, Li L, Ren W, et al. Domain adaptation for image dehazing[C]. Seattle: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020:67-74.
[13]	李硕士, 刘洪瑞, 甘永东, 等. 基于残差密集块与注意力机制的图像去雾网络[J]. 湖南大学学报(自然科学版), 2021, 48(6):112-118.
	Li Shuoshi, Liu Hongrui, Gan Yongdong, et al. Image defogging Networks based on residual dense blocks and attention mechanism[J]. Journal of Hunan University (Natural Science), 201, 48(6):112-118.
[14]	Qin X, Wang Z, Bai Y, et al. FFA-Net: Feature fusion attention network for single image dehazing[C]. New York: Proceedings of the AAAI Conference on Artificial Intelligence, 2020:54-64.
[15]	Dong H, Pan J, Xiang L, et al. Multi-scale boosted dehazing network with dense feature fusion[C]. Seattle: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020:91-97.
[16]	Zhu J Y, Park T, Isola P, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks[C]. Venice: Proceedings of the IEEE International Conference on Computer Vision, 2017:77-83.
[17]	Deng J, Dong W, Socher R, et al. Imagenet: A large-scale hierarchical image database[C]. Miami: IEEE Conference on Computer Vision and Pattern Recognition, 2009:24-31.
[18]	Li B, Ren W, Fu D, et al. Benchmarking single-image dehazing and beyond[J]. IEEE Transactions on Image Processing, 2018, 28(1):492-505. doi: 10.1109/TIP.83
[19]	Liu Z, Lin Y, Cao Y, et al. Swin transformer: Hierarchical vision transformer using shifted windows[C]. Montreal: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021:108-115.

算法	PSNR/SSIM
DCP	14.846 dB/0.702
AOD-Net	18.362 dB /0.801
GCANet	23.715 dB /0.869
SSID	22.368 dB /0.845
本文算法	25.755 dB /0.913

算法	PSNR/dB
DCP	15.97
AOD-Net	17.51
GCANet	19.62
SSID	24.69
DehazeFormer-t	27.55
本文算法	28.18

算法	运行平台	运行时间/s
DCP	Python(GPU)	1.60
AOD-Net	Caffe(GPU)	0.40
GCANet	Pytorch(GPU)	2.13
SSID	Pytorch(GPU)	0.85
DehazeFormer-t	Pytorch(GPU)	0.05
本文算法	Pytorch(GPU)	0.01

模型	SSIM损失	残差注意力机制	SOS	PSNR/dB
模型1	√	×	×	22.988
模型2	√	√	×	24.392
模型3	√	√	√	25.755