全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

基于多尺度特征提取与融合的单幅图像去雾算法
A Single Image Dehazing Algorithm Based on Multi-Scale Feature Extraction and Fusion

DOI: 10.12677/jisp.2024.132011, PP. 117-129

Keywords: 单幅图像去雾,多尺度特征融合,U形网络,深度监督,自建数据集
Single Image Dehazing
, Multi-Scale Feature Fusion, U-Net, Deep Supervision, Self-Built Dataset

Full-Text   Cite this paper   Add to My Lib

Abstract:

为解决随着CNN网络层数加深而导致的学习成本过高或过拟合问题,提出了一种基于多尺度特征提取与融合的单幅图像去雾算法。该算法结合U-Net思想,对输入图像进行物理分割和下采样得到多个尺度的特征图,采用残差连接的方式进行多维度融合,可以更好的适配大尺度数据集。同时,在网络中加入了深度监督模块,引入额外的监督信号有助于梯度传播,加快收敛速度,保证了训练的稳定性,这种多任务的学习形式提高了网络对不同输入的适应性,可以增强去雾效果。此外,使用自带多维度天气系统渲染的3D游戏引擎,自建了一份大尺度全高清数据集,模型训练的鲁棒性和泛化能力得到显著提升。实验结果表明,所提算法在训练速度和模型大小控制上具有一定优势,在主观评价上,远景去雾效果明显,峰值信噪比(Peak Signal-to-Noise Ratio, PSNR)和结构相似性(Structure Similarity, SSIM)两个客观评价指标分别为26.75 dB和0.907,相较于对比算法中性能第二的模型分别提高了3.5和5.9个百分点,加入自建数据集进行组合训练后进一步提升了模型的去雾性能。
To solve the problem of high learning cost or overfitting caused by the deepening of CNN network layers, a single image dehazing algorithm based on multi-scale feature extraction and fusion is proposed. This algorithm combines the U-Net idea to physically segment and down-sampling the input image to obtain multi-scale feature maps. It uses residual connections for multi-dimensional fusion, which can better adapt to large-scale datasets. At the same time, a deep supervision module has been added to the network, introducing additional supervision signals to facilitate gradient propagation, accelerate convergence speed, and ensure training stability. This multi-task learning form improves the network’s adaptability to different inputs and can enhance the dehazing effect. In addition, a 3D game engine with a built-in multi-dimensional weather system rendering was used, and a large-scale high-definition dataset was built. The robustness and generalization ability of the model training was significantly improved. The experimental results show that the proposed algorithm has certain advantages in training speed and model size control. In terms of subjective evaluation, the long-range dehazing effect is obvious. The objective evaluation indicators of Peak Signal-to-Noise Ratio (PSNR) and Structure Similarity (SSIM) are 26.75 dB and 0.907, respectively, which are 3.5 and 5.9 percentage points higher than the second-best-performing model in the comparison algorithm. The addition of a self-built dataset for combined training further improves the model’s dehazing performance.

References

[1]  Rizzi, A., Gatta, C. and Marini, D. (2003) A New Algorithm for Unsupervised Global and Local Color Correction. Pattern Recognition Letters, 24, 1663-1677.
https://doi.org/10.1016/S0167-8655(02)00323-9
[2]  He, K., Sun, J. and Tang, X. (2011) Single Image Haze Removal Using Dark Channel Prior. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 2341-2353.
https://doi.org/10.1109/TPAMI.2010.168
[3]  Ren, W., Zhou, L. and Chen, J. (2023) Unsupervised Single Image Dehazing with Generative Adversarial Network. Multimedia Systems, 29, 2923-2933.
https://doi.org/10.1007/s00530-021-00852-z
[4]  Zhu, J.Y., Park, T., Isola, P., et al. (2017) Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. Proceedings of the IEEE International Conference on Computer Vision, Venice, 22-29 October 2017, 2242-2251.
https://doi.org/10.1109/ICCV.2017.244
[5]  Cai, B., Xu, X., Jia, K., et al. (2016) Dehazenet: An End-to-End System for Single Image Haze Removal. IEEE Transactions on Image Processing, 25, 5187-5198.
https://doi.org/10.1109/TIP.2016.2598681
[6]  Ren, W., Liu, S., Zhang, H., et al. (2016) Single Image Dehazing via Multi-Scale Convolutional Neural Networks. In: Leibe, B., Matas, J., Sebe, N. and Welling, M., Eds., Computer VisionECCV 2016, Lecture Notes in Computer Science, Vol 9906, Springer, Cham, 154-169.
https://doi.org/10.1007/978-3-319-46475-6_10
[7]  王柯俨, 王迪, 赵熹, 等. 基于卷积神经网络的联合估计图像去雾算法[J]. 吉林大学学报(工学版), 2020, 50(5): 1771-1777.
https://doi.org/10.13229/j.cnki.jdxbgxb20190443
[8]  Wang, L., Lee, C.Y., Tu, Z., et al. (2015) Training Deeper Convolutional Networks with Deep Supervision.
[9]  He, K., Zhang, X., Ren, S., et al. (2015) Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 1904-1916.
https://doi.org/10.1109/TPAMI.2015.2389824
[10]  Gong, Y., Wang, L., Guo, R., et al. (2014) Multi-Scale Orderless Pooling of Deep Convolutional Activation Features. In: Fleet, D., Pajdla, T., Schiele, B. and Tuytelaars, T., Eds., Computer VisionECCV 2014, Lecture Notes in Computer Science, Vol. 8695, Springer, Cham, 392-407.
https://doi.org/10.1007/978-3-319-10584-0_26
[11]  Li, B., Peng, X., Wang, Z., et al. (2017) An All-in-One Network for Dehazing and Beyond.
[12]  Koschmieder, H. (1924) Theorie der horizontalen Sichtweite. Beitragezur Physik der freien Atmosphare, 33-53.
[13]  Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab, N., Hornegger, J., Wells, W. and Frangi, A., Eds., Medical Image Computing and Computer-Assisted InterventionMICCAI 2015, Lecture Notes in Computer Science, Vol. 9351, Springer, Cham, 234-241.
https://doi.org/10.1007/978-3-319-24574-4_28
[14]  Zhou, Z., Siddiquee, M.M.R., Tajbakhsh, N., et al. (2018) A Nested U-Net Architecture for Medical Image Segmentation.
https://doi.org/10.1007/978-3-030-00889-5_1
[15]  Dong, H., Pan, J., Xiang, L., et al. (2020) Multi-Scale Boosted Dehazing Network with Dense Feature Fusion. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, 13-19 June 2020, 2157-2167.
https://doi.org/10.1109/CVPR42600.2020.00223
[16]  Zhang, J. and Tao, D. (2019) FAMED-Net: A Fast and Accurate Multi-Scale End-to-End Dehazing Network. IEEE Transactions on Image Processing, 29, 72-84.
https://doi.org/10.1109/TIP.2019.2922837
[17]  Li, B.Y., Ren, W.Q., Fu, D.P., et al. (2019) Benchmarking Single-Image Dehazing and Beyond. IEEE Transactions on Image Processing, 28, 492-505.
https://doi.org/10.1109/TIP.2018.2867951
[18]  Li, B., Gou, Y., Gu, S., et al. (2021) You Only Look Yourself: Unsupervised and Untrained Single Image Dehazing neural Network. International Journal of Computer Vision, 129, 1754-1767.
https://doi.org/10.1007/s11263-021-01431-5
[19]  Zhang, H. and Patel, V.M. (2018) Densely Connected Pyramid Dehazing Network. IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 3194-3203.
https://doi.org/10.1109/CVPR.2018.00337
[20]  Ren, W., Ma, L., Zhang, J., et al. (2018) Gated Fusion Network for Single Image Dehazing. IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 3253-3261.
https://doi.org/10.1109/CVPR.2018.00343
[21]  Zhao, S., Zhang, L., Shen, Y., et al. (2021) RefineDNet: A Weakly Supervised Refinement Framework for Single Image Dehazing. IEEE Transactions on Image Processing, 30, 3391-3404.
https://doi.org/10.1109/TIP.2021.3060873
[22]  Hu, F., Xia, G.S., Hu, J., et al. (2015) Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery. Remote Sensing, 7, 14680-14707.
[23]  https://doi.org/10.3390/rs71114680

Full-Text

comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413