全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

融合Transformer和CNN的U型神经网络遥感影像道路提取算法
Remote Sensing Image Road Extraction Algorithm Based on U-Type Neural Network and Transformer Combined with CNN

DOI: 10.12677/CSA.2024.141015, PP. 134-146

Keywords: U型网络,遥感图像,蛇形动态卷积,Swin-Transformer,道路提取
U-Shaped Network
, Remote Sensing Images, Serpentine Dynamic Convolution, Swin-Transformer, Road Extraction

Full-Text   Cite this paper   Add to My Lib

Abstract:

U型网络作为一种经典的编码–解码结构网络,不只在医学影像领域内发挥出色,在图像分割领域也有广泛的影响。以U型网络为基础其它衍生网络层出不穷。U型网络最经典的思想是编码和解码,再加上编解码之间的跳跃连接。由于道路遥感图像和医学影像有众多相似的地方,如今U型网络又被用于从遥感图像中提取道路。U型网络使用跳跃连接的方式将下采样低维特征拼接到上采样的高维特征中,以保留更多的空间位置信息和语义信息。因此U型网络更能处理一些特征信息明显的图像数据。但浅层的UNet无法准确提取道路丰富多维的细节信息,在高分辨率卫星遥感图像上无法奏效。所以本文提出一种融合蛇形动态卷积和Swin-Transformer的U型网络用于提高道路提取任务的分割精确度。
As a classical coding-decoding structure network, U-shaped network not only plays an excellent role in the field of medical imaging, but also has a wide impact in the field of image segmentation. Based on U-shaped network, other derivative networks emerge in an endless stream. The most classic idea of U-shaped networks is encoding and decoding, plus jumping connections between coding and de-coding. Due to the similarities between road remote sensing images and medical images, U-shaped networks are now used to extract roads from remote sensing images. The U-shaped network spliced the low- dimensional features from the down-sampled to the high-dimensional features from the up-sampled by means of jump connection, so as to retain more spatial location information and se-mantic information. Therefore, U-shaped network is more capable of processing some image data with obvious feature information. However, shallow UNet can not accurately extract the rich mul-ti-dimensional detailed information of the road, and can not be effective in high-resolution satellite remote sensing images. Therefore, this paper proposes a U-shaped network combining serpentine dynamic convolution and Swin-Transformer to improve the segmentation accuracy of road extraction tasks.

References

[1]  Qi, Y., He, Y., Qi, X., et al. (2023) Dynamic Snake Convolution based on Topological Geometric Constraints for Tubu-lar Structure Segmentation. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, Francem, 1-6 October 2023, 6070-6079.
https://doi.org/10.1109/ICCV51070.2023.00558
[2]  Yu, F., Koltun, V. and Funk-houser, T. (2017) Dilated Residual Networks. Proceedings of the IEEE Conferenceon Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21-26 July 2017.
https://doi.org/10.1109/CVPR.2017.75
[3]  Dai, J.F., Qi, H.Z., Xiong, Y.W., et al. (2017) Deformable Convo-lutional Networks. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22-29 October 2017.
https://doi.org/10.1109/ICCV.2017.89
[4]  Dong, S.J., Pan, Z.X., Fu, Y., et al. (2022) Deu-net 2.0: Enhanced de-formable u-net for 3d cardiac cine mri segmentation. Medical Image Analysis, 78, 102389.
https://doi.org/10.1016/j.media.2022.102389
[5]  Zhao, C.H., Zhu, W.X. and Feng, S. (2022) Superpixel Guided Deformable Convolution Network for Hyperspectral Image Classification. IEEE Transactions on Image Processing, 31, 3838-3851.
https://doi.org/10.1109/TIP.2022.3176537
[6]  Jin, Q.G., Meng, Z.P., Pham, T.D., et al. (2019) Dunet: A De-formable Network for Retinal Vessel Segmentation. Knowledge-Based Systems, 178, 149-162.
https://doi.org/10.1016/j.knosys.2019.04.025
[7]  Yang, X., Li, Z.q., Guo, Y.q., et al. (2022) DCU-net: A De-formable Convolutional Neural Network Based on Cascade U-Net for Retinal Vessel Segmentation. Multimedia Tools and Applications, 81, 15593-15607.
https://doi.org/10.1007/s11042-022-12418-w
[8]  Wang, D., Zhang, Z., Zhao, Z.W., et al. (2022) Pointscatter: Point Set Representation for Tubular Structure Extraction. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M. and Hassner, T., eds., Computer Vision-ECCV 2022, Springer, Cham.
https://doi.org/10.1007/978-3-031-19803-8_22
[9]  Kong, B., Wang, X., Bai, J.J., et al. (2020) Learning Tree-Structured Representation for 3d Coronary Artery Segmentation. Computerized Medical Imaging and Graphics, 80, 101688.
https://doi.org/10.1016/j.compmedimag.2019.101688
[10]  Liu, Z., Lin, Y., Cao, Y., et al. (2021) Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. 2021 IEEE/CVF International Conference on Computer Vi-sion (ICCV), Montreal, QC, Canada, 10-17 October 2021, 10012-10022.
https://doi.org/10.1109/ICCV48922.2021.00986

Full-Text

comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413