|
DA-YOLO:基于双重注意力的松枯病检测模型
|
Abstract:
松枯病(PWD)是一种传播迅速、杀伤力极强的森林病害,对我国森林生态安全构成严重威胁,并造成巨大的林业经济损失。考虑到我国森林面积广阔,人工巡查监测难度大且成本高,因此利用无人机遥感技术监测病树成为控制松枯病传播的有效途径。尽管目前松枯病检测算法取得了相对较好的性能,但由于松枯病的强传染性,检测效果仍需进一步提高。基于此,本文提出了一种基于YOLOv5的双重注意力混合模型——DA-YOLO,用于更有效地检测病害树木区域。该算法使用基于自注意力的CoT模块加强骨干特征网络的提取能力,并结合ECA注意力机制整体提升定位精度。实验结果显示,在使用PWD遥感数据集时,该模型的AP@0.5:0.95较之基线提高了5.2个百分点。并将本文提出的算法DA-YOLO与Faster R-CNN、RetinaNet、YOLOv5、YOLOv6、YOLOx、YOLOv7等算法的模型复杂度和精度进行对比,并分析了它们在检测松枯萎线虫树木方面的效果。实验结果表明,本文提出的DA-YOLO模型在检测方面具有明显的优势。
This Pine wilt disease (PWD) is a rapidly spreading and highly lethal forest disease that poses a serious threat to China’s forest ecological security and causes significant economic losses in forestry. Considering the vast forest area in China, the difficulty and high cost of manual inspection and monitoring, the use of drone remote sensing technology to monitor diseased trees has become an effective way to control the spread of pine wilt disease. Although the current pine wilt disease detection algorithm has achieved relatively good performance, the detection effect still needs to be further improved due to the strong infectivity of pine wilt disease. Based on this, this article proposes a dual attention hybrid model called DA-YOLO based on YOLOv5, which is used to more effectively detect areas of diseased trees. This algorithm uses a self attention based CoT module to enhance the extraction ability of the backbone feature network, and combines ECA attention mechanism to improve the overall positioning accuracy. The experimental results show that when using the PWD remote sensing dataset, the model’s AP@0.5 0.95 is 5.2 percentage points higher than the baseline. And the model complexity and accuracy of the algorithm DA-YOLO proposed in this article were compared with Faster R-CNN, RetinaNet, YOLOv5, YOLOv6, YOLOx, YOLOv7 and other algorithms, and their performance in detecting pine wilt nematode trees was analyzed. The experimental results show that the DA-YOLO model proposed in this paper has significant advantages in detection.
[1] | Pan, C.S. (2011) Development of Studies on Pinewood Nematodes Diseases. Journal of Xiamen University, 50, 476-483. |
[2] | Duarte, A., Borralho, N., Cabral, P., et al. (2022) Recent Advances in Forest Insect Pests and Diseases Monitoring Using UAV-Based Data: A Systematic Review. Forests, 13, Article 911. https://doi.org/10.3390/f13060911 |
[3] | 许可, 季卓, 夏凯, 等. 结合注意力机制与YOLOv5的松材线虫病受害木检测[J]. 林业工程学报, 2023, 8(3): 156-164. https://doi.org/10.13360/j.issn.2096-1359.202208018 |
[4] | 李博, 姚黎帆, 王广彪, 等. 基于改进RetinaNet算法和无人机影像的松材线虫病疫木检测[J]. 现代信息科技, 2023, 7(14): 71-74+79. https://doi.org/10.19850/j.cnki.2096-4706.2023.14.016 |
[5] | Li, Y., Yao, T., Pan, Y., et al. (2022) Contextual Transformer Networks for Visual Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, 1489-1500. https://doi.org/10.1109/TPAMI.2022.3164083 |
[6] | Wang, Q., Wu, B, Zhu, P., et al. (2020) ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, 13-19 June 2020, 11534-11542. https://doi.org/10.1109/CVPR42600.2020.01155 |
[7] | Vaswani, A., Shazeer, N., Parmar, N., et al. (2017) Attention Is All You Need. In: Guyon, I., et al., Eds., Advances in Neural Information Processing Systems 30, Long Beach, 04-09 December 2017, 1-11. |
[8] | Ren, S., He, K., Girshick, R., et al. (2015) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In: Cortes, C., et al., Eds., Advances in Neural Information Processing Systems 28, Montreal, 07-12 December 2015, 1-9. |
[9] | Lin, T.Y., Goyal, P., Girshick, R., et al. (2017) Focal Loss for Dense Object Detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, 22-29 October 2017, 2980-2988.
https://doi.org/10.1109/ICCV.2017.324 |
[10] | Li, C., Li, L., Jiang, H., et al. (2022) YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications. arXiv preprint arXiv:2209.02976. |
[11] | Wang, C.Y., Bochkovskiy, A. and Liao, H.Y.M. (2023) YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the- Art for Real-Time Object Detectors. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, 17-24 June 2023, 7464-7475. https://doi.org/10.1109/CVPR52729.2023.00721 |
[12] | Ge, Z., Liu, S., Wang, F., et al. (2021) YOLOX: Exceeding YOLO Series in 2021. arXiv preprint arXiv:2107.08430. |