%0 Journal Article %T 融合空间结构权重优化注意力机制的建筑物立面元素检测
Building Facade Element Detection Based on Spatial Structure Weight Optimization Attention Mechanism %A 江涛 %A 常莉红 %A 魏征 %A 董震 %J Geomatics Science and Technology %P 122-134 %@ 2329-7239 %D 2023 %I Hans Publishing %R 10.12677/GST.2023.112014 %X 本文针对街景图像立面元素检测问题,提出了融合空间结构权重优化注意力机制的立面元素目标检测网络。在主干网络部分使用嵌入基于空间结构优化坐标注意力机制的C3模块,增加横纵坐标权重分支,有效利用空间结构编码信息,提升立面元素定位精度;其次针对立面最主要组成元素窗户、阳台的小目标特性,使用改进的递归门控卷积模块替换原始卷积模块,融合丰富的多尺度上下文信息,并增加小目标检测分支,提升检测精度;最后设计了ECIOU损失同时对检测框的长宽比以及定位中心进行监督,增强网络对立面元素的感知能力,提升网络收敛速度。在FacadeWHU数据集上实验结果表明,本文模型的平均精度比相较于基线网络Yolov5s而言整体平均精度提升了16.4%,窗户目标的平均精度提升了22.4%,阳台目标的平均精度提升了25.5%,可以有效检测立面元素,更好的服务于病害检测、能耗分析等下游任务。
Aiming at the problem of facade element detection in street view image, this paper proposes a fa-cade element object detection network integrating spatial structure weight optimization mecha-nism. C3 module embedded in the coordinate attention mechanism based on spatial structure op-timization is used in the backbone network to increase the weight branches of horizontal and verti-cal coordinates, effectively use the spatial structure coding information, and improve the position-ing accuracy of elevation elements. Secondly, in view of the small target characteristics of Windows and balconies, which are the main components of the facade, an improved recursive gated convolu-tional module is used to replace the original convolutional module, integrate rich multi-scale con-text information, and add small target detection branches to improve detection accuracy. Finally, ECIOU loss is designed to supervise the aspect ratio of the detection frame and the positioning cen-ter, which enhances the perception ability of the opposite elements of the network and improves the convergence speed of the network. Experimental results on Facade WHU data set show that compared with baseline network yolov5s, the average accuracy of the proposed model is improved by 16.4% overall, 22.4% for window target and 25.5% for balcony target, which can effectively de-tect facade elements. Better service for disease analysis, energy consumption analysis and other downstream tasks. %K 立面解析,建筑物立面,立面元素检测;Facade Parsing %K Building Facade %K Facade Elements Detection %U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=64383