%0 Journal Article
%T Deep Learning-Based Recognition of Blurred Ship Identification Text in Maritime Context (基于深度学习的模糊遮挡船号文本识别)
%A 余胜任
%J Modeling and Simulation
%P 450-459
%@ 2324-870X
%D 2024
%I Hans Publishing
%R 10.12677/MOS.2024.131043
%X With the rapid growth of China's shipping industry, accurate recognition of ship numbers, which serve as each vessel's unique identifier, is crucial for ship management. In practice, however, ship-number text is often blurred or occluded, which greatly reduces recognition accuracy. Traditional image processing and text recognition methods do not handle this problem well, and most existing approaches work in two stages, first restoring the image and then recognizing text in the restored image, which ignores the correlation between image restoration and text recognition. This paper therefore proposes SRC-GAN, a dual-branch coupled text recognition framework for blurred and occluded ship-number text. It combines a Generative Adversarial Network (GAN) with a Convolutional Recurrent Neural Network (CRNN) and integrates text recognition and image restoration through adversarial learning. Jointly training the recognition model and the GAN model lets the framework learn more features common across images, which improves recognition of low-quality images. Recognition experiments on a ship dataset and the CTW dataset show that the method improves recognition accuracy over the original CRNN by an average of 11.98% and 10.68%, respectively, and that it also has some advantage over two-stage recognition models. SRC-GAN achieves better recognition performance on blurred and occluded text images.
%K Deep Learning
%K Text Recognition
%K GAN
%K CRNN
%U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=79538