基于Inception结构的手写汉字档案文本识别方法

A Handwritten Chinese Characters Files Text Recognition Method Based on Inception Structure

  • 摘要: 针对手写汉字文本识别准确率不高的问题,提出了一种结合卷积神经网络和循环神经网络进行手写汉字文本识别的端到端方法。首先,通过Inception模块构建的卷积神经网络提取文本图像的基本特征;然后,使用循环神经网络对提取的特征进行预测,并输出一个关于汉字字符集的概率分布;最后,采用连接主义序列分类算法计算识别结果并构建损失函数。利用所提方法在手写汉字文本数据集上进行实验, 结果表明,Inception模块和数据增强方法可以有效提升算法的性能,并取得了71.2%的识别准确率和0.060的文本编辑距离,较现有方法性能有所提升,证明了所提方法的有效性。

     

    Abstract:
      Objectives  In order to solve the problem that the accuracy of handwritten Chinese character text recognition is not high, an end-to-end method for handwritten Chinese character text recognition based on convolutional neural network and recurrent neural network is proposed.
      Methods  Firstly, the convolutional neural network constructed by inception module is used to extract the basic features of the text image. Secondly, the recurrent neural network is used to predict the extracted features and output a probability distribution about the Chinese character set. Finally, the connectionist temporal classification algorithm is used to calculate the recognition results and construct the loss function.
      Results  The proposed method is tested on the handwritten Chinese character text dataset, experimental result shows that the Inception module and data enhancement method can effectively improve the performance of the algorithm, obtain the recognition accuracy of 71.2% and the text editing distance of 0.060.
      Conclusion  Our proposed method can conduct end-to-end handwritten Chinese character text recognition, and improve the recognition accuracy compared with the existing methods.

     

/

返回文章
返回