Adaptive Multi-level Feature Fusion for Scene Ancient Chinese Text Recognition

TU Chao-hu; YI Yao-hua; WANG Kai-li; PENG Ji-bing; YIN Ai-guo

doi:10.13203/j.whugis20230176

Turn off MathJax

Article Contents

Abstract

TU Chao-hu, YI Yao-hua, WANG Kai-li, PENG Ji-bing, YIN Ai-guo. Adaptive Multi-level Feature Fusion for Scene Ancient Chinese Text Recognition[J]. Geomatics and Information Science of Wuhan University. DOI: 10.13203/j.whugis20230176

Citation:

PDF (1561 KB)

Adaptive Multi-level Feature Fusion for Scene Ancient Chinese Text Recognition

1 School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China
2 Digital Imaging and Intelligent Perception Research Center, Wuhan University, Wuhan 430079, China
3 School of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan 430205, China
4 Zhuhai Pantum Electronics Co., Ltd, Zhuhai 519060, China

More Information

Received Date: October 25, 2023
Available Online: December 14, 2023

Graphical Abstract

Abstract

Abstract

Objectives: Ancient Chinese text are widely distributed in inscriptions, couplets, stone engravings and other scenes, which have the characteristics of complex background, large number of characters, and diverse writing forms. The large number of characters and writing forms directly lead to the difference in text structure complexity. Methods: To solve the difficulty of recognizing ancient Chinese text with complex structures, we propose an adaptive multilevel feature fusion network. First, ResNet152 is the main backbone network, and its deeper network and residual structure can fit more parameters to learn the features of ancient Chinese text and avoid the degradation of the model. Second, according to the structural complexity of ancient Chinese text, the importance of each feature map is automatically obtained through learning, so that the model adaptively selects and merges the shallow detail information and high-level semantic information of ancient Chinese text, obtains the high discrimination features of ancient Chinese text and improves the recognition ability of the model. Finally, the maximum boundary cosine loss is used to minimize the cosine similarity between different ancient Chinese text, increase the inter-class distance of ancient Chinese text, and reduce the intra-class distance between similar Chinese text. Combined with the cross entropy loss function as a loss function, the model can improve the discrimination ability of ancient Chinese text with similar structures. Results: The experimental results show that when the multistage feature fusion module is added to the proposed method, the Top-1 accuracy rate is increased by 1.59%, and when the maximum boundary cosine loss function is added, the Top-1 accuracy rate is increased by 1.09%. The best effect of Top-1 identification accuracy rate on the multi-scene ancient Chinese character dataset is 79.58%. Compared with the current optimal method, it improves the recognition accuracy of scene ancient Chinese text by 3.27%. Conclusions: In this paper, a multistage feature fusion network is designed to improve the feature extraction ability of the model, and the maximum boundary cosine loss is introduced to increase the distance between ancient Chinese text and narrow the distance within ancient Chinese text.
- multi-scene ancient Chinese text,
- adaptive multi-level feature fusion,
- large margin cosine loss

FullText(HTML)

References (0)

[1]	YANG Wan-ling, GAO Wu-tong, LIU Lu, WANG Bo, YAN Jian-guo. Orbit determination and accuracy analysis for comet 311P based on space-based and ground-based optical data[J]. Geomatics and Information Science of Wuhan University. DOI: 10.13203/j.whugis20220710
[2]	LI Duoduo, ZHOU Xuhua, LI Kai, XU Kexin, TAO Enzhe. Precise Orbit Determination for HY2B Using On-Board GPS Data[J]. Geomatics and Information Science of Wuhan University, 2023, 48(12): 2060-2068. DOI: 10.13203/j.whugis20210303
[3]	WANG Bo, LIU Lu, YAN Jianguo, GAO Wutong. Development of Asteroid Optical Determination Software and Data Processing Analysis[J]. Geomatics and Information Science of Wuhan University, 2023, 48(2): 277-284. DOI: 10.13203/j.whugis20200195
[4]	ZHOU Xuhua, WANG Xiaohui, ZHAO Gang, PENG Hailong, WU Bin. The Precise Orbit Determination for HY2A Satellite Using GPS,DORIS and SLR Data[J]. Geomatics and Information Science of Wuhan University, 2015, 40(8): 1000-1005. DOI: 10.13203/j.whugis20130730
[5]	LI Wenwen, LI Min, SHI Chuang, ZHAO Qile. Jason-2 Precise Orbit Determination Using DORIS RINEX Phase Data[J]. Geomatics and Information Science of Wuhan University, 2013, 38(10): 1207-1211.
[6]	ZHANG Xiaohong, LI Pan, ZUO Xiang. Kinematic Precise Orbit Determination Based on Ambiguity-Fixed PPP[J]. Geomatics and Information Science of Wuhan University, 2013, 38(9): 1009-1013.
[7]	LI Min, ZHAO Qile, GE Maorong. Simulation Research on Precise Orbit Determination for GIOVE-A[J]. Geomatics and Information Science of Wuhan University, 2008, 33(8): 818-820.
[8]	ZHAO Qile, SHI Chuang, LIU Xianglin, GE Maorong. Determination of Precise Orbit Using Onboard GPS Data for Gravity Modeling Oriented Satellites[J]. Geomatics and Information Science of Wuhan University, 2008, 33(8): 810-814.
[9]	ZHAO Qile, LIU Jingnan, GE Maorong, SHI Chuang. Precision Orbit Determination of CHAMP Satellite with cm-level Accuracy[J]. Geomatics and Information Science of Wuhan University, 2006, 31(10): 879-882.
[10]	GUO Jinlai, ZHAO Qile, GUO Daoyu. Reducing the Influence of Gravity Model Error on Precise Orbit Determination of Low Earth Orbit Satellites[J]. Geomatics and Information Science of Wuhan University, 2006, 31(4): 293-296.

Cited By

Get Citation

PDF

XML

Article views (162) PDF downloads (33)

Adaptive Multi-level Feature Fusion for Scene Ancient Chinese Text Recognition

Abstract

Related Articles

Catalog

Related

Adaptive Multi-level Feature Fusion for Scene Ancient Chinese Text Recognition

Abstract

Related Articles

Catalog

Related

Export File

Citation

Format

Content