结合YOLO与句法模型的统计专题地图信息提取方法

An Information Extraction Method for Statistical Thematic Maps Combining YOLO and Syntactic Model

  • 摘要: 人工智能时代,从海量专题地图中快速智能提取地图信息成为地图学研究的重要目标。本研究从符号句法的角度分析统计地图,收集133张泛源统计专题地图,标注图面元素3512个作为训练集,根据地图结构层级,分两次使用改进YOLOv8模型,先后对地图图面元素和专题符号中的图元进行提取和分类;再利用多种特征识别手段,提取图元中的视觉变量,最后基于专题地图句法模型生成结构化描述信息,并转换为由专题、位置、类别、数值构成的元组。结果表明,改进YOLOv8模型在地图元素及图元提取中效果极佳,mAP50分别达到92.6%和99.4%。本文梳理了统计地图结构与信息表达逻辑,依此提取并解读地图信息,并整理成元组结构,将会方便后续构建知识图谱等应用。

     

    Abstract: Objectives: With the advancement of artificial intelligence, automatic extraction and interpretation of map information from large-scale statistical thematic maps have become significant research topics in cartography. This study aims to develop a method that combines the YOLOv8 model with a syntactic model to extract structured information from statistical thematic maps, thereby supporting applications such as knowledge graph construction and intelligent cartographic recommendations. Methods: The research employed an improved YOLOv8 model to extract and classify various map elements from a dataset of 133 statistical thematic maps with 3,512 annotated elements. To enhance the performance of YOLOv8 model for this specific application, the model was modified by incorporating Residual Hybrid Attention Groups (RHAG) in the Backbone and replacing the upsampling part with DySample upsampler. Subsequently, pattern recognition techniques were employed to extract the visual variables of graphical elements. Finally, based on a syntactic model, structured descriptions of the map content were generated, which involved analyzing the relationships between graphical elements and reconstructing the hierarchical and syntactic structure of the symbols. Results: The improved YOLOv8 model demonstrated remarkable effectiveness in extracting both map elements and graphical primitives, achieving mean Average Precision (mAP50) values of 92.6% and 99.4%, respectively. This indicates a high level of precision in identifying and classifying various elements within the maps. The method successfully captured the syntactic structure of statistical thematic maps, enabling the extraction of semantic information such as thematic categories, spatial relationships, and quantitative values. Conclusions: The combination of object detection and syntactic modeling can efficiently extract structured information from statistical thematic maps. This approach provides an in-depth interpretation of map content and highlights its practical utility in cartographic analysis. Future research will continue to explore and optimize this method for broader applications.

     

/

返回文章
返回