结合YOLOv8与句法模型的统计专题地图信息提取方法

An Information Extraction Method for Statistical Thematic Maps Combining YOLOv8 and Syntactic Model

  • 摘要: 人工智能时代,从海量专题地图中快速智能地提取地图信息成为地图学研究的重要目标。首先,从符号句法的角度分析统计地图,收集133张泛源统计专题地图,标注3 512个图面元素作为训练集;其次,根据地图结构层级,分两次使用改进的YOLOv8模型,先后对地图图面元素和专题符号中的图元进行提取和分类;然后,利用多种特征识别手段提取图元中的视觉变量;最后,基于专题地图句法模型生成结构化描述信息,并转换为由专题、位置、类别、数值构成的元组。实验结果表明,改进的YOLOv8模型在地图元素及图元提取中效果极佳,均值平均精度分别达到92.6%和99.4%。此外,梳理了统计地图结构与信息表达逻辑,依此提取并解读地图信息,将其整理成元组结构,便于后续构建知识图谱等应用。

     

    Abstract:
    Objectives With the advancement of artificial intelligence, automatic extraction and interpretation of map information from large-scale statistical thematic maps have become significant research topics in cartography. The aim to develop a method that combines the YOLOv8 model with a syntactic model to extract structured information from statistical thematic maps. The primary objectives are to enhance extraction accuracy, decode semantic meanings based on cartographic syntax, and generate tuple-structured data for applications like knowledge graph construction and intelligent cartographic recommendations.
    Methods We employed an improved YOLOv8 model to extract and classify various map elements. First, a dataset of 133 statistical thematic maps was compiled from diverse sources, including online atlases and scanned documents, with 3 512 map elements annotated as the training set. Elements were categorized into six types: thematic symbols, map title, legend, scale, other text, and other charts. To enhance the performance of YOLOv8 model for this specific application, the model was modified by incorporating residual hybrid attention groups (RHAG) in the Backbone and replacing the upsampling part with DySample upsampler. Subsequently, the model detected and classified map surface elements; subsequently, thematic symbols were processed to extract geometric primitives through a second YOLOv8 application. Pattern recognition techniques, such as Hough transform, were then used to extract visual variables, including construction attributes and color attributes. Finally, based on a syntactic model, structured descriptions of the map content were generated, which involved analyzing the relationships between graphical elements and reconstructing the hierarchical and syntactic structure of the symbols.
    Results The improved YOLOv8 model demonstrates remarkable effectiveness. In map element extraction, it achieves a mean average precision (mAP)50 of 92.6% and mAP50:95 of 73.8%, with precision rates up to 99.5% for categories like scale and other text. For geometric primitive extraction, the model achieved an mAP50 of 99.4% and mAP50:95 of 90.3%, accurately identifying primitives even in complex symbols. Ablation studies confirmes the contributions of RHAG and DySample, showing improvements over the baseline model. The method successfully captures the syntactic structure of statistical thematic maps, enabling the extraction of semantic information such as thematic categories, spatial relationships, and quantitative values.
    Conclusions The combination of object detection and syntactic modeling can efficiently extract structured information from statistical thematic maps. This approach provides an in-depth interpretation of map content and highlights its practical utility in cartographic analysis. Future research will continue to explore and optimize this method for broader applications.

     

/

返回文章
返回