基于中文分词的加权地理编码在COVID-19疫情防控空间定位中的应用

Weighted Geocoding Method Based on Chinese Word Segmentation and Its Application to Spatial Positioning of COVID-19 Epidemic Prevention and Control

  • 摘要: 地理编码是实现带有地址描述的信息空间定位的重要途径。比较研究了国内外地理编码方法,分析了中文地址的组成方式和定位方法。针对中文地址高度复杂性和多样性的特征,设计了一种顾及多种语义的地址匹配算法,并以武汉市新型冠状病毒肺炎(coronavirus disease 2019,COVID-19)病人入院时登记的地址描述信息为例,对匹配算法进行了实验验证,将匹配结果进行空间定位。结果表明,所提出的中文分词的加权地理编码方法匹配高效、定位准确、方法智能,能够实现基于语义的COVID-19病人入院时登记地址的快速定位,可为疫情防控提供准确的空间定位信息。

     

    Abstract: Locating the coronavirus disease 2019 (COVID-19)cases in the accurate place is important in epidemic prevention and control. Geocoding is an effective method to achieve information space positioning with address description. The English based geocoding methodology is not suitable for Chinese address. Composition and positioning methods of Chinese address are discussed. According to the characteristics of high complexity and diversity of Chinese address, a Chinese word segmentation weighted address matching algorithm considering a variety of semantics is designed, including the same pronunciation but different Chinese word address, abbreviation and alias of Chinese address, different description of the same address. And the matching accuracy and efficiency of the algorithm are tested by using the COVID-19 cases' addresses in Wuhan. The result indicates the algorithm is efficient, accurate, and intelligent, which can realize the efficient location of the COVID-19 cases address, and provide accurate spatial location information for epidemic prevention and control by quickly positioning of the COVID-19 cases.

     

/

返回文章
返回