YING Shen, LI Weiyang, HE Biao, WANG Wei, WAN Yuan. Chinese Segmentation of City Address Set Based on the Statistical Decision Tree[J]. Geomatics and Information Science of Wuhan University, 2019, 44(2): 302-309. DOI: 10.13203/j.whugis20170072
Citation: YING Shen, LI Weiyang, HE Biao, WANG Wei, WAN Yuan. Chinese Segmentation of City Address Set Based on the Statistical Decision Tree[J]. Geomatics and Information Science of Wuhan University, 2019, 44(2): 302-309. DOI: 10.13203/j.whugis20170072

Chinese Segmentation of City Address Set Based on the Statistical Decision Tree

  • Different from the conventional address word segmentation model, which relies on the city address dictionary or the rule set, this paper proposes a word segmentation method which does not depend on the address dictionary but based on massive address data mining. This method combines the statistic rules to calculate the distribution of the address elements in the address dataset, excavates the suffix points and the drop points of the address elements in the address data. The method constructs the statistical decision tree based on their relative position relations to extract the address elements, uses the investigation data of building address in Shenzhen to verify and to make a useful supplement to the current gazetteers.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return