众源地理空间数据的空间文本相关性分析

Spatial-Textal Correlation Analysis Based on Crowdsource Geospatial Data

  • 摘要: 众源地理空间数据作为一种由大众采集并向大众提供的开放地理数据,蕴含着丰富的空间信息和规律性知识,其中具有代表性的是签到数据。基于地理学第一定律:所有的事物都是相互联系的,但离得越近,彼此之间的联系越强,利用移动社交网站中的签到数据,研究空间与文本的相关性,在对数据进行了预处理和地理映射处理的前提下,统计出各区域的文本属性值,在空间尺度的变化下采用探索性空间分析法分别对美国各州、纽约市和洛杉矶市做全局空间自相关性分析和局部空间自相关性分析。结果表明,不同的文本属性信息在空间上存在着不同全局空间自相关特性,局部自相关的分析也揭示了文本的聚集规律,为相关决策部门或企业制定合理决策提供了合理科学的依据。

     

    Abstract: Crowdsource geospatial data is one kind of open geographic data collected by the public. A wealth of spatial information and knowledge are hidden in crowdsource geospatial data. Check-in data is one of the representative crowdsource geospatial data. Most existing work on spatial-textual objects, such as evaluating the similarity of two spatial-textual objects in spatial keyword query, considers that spatial similarity and text similarity are independent of each other. According to the first law of geography: Everything is connected; the closer two objects are, the stronger their connection is. We explore the correlation between spatial information and textual information in the real check-in data scrawled from the location based social networks. After data preprocess and geographic mapping, we computed the textual attribute values in each region. Then, we use the exploratory spatial analysis to analyze the global spatial autocorrelation and local spatial autocorrelation in different the spatial scales, that is different states in United States and the two cities such as New York and Los Angeles, respectively. The results show that different textual attributes in different regions have different global spatial autocorrelation; the results obtained from the local autocorrelation analysis show the phenomenon that the textual attributes get together. Both the above results provide the basis for research on the assumption that "the texts are similar in the near space". Furthermore, the conclusion can help departments or enterprises to make reasonable decisions.

     

/

返回文章
返回