面向城市增强现实信息标注的建筑物场景结构提取方法

Building Scene Structure Extraction Method for Urban Augmented Reality Annotation

  • 摘要: 在信息标注中融入场景结构信息能有效解决城市增强现实(augmented reality, AR)信息指示不明、空间遮挡表达不清和视图布局叠置等问题。针对城市AR信息标注中场景结构缺失的问题,提出了一种考虑地理语义且兼顾精度、效率和稳健性的建筑物场景结构提取方法。首先构建了面向建筑物场景结构提取的场景感知网络,从单张场景图像中提取语义、深度和法向量信息;然后将其转化为建筑物立面角点和朝向等结构特征,并计算与2D地图中建筑物轮廓之间的最佳匹配;最后将场景图像按建筑物立面进行重构,提取区域轮廓、场景深度和立面朝向等场景结构。试验采用Google街景数据生成的9 470组样本训练和测试场景感知网络,并在Graz地区32组建筑物场景中测试场景结构提取的有效性。结果表明,所提方法能够达到近实时的效果,且立面轮廓更规则。在存在一定配准误差或局部遮挡的条件下,立面提取的质量显著优于基于地图解析场景图像的方法,具有更好的稳健性。

     

    Abstract:
      Objectives  In urban augmented reality (AR), the problems such as unclear indication, confusing occlusion and overlapping can be effectively solved by incorporating scene structure into information annotation. Aiming at the problem of missing scene structure in information annotation, a method for extracting building scene structure is proposed, which distinguishes geographical entities and also considers the accuracy, efficiency and robustness.
      Methods  First, a scene perception network for building scene structure extraction is constructed to extract semantic label, scene depth and surface normal from a single scene image. Second, structure features such as building facade corners and orientation are obtained by transforming previous results. Third, the best matching between structure features and the building outlines in 2D map is calculated. Finally, the scene image is reconstructed according to geographical entities and the structure information is generated, including region contours, scene depth and facades orientation.
      Results  Experiments are conducted with self-constructed and public datasets. The results show that the proposed method can extract the structure of building scene in 25-45 ms, and the facade contours are more regular. Despite the geo-registration errors or partial occlusion, the quality of facade extraction is significantly better than results from the map-based method.
      Conclusions  The proposed method can extract the structure of building scene in near real time with regular facade contours and good robustness, which is very useful for information annotation in urban AR.

     

/

返回文章
返回