几种具有编解码结构的深度学习模型在建筑物综合中的应用与比较

杜佳威; 武芳; 行瑞星; 李彩霞; 李靖涵

doi:10.13203/j.whugis20200143

几种具有编解码结构的深度学习模型在建筑物综合中的应用与比较

Trial and Comparison of Some Encoder-Decoder Based Deep Learning Models for Automated Generalization of Buildings

摘要

摘要: 深度学习技术促使诸多领域研究取得突破性进展, 基于深度神经网络的地图综合研究备受期待。将建筑物综合过程抽象解释为编解码过程, 系统地研究基于编解码结构的深度神经网络在建筑物综合中的应用。首先, 利用空间划分与矢量-栅格数据转换相结合的方式构建样本和样本集; 然后, 利用样本集训练基于编解码结构的深度神经网络, 实现建筑物综合学习泛化并测试、评估其效果; 最后, 搭建5种代表性的基于编解码结构的深度神经网络, 分析比较各模型在建筑物综合中的应用效果。实验结果表明, 基于编解码结构的深度神经网络能够从建筑物综合样本中学习或推理出部分建筑物综合知识和综合操作, 且5种模型中Pix2Pix更适用于建筑物综合的学习模拟。

Abstract:
Objectives The deep learning technology has made a breakthrough in many fields, and the automated map generalization based on the deep neural network is highly expected. Building generalization is a typical generalization case and can be abstracted and interpreted as the process of encoding and decoding. It is worth systematically studying the simulation of building generalization with encoder-decoder based deep neural networks.
Methods Samples are drawn from vector buildings via the segmentation of spatial data and the conversion between vector data and raster data, which is based on a natural principle of objective generalization (i.e., the legibility principle drawn by scale). The encoder-decoder based deep neural networks are trained and tested with these samples to learn buildings generalization and to evaluate this application. Five representative models based on encoder-decoder structure are constructed for learning of generali-zing buildings from 1∶10 000 to 1∶50 000. The experimental data cover 44.1 km×38.6 km and contain 89 539 and 21 732 buildings before and after generalization respectively.Five representative models are EDnet, Unet, ResUnet, Unet++ and Pix2Pix representing the basic multi-layers encoder-decoder neural network, the encoder-decoder neural network with skip connection, the residual encoder-decoder neural network, the dense encoder-decoder neural network, and the encoder-decoder neural network with generative adversarial mechanism respectively. The training process and test results of the three constructed models are compared, and effects of these five models while they are applied to the automated generalization of buildings are analyzed.
Results The pixel accuracies of predicted results during EDnet training and testing are lowest obviously, Unet takes the second lowest place, and the pixel accuracies of results generated by other models(ResUnet, Unet++ and Pix2Pix) are higher and relatively close. Additionally, there are some holes and fuzzy representations in generalized buildings predicted by ResUnet and Unet++, which does not conform with the regular representation of buildings. Generalization results predicted by the Pix2Pix show that most of generalized building edges are clear and generalized buildings can be represented independently and completely.
Conclusions The Pix2Pix is more suitable than the other four models for learning and simulating buildings generalization in our experiment. The encoder-decoder based deep neural networks are likely to deduce some knowledge and operators of buildings generalization by learning from samples of the original and generalized data. The encoder-decoder based deep neural networks have the potential in the learning of map generalization.

HTML全文

参考文献(25)

施引文献

资源附件(0)