点云智能理解：从点云中心至语言中心的转变

张云生; 王旭颖; 陈斯飏; 李海峰

doi:10.13203/j.whugis20240497

点云智能理解：从点云中心至语言中心的转变

Point Cloud Intelligent Understanding： Transition from Point-Centric to Language-Centric Paradigm

摘要

摘要: 三维时空信息在实景三维中国建设的国家新型基础设施战略中占据重要地位，机载航空点云数据在大规模三维时空信息获取中发挥了关键作用。围绕点云数据的深度解译与语义信息提取需求，系统梳理了点云理解范式的发展脉络，从前深度学习时代的模型驱动与数据驱动范式，到后深度学习时代的自监督信号驱动范式，再到当前基于预训练基础大模型的大模型驱动范式。在此基础上，提出了以语言为中心的机载航空点云理解范式，系统分析了其面临的关键挑战及未来发展方向，旨在为点云智能理解的技术创新与研究拓展提供新的视角和思路。

Abstract:
Objectives In recent years, 3D spatiotemporal information has become a cornerstone of national new infrastructure construction, with airborne point clouds playing a critical role in large-scale 3D data acquisition. However, effectively interpreting these point clouds and extracting their semantic richness remain significant challenges. Traditional methods, including geometry-driven feature extraction and data-driven deep learning models, have achieved limited progress but struggle with the complexity and scale of airborne point clouds. Advances in large-scale foundation models (FM), particularly in natural language processing and vision-language integration, offer new opportunities for point cloud understanding. Large language models exhibit exceptional generalization and cross-modal semantic capabilities, enabling a language-centered paradigm. By leveraging language models to map point cloud data into a semantically enriched space, this approach addresses limitations of traditional methods.
Methods The evolution of point cloud understanding is examined across geometry- and data-driven approaches, self-supervised paradigms, and FM-driven methodologies. A language-centered framework for airborne point cloud understanding is proposed, tackling high-level semantic modeling, cross-modal alignment, and downstream task adaptation. Results demonstrate enhanced semantic representation, improved generalization, and significant advantages in complex scenarios.
Results The findings provide new insights into point cloud understanding and establish a foundation for integrating large-scale models into 3D applications.
Conclusions These contributions offer innovative perspectives and technical solutions for advancing point cloud technologies in national infrastructure projects.

HTML全文

参考文献(111)

施引文献

资源附件(0)