基于国产超算的遥感图像菱形三十面体六边形格网组织与并行处理算法

Parallel Processing Algorithm for Organizing Remote Sensing Images on a Rhombic Triacontahedron Hexagonal Grid Using Domestic Supercomputing

  • 摘要: 遥感对地观测数据体量的爆炸增长对数据组织模型和处理能力提出较高要求。全球离散格网系统(Discrete Global Grid System,DGGS)有助于建立以空间位置为主键的数据关联新模式,是支持遥感对地观测数据统一组织的全新发展思路。中央处理器(CentralProcessing Unit,CPU)+深度计算单元(Deep Computing Unit,DCU)异构计算架构是具备完全自主知识产权的高性能计算体系,为海量遥感数据的高效处理创造有利条件。基于该架构实现遥感数据菱形三十面体等积四孔六边形DGGS并行格网化处理。首先,设计基于CPU+DCU的遥感数据格网化组织总体框架,重点分析关键算法的效率瓶颈和计算热点;然后,针对异构编程模型要求对算法热点并行化重构与代码适配;最后,在单个计算节点上验证算法移植优化的有效性以及单DCU和多DCU加速卡的并行计算能力。实验结果表明,与传统CPU串行算法相比,算法并行重构后单DCU加速比可约达50~200倍,最高可达590倍;多DCU并行加速进一步缩短了算法运行时长,在处理大规模、复杂数据处理任务时的优势更显著。该成果丰富了国产CPU+DCU异构计算生态,有助于推动遥感数据DGGS组织应用与国产自主可控高性能计算平台的深度融合。

     

    Abstract: Objectives: The rapid advancement of remote sensing Earth observation technology has triggered an explosive growth in data volume, imposing stringent demands on data organization and processing efficiency. There is a pressing need for more suitable data models and enhanced computational power to provide robust support for the organization, processing, and analysis of massive remote sensing datasets. The Discrete Global Grid System (DGGS), characterized by its multi-resolution, discrete, and hierarchical structure formed through recursive partitioning of Earth's space, offers a promising solution. It facilitates a novel data association model with spatial location as the primary key, representing an innovative paradigm for the unified organization and integrated processing of Earth observation data. Concurrently, domestic supercomputers, as high-performance computing facilities with fully independent intellectual property rights, possess unique architectures that deliver formidable computational capabilities. This creates favorable conditions for the efficient processing of massive remote sensing data. This study aims to bridge these two frontiers by implementing and parallelizing the organization of remote sensing data based on the Rhombic Triacontahedron Leeuwen Equal-area Aperture 4 Hexagonal Discrete Global Grid (RTLEA4HDGGS) on a domestic supercomputing platform. Methods: We designed and implemented a parallel processing framework for organizing remote sensing data within the RTLEA4H DGGS on a domestic supercomputing platform. First, the overall strategy for grid-based remote sensing data organization on the supercomputing platform was formulated, with a focused analysis on the efficiency bottlenecks and computational hotspots of the core algorithms. Subsequently, to meet the requirements of the platform's hardware and heterogeneous programming models, we conducted parallel restructuring and code adaptation of these algorithmic hotspots. This involved optimizing data structures, refining parallel granularity, and efficiently managing data transfer between the host (CPU) and accelerators. The goal was to achieve large-scale, grid-based processing of remote sensing imagery. Finally, the effectiveness of the algorithm migration and optimization was validated, and the parallel computing performance was assessed using computing nodes of the supercomputer. Evaluations were conducted for scenarios utilizing a single Deep Computing Unit (DCU) and multiple DCU acceleration cards, respectively. Results: Experimental results demonstrate significant performance gains after parallel restructuring. The resampling algorithm for remote sensing images achieved a speedup ratio ranging from 50 to 200 times. The theoretical acceleration limit for a single DCU was approximately 590 times. Employing multiple DCUs further reduced the algorithm's execution time, enhancing its adaptability to high-resolution and large-scale application scenarios. Analysis revealed that data transfer between the CPU and DCU constitutes an overhead that must be considered in algorithm parallelization. However, as the processing hierarchy increases, the relative impact of this overhead diminishes compared to the total computational time on a single CPU. Consequently, the overall algorithm speedup ratio gradually increases with higher processing levels, highlighting the advantage of the parallelized approach for complex, multi-scale tasks. Conclusions: The successful parallel implementation of the RTLEA4HDGGS based data organization method on the domestic supercomputing platform validates its high efficiency and scalability. The performance advantages are especially pronounced for large-scale and complex data processing tasks. This research contributes to promoting the deep integration of advanced remote sensing data management frameworks with autonomous, controllable domestic high-performance computing infrastructure. It provides a technical pathway for leveraging powerful national computing resources to address the challenges posed by massive Earth observation data, thereby supporting applications in areas such as environmental monitoring, resource management, and climate studies.

     

/

返回文章
返回