GRACE反演长江流域陆地水储量变化的机器学习降尺度方法研究

Machine Learning Downscaling Methods for Inversion of Terrestrial Water Storage Anomalies in the Yangtze River Basin by GRACE

  • 摘要: 重力恢复与气候实验(gravity recovery and climate experiment,GRACE)卫星时变重力场反演陆地水储量变化(terrestrial water storage anomaly,TWSA)的空间分辨率有限,制约了其在中小尺度区域水循环和气候变化研究中的应用潜力。当前,基于机器学习的降尺度方法已在提升GRACE TWSA空间分辨率方面取得了显著进展。然而,预测因子的合理选取及其对模型性能的影响机制,以及降尺度结果的精度评估与不确定性分析等问题,仍有待进一步深入研究。以长江流域TWSA的降尺度处理为例,对比传统的水文模型降尺度方法和人工神经网络(artificial neural network,ANN)、随机森林(random forest,RF)、极端梯度提升(extreme gradient boosting,XGBoost)3种机器学习模型,将GRACE反演的长江流域空间分辨率为1°×1°的TWSA数据降尺度到0.25°×0.25°和0.1°×0.1°。为了对不同降尺度方法进行评价,首先使用全球陆面数据同化系统水文模型的TWSA数据进行闭合模拟实验,评估不同降尺度方法的性能;然后对GRACE TWSA数据进行降尺度处理,并采用水文站水位数据评估不同降尺度方法的性能和结果。研究结果表明,机器学习降尺度模型的性能受到预测因子数量的影响,随着预测因子数量的增加,降尺度性能也会持续增强,3种机器学习模型的降尺度性能优劣也会发生改变。采用偏最小二乘回归分析模型得到重要性较高的前6个预测因子,即归一化植被指数、土壤湿度、降水量、气温、径流和U形风,在机器学习降尺度模型中取得较好的效果,增加更多的预测因子对降尺度模型性能的提升相对较小。RF和XGBoost降尺度模型的性能表现较好且非常接近,ANN和水文模型降尺度模型的效果略差。此外,水文模型的降尺度结果依赖水文模型与TWSA数据的相关性,机器学习降尺度模型能够更好地融合水文、气象和植被等多种数据的变化特征,从而恢复中小尺度区域精细的TWSA信号。

     

    Abstract:
    Objectives The spatial resolution of the time-varying gravity field model provided by gravity recovery and climate experiment(GRACE) satellite gravity for retrieving terrestrial water storage anomaly (TWSA) is limited, which restricts its application potential in the study of regional water cycle and climate change. The current machine learning downscaling methods have been effective in improving the spatial resolution of GRACE TWSA data, but further exploration is needed on the reasonable selection of predictive factors and their impacts on the performance of machine learning models, as well as the accurate evaluation of downscaling results.
    Methods A hydrological model downscaling method and three machine learning model downscaling methods, including random forest (RF), extreme gradient boosting (XGBoost), and artificial neural network (ANN), were adopted to downscale the TWSA data obtained from GRACE inversion of the Yangtze River Basin with the spatial resolutions from 1°×1° to 0.25°×0.25° and 0.1°×0.1°, respectively. To evaluate different downscaling methods, a closed-loop simulation experiment was conducted using TWSA data from global land data assimilation system hydrological model to evaluate the performance of different downscaling methods. Subsequently, the TWSA data obtained from GRACE was downscaled, and the performance and results of different downscaling methods were comprehensively evaluated using measured water level data.
    Results The performance of machine-learning-models-based downscaling method is sensitive to the number of predictive factors. As the number of predictors increases, the overall downscaling performance improves, although the magnitude of improvement varies among models. Partial least squares regression analysis indicates that the six most important variables ,normalized difference vegetation index (NDVI), soil moisture, precipitation, temperature, runoff, and U-wind, are sufficient to achieve robust performance, while the inclusion of additional predictors yields only marginal gains. Closed-loop simulation experiments further reveal differences among models. Compared with RF and XGBoost models, ANN model shows relatively lower performance metrics; however, it produces the most favorable downscaled spatial patterns. When evaluated against water level observations in the Yangtze River Basin, TWSA remains consistent with observed water levels both before and after downscaling. In particular, RF-based results show a marked improvement in correlation, with all coefficients exceeding 0.7. Using Poyang Lake as a representative case, the long-term trends of the machine learning downscaling results were compared with those of the three predictors with the highest variable importance in projection scores, including NDVI, soil moisture, and precipitation. The results indicate that the downscaled outputs exhibit long-term trends highly consistent with NDVI and soil moisture, and their spatial variations closely correspond to the extent of water-covered areas in Poyang Lake. In contrast, precipitation does not display a similar trend pattern.
    Conclusions The best downscaling methods are RF and XGBoost, while the downscaling methods of ANN and hydrological model perform poorly. Additionally, the results of hydrological model downscaling depend on the correlation between the hydrological model and GRACE data. However, machine learning downscaling methods can better integrate the changing characteristics of different auxiliary data such as hydrology, meteorology, and vegetation (especially important predictive factors), thus enabling better recovery of detailed TWSA signals in the watershed.

     

/

返回文章
返回