融合颜色特征的随机森林特征优选的黄河三角洲植被信息分类

Classification of Vegetation Information by Integrating Color Features with Multi-feature Optimization of Random Forest in the Yellow River Delta

  • 摘要: 湿地盐沼植被的监测是黄河三角洲湿地生态功能保护与恢复的基础。以黄河三角洲湿地部分区域为研究区,以高分辨率航空影像为数据源,生成了光谱特征、颜色特征、指数特征和纹理特征4种特征变量并构建了不同的分类方案。利用随机森林方法对每种提取方案进行植被分类并验证其精度,旨在探求不同特征变量对分类的影响及原因,选取最佳的优选特征改善植被分类的效果。结果表明,指数特征对碱蓬提取有积极作用,纹理特征会降低植被分类的精度,融合颜色特征进行分类是提高总体精度的关键;基于随机森林特征优选提取效果最佳,总体精度为88%,Kappa系数为0.85。所提方法能有效区分植被与非植被,同时将各植被类型提取出来。该研究为黄河三角洲植被信息提取在特征选取与方法上提供了一种有效的技术路线。

     

    Abstract:
    Objectives The monitoring of salt marsh vegetation is the basis for protection and restoration of ecological functions in the Yellow River Delta (YRD) wetland. Due to the complexity of vegetation growth conditions and the spread of the invasive species Spartina alterniflora, vegetation detection and classification are particularly important.
    Methods This paper takes part of the YRD wetland as the study area, and chooses high-resolution aerial images as data source. Four feature variables are generated, including spectral features, color features, index features and texture features. Six different classification schemes are constructed. Scheme 1 with only spectral features is regarded as the control group. Index features, texture features and color features are integrated into scheme 2, scheme 3 and scheme 4, respectively. Scheme 5 contains all features, and scheme 6 constructs a multi-feature optimization feature set. Random forest method is used to classify vegetation for each extraction scheme and the corresponding accuracies are verified, aiming to explore the influences and reasons of different feature variables on the classification. The best preferred features are selected to improve the effect of vegetation classification.
    Results Based on the visible and near-infrared spectra, just adding different features to the experiment has different effects on the accuracies of vegetation classification. Scheme 1, scheme 2 and scheme 3 have unsatisfactory extraction effect on Phragmites australis and Suaeda salsa. Scheme 4, scheme 5 and scheme 6 with the addition of color features can better distinguish between the two, probably because Suaeda salsa shows dark red on the image, which is quite different from other vegetation. Scheme 4 divides non-vegetated tidal flat areas and water body edges into Phragmites australis, which are similar in color, and adds color features results in some misclassifications. Scheme 5 and scheme 6 are well classified, but due to the mixed nature of vegetation, all schemes have varying degrees of misclassification of Tamarix chinensis, Phragmites australis and Spartina alterniflora. Index features have positive effects on Suaeda salsa extraction, texture features reduce the accuracy of vegetation classification, and the integration of color features is the key to improve overall accuracy of classification. Based on multi-feature optimization of random forest, the extraction effect is the best, with overall accuracy of 88% and Kappa coefficient of 0.85.
    Conclusions The main advantages of this study are the acquisition of new data sources, the introduction of multiple feature variables, and the experimental evaluation and classification accuracy analysis of different feature variables. The importance of color features is verified and multi-feature optimization of random forest by integrating color features is a feasible method to classify vegetation information in the YRD. The proposed method can effectively distinguish vegetation from non-vegetation and extract each vegetation type at the same time. This study provides an effective technical route in feature selection and methodology for vegetation information extraction in the YRD.

     

/

返回文章
返回