欢迎访问林业科学,今天是

林业科学 ›› 2025, Vol. 61 ›› Issue (4): 9-19.doi: 10.11707/j.1001-7488.LYKX20240485

• 专题:智慧林业 • 上一篇    

基于GF-1数据的二阶段遥感特征优选与森林地上生物量反演模型

杨菲菲1, 张王菲1, 赵磊2, 赵含1, 姬永杰3, 王梦金1   

  1. 1. 西南林业大学林学院 昆明 650224;
    2. 中国林业科学研究院资源信息研究所 北京 100091;
    3. 西南林业大学水土保持学院 昆明 650224
  • 收稿日期:2024-08-13 修回日期:2024-11-20 发布日期:2025-04-21
  • 通讯作者: 张王菲为通信作者。E-mail:mewhff@163.com。
  • 基金资助:
    国家自然科学基金项目(42161059,32371869,32160365,32471865);云南省农业基础研究联合专项(202301BD070001-058,202401BB070001-021);云南省兴滇人才项目(YNWR-QNBJ-2019-146)。

Two-Stage Remote Sensing Feature Optimization and GF-1 Data-Supported Forest Above-Ground Biomass Inversion

Yang Feifei1, Zhang Wangfei1, Zhao Lei2, Zhao Han1, Ji Yongjie3, Wang Mengjin1   

  1. 1. College of Forestry, Southwest Forestry University Kunming 650224;
    2. Research Institute of Forest Resource Information Techniques, Chinese Academy of Forestry Beijing 100091;
    3. College of Soil and Water Conservation, Southwest Forestry University Kunming 650224
  • Received:2024-08-13 Revised:2024-11-20 Published:2025-04-21

摘要: 目的 提出二阶段遥感特征选择方法与多机器学习模型结合的思路,探索适用于国产高分一号(GF-1)数据估测森林地上生物量(AGB)的有效模式,为高分系列卫星数据在林业监测中的应用提供参考。方法 首先,从GF-1数据中提取波段光谱特征、纹理特征、植被指数等44个特征变量;然后,基于二阶段特征优选策略优选特征,第一阶段采用过滤式法(Relief)、嵌入式法(Lasso、RF)进行初步筛选,第二阶段通过包装式法(RFE)进一步优化特征子集,提升模型估测能力;最后,利用以往森林AGB估测研究中表现较好的K最近邻(KNN)、支持向量回归(SVR)、随机森林(RF)、梯度提升回归树(GBRT)和极限梯度提升树(XGBoost)机器学习模型进行森林AGB估测,并评估不同特征选择方法与模型之间的匹配度及其对森林AGB估测精度的影响,探寻最优的基于GF-1数据估测森林AGB的策略。结果 1) Relief-RFE方法在XGBoost模型森林AGB反演中取得较优效果(R2 = 0.811,RMSE = 8.45 t·hm–2,rRMSE = 23.39%)。2) 绿光波段纹理特征(mea_b2)能够捕捉到森林地表覆盖物的空间分布信息,如森林的冠层密度、树木分布等空间结构变化信息;蓝光波段光谱特征(b1)对叶绿素和叶色素浓度变化反应敏感,可表征植被健康状况和生长阶段信息,在多种方法的二阶段特征选择中上述2种特征均被选为核心特征用于森林AGB估测。3) 特征优选可显著提升模型性能,XGBoost模型的提升效果明显大于KNN、SVR、RF和GBRT模型。结论 通过对比不同特征选择方法与机器学习模型结合估测森林AGB的效果,验证了二阶段特征优选策略在基于GF-1数据估测森林AGB中的有效性,尤其是将该策略与XGBoost模型结合,能够构建出高精度、高稳健性的森林AGB估测策略。

关键词: GF-1数据, 森林地上生物量, 混合特征选择, 机器学习, 二阶段优化策略

Abstract: Objective In this study, a two-stage remote sensing feature selection method combined with multiple machine learning models was proposed to explore an efficient model for estimating forest aboveground biomass (AGB) using domestic GF-1 satellite data, thereby providing scientific reference for the application of GF-1 satellite data in forestry monitoring.Method A total of 44 remote sensing feature variables, such as spectral features, texture features, and vegetation indices, were first extracted from the GF-1 data. Then a two-stage feature selection strategy was proposed and then applied for feature optimization. In the first stage, the filtered method (Relief) and embedded method (Lasso, RF) were used for initial screening, and in the second stage, the feature subset was further optimized by the wrapper method (RFE) to enhance the model estimation capability. Finally, 5 machine learning models including the K nearest neighbor (KNN), support vector regression (SVR), random forest (RF), gradient boosting (GBRT), gradient boosted regression tree (GBRT) and extreme gradient boosting tree (XGBoost) models were selected and applied for AGB estimation. They were applied for evaluating the match between the different feature selection methods and models and their effects on forest AGB estimation accuracy. According to the results, the optimal strategy was explored for estimating for forest AGB based on GF-1 data.Result 1) The Relief-RFE method achieved the best results in AGB inversion using the XGBoost model with R2 = 0.811, RMSE = 8.45 t·hm–2, rRMSE = 23.39%. 2) The green light band texture feature (mea-b2) was able to capture spatial distribution information of forest surface cover, such as canopy density, tree distribution, and other spatial structural changes of the forest; The spectral characteristics of the blue light band (b1) were sensitive to changes in chlorophyll and leaf pigment concentrations, and was able to characterize vegetation health status and growth stage information. In the two-stage feature selection of various methods, both of the above two features were selected as core features for forest AGB estimation. 3) Feature selection significantly improved model performance, with XGBoost showing a more pronounced improvement compared to KNN, SVR, RF, and GBRT.Conclusion By comparing the application of different feature selection methods combined with machine learning model in forest AGB estimation, the results have demonstrated the effectiveness of the proposed two-stage hybrid feature selection strategy in forest AGB estimation based on GF-1 data, especially combining it with the XGBoost model, a highly accurate and robust forest AGB estimation strategy can be constructed.

Key words: GF-1 data, forest above-ground biomass (AGB), hybrid feature selection, machine learning, two-stage optimization strategy

中图分类号: