欢迎访问林业科学,今天是

林业科学 ›› 2019, Vol. 55 ›› Issue (11): 137-144.doi: 10.11707/j.1001-7488.20191115

• 论文与研究报告 • 上一篇    下一篇

基于空间聚类的杉木生长预测方法

张英凯1,刘鹏举1,*,刘长春1,任怡2   

  1. 1. 中国林业科学研究院资源信息研究所 北京 100091
    2. 国家林业和草原局调查规划设计院 北京 100714
  • 收稿日期:2019-02-21 出版日期:2019-11-25 发布日期:2019-12-21
  • 通讯作者: 刘鹏举
  • 基金资助:
    十三五"林业资源培育及高效利用技术创新"专项资金"人工林培育经营智能化决策技术"(2017YFD0600906)

Prediction Method of Cunninghamia lanceolata Growth Based on Spatial Clustering

Yingkai Zhang1,Pengju Liu1,*,Changchun Liu1,Yi Ren2   

  1. 1. Research Institute of Forest Resource Information Techniques, CAF Beijing 100091
    2. Academy of Forestry Inventory and Planning, National Forestry and Grassland Administration Beijing 100714
  • Received:2019-02-21 Online:2019-11-25 Published:2019-12-21
  • Contact: Pengju Liu
  • Supported by:
    十三五"林业资源培育及高效利用技术创新"专项资金"人工林培育经营智能化决策技术"(2017YFD0600906)

摘要:

目的: 基于空间聚类对杉木分布区进行分组,在不同组内分别建立杉木生长模型,为适用于全国范围的杉木生长高精度预测提供方法。方法: 以杉木分布的16个省区为研究区,基于第七次、第八次全国森林资源连续清查杉木固定样地复测数据和研究区的地形、土壤、气象等环境数据,采用随机森林-递归特征消除(RF-RFE)算法对影响杉木生长的环境因子进行分析。选择对杉木生长影响较大的前8个环境因子,利用ArcGIS 10.2的分组分析功能,根据环境相似性对杉木分布区进行分组,分别建立林分分组和未分组蓄积生长率模型。以全国未分组模型作为参考,采用决定系数(R2)、均方根误差(RMSE)、平均相对误差(MRE)、系统误差(SE)和剩余标准差(S)对建模结果进行分析。结果: 对杉木生长影响较大的前8个环境因子分别为bio4(温度季节变化标准差)、elevation(海拔)、bio3(等温性)、bio8(最湿季度平均温)、bio1(年均温)、bio14(最干月降水量)、bio12(年均降水量)和bio2(昼夜温差月均值)。将研究区划分为7组时可使组内环境相似性最大、组间环境相似性最小。7组中只有4组有杉木样地数据,在4组中采用2种模型分别建立杉木林分生长率模型,与全国未分组模型相比,分组模型的决定系数提高0.1左右,拟合度更好;分组建模精度也明显提高,RMSE降低0.5左右,MRE降低6%左右,SE降低3%左右,S降低1左右。结论: 根据环境相似性对研究区分组并在此基础上建立生长模型是一种适用于全国范围杉木生长的高精度预测方法,可用于全国区域范围杉木林分生长量的整体预估和森林资源小班数据的模型更新,为实现主要人工林树种的大区域生长预测提供新的方法。

关键词: 空间聚类, 随机森林模型, 杉木, 生长模型

Abstract:

Objective: Cunninghamia lanceolata is widely distributed in China, and the growth speed is quite different in different areas. The spatial clustering method was used to group the distribution area of C. lanceolata, and the growth model of C. lanceolata was established in different groups, which provided a method for the prediction of nationwide C. lanceolata high-precision growth. Method: 16 provinces(autonomous regions)where C. lanceolata was distributed were selected as study areas. Based on seventh and eighth continuous inventory of forest resources in China for the retest data of fixed plots of C. lanceolata and the topography, soil, meteorology and other environmental data of the study area, the importance of environmental factors affecting the growth of C. lanceolata were analyzed by the random forest model. Five environmental factors with great influences on the growth of C. lanceolata were selected. Using the group analysis function of ArcGIS 10.2, the C. lanceolata in the study area was grouped according to the environmental similarity and spatial proximity. The accumulation growth rate model of grouped and ungrouped was respectively established. Taking the national ungrouped model as a reference, the five indicators including coefficient of determination(R2), root mean square error(RMSE), mean relative error(MRE), systematic error(SE)and residual standard deviation(S)were used as the evaluation indexes of the model to analyze the modeling results. Result: The top eight environmental factors those have great influences on the growth of C. lanceolata by random forest model analysis were bio4(standard deviation of seasonal variation of temperature), elevation, bio3(isothermality), bio8(the wettest quarterly mean temperature), bio1(annual mean temperature), bio14(the most dry month precipitation), bio12(annual mean precipitation), and bio2(monthly mean diurnal temperature variation). The results of the group analysis showed that when the research area were divided into 7 groups, the internal environment similarity within the group could be maximized, and the environmental similarity between groups was the smallest. As only four of the 7 groups had data of C. lanceolata plot, the growth rate model of C. lanceolata stands was established using two model formulas in these four groups. Compared with the ungrouped models, the determination coefficients of the grouping models were all increased by more than 0.1, indicating that the fitting degree of the grouping model was better. At the same time, the accuracy of grouping modeling was also significantly improved:RMSE was reduced by about 0.5, MRE was reduced by about 6%, SE was reduced by about 3%, and S was reduced by about 1. Conclusion: The growth model based on the grouping of C. lanceolata study areas according to the environmental similarity is a high-precision forecasting method for C. lanceolata growth nationwide. The model could be used to estimate the growth of C. lanceolata stand and to update the data of forest subcompartment. The proposed method might provide a new method for realizing large area growth prediction of main plantation species.

Key words: spatial clustering, random forest model, Cunninghamia lanceolata, growth model

中图分类号: