欢迎访问林业科学,今天是

林业科学 ›› 2023, Vol. 59 ›› Issue (6): 74-87.doi: 10.11707/j.1001-7488.LYKX20220553

• • 上一篇    下一篇

基于多源数据和机器学习方法的大兴安岭地区雷击火驱动因子及火险预测模型

焦强英1,2(),韩宗甫2,王炜烨3,刘迪4,潘鹏旭5,李博6,张念慈7,王萍1,陶金花2,范萌2,*   

  1. 1. 山东科技大学测绘与空间信息学院 青岛 266590
    2. 中国科学院空天信息创新研究院 遥感科学国家重点实验室 北京 100101
    3. 黑龙江省应急航空救援幸福站 哈尔滨 150000
    4. 南京森林警察学院 南京 210023
    5. 黑龙江省森林草原防火预警监测中心 哈尔滨 150036
    6. 大兴安岭地区气象局 加格达奇 165000
    7. 黑龙江省森林保护研究所 哈尔滨 150040
  • 收稿日期:2022-08-10 出版日期:2023-06-25 发布日期:2023-08-08
  • 通讯作者: 范萌 E-mail:jqy19971110@163.com
  • 基金资助:
    黑龙江省应用技术研究与开发计划项目(GA20C013);国家自然科学基金面上项目(41871254)。

Driving Factors and Forecasting Model of Lightning-Caused Forest Fires in Daxing’ anling Mountains Based on Multi-Sources Data and Machine Learning Method

Qiangying Jiao1,2(),Zongfu Han2,Weiye Wang3,Di Liu4,Pengxu Pan5,Bo Li6,Nianci Zhang7,Ping Wang1,Jinhua Tao2,Meng Fan2,*   

  1. 1. College of Geodesy and Geomatics, Shandong University of Science and Technology Qingdao 266590
    2. State Key Laboratory of Remote Sensing Science, Aerospace Information Research Institute, Chinese Academy of Sciences Beijing 100101
    3. Heilongjiang Province Emergency Air Rescue Happiness Station Harbin 150000
    4. Nanjing Forest Police College Nanjing 210023
    5. Heilongjiang Province Forest and Grassland Fire Warning Monitoring Center Harbin 150036
    6. Daxing’ anling Meteorological Bureau Jiagedaqi 165000
    7. Heilongjiang Provincial Forest Protection Research Institute Harbin 150040
  • Received:2022-08-10 Online:2023-06-25 Published:2023-08-08
  • Contact: Meng Fan E-mail:jqy19971110@163.com

摘要:

目的: 基于长时间序列多源数据,开展雷击火驱动因子分析,采用机器学习方法构建动态、高分辨率雷击火火险预测模型,为雷击火防控提供支撑。方法: 分析大兴安岭地区2010—2020年雷击火时空分布规律,基于闪电监测数据、卫星遥感数据、气象再分析资料、DEM等多源数据选取闪电、气象、植被、地形4类18个雷击火潜在驱动因子,研究其特征及与雷击火发生的关系;提取历史雷击火点和随机生成的非雷击火点对应的驱动因子,构建原始样本集,计算各驱动因子的重要性和相关性矩阵进行驱动因子挑选;基于优化后的训练样本集,采用梯度提升决策树(GBDT)、随机森林(RF)和极端随机树(ERT)3种集成学习模型进行雷击火火险预测能力评估,选择表现最优的方法用于构建大兴安岭地区雷击火火险预测模型并应用。结果: 2010―2020年大兴安岭雷击火出现次数最多和最少的年份分别为2015和2012年,主要集中于5―7月,发生时段主要集中于10:00―17:00;雷击火高发区为漠河县、塔河县、新林区和呼中区。闪电与雷击火的空间分布基本一致,但闪电越多雷击火发生次数不一定越多,2011年闪电次数最多,为114 632次,雷击火仅出现11次。在闪电强度为?20~?40 kA、陡度为?4~?8 kA·μs?1、相对湿度小于40%、降水量小于4 mm、气温大于29 ℃、大气压为91~95 kPa且风速为1~3 m·s–1的气象条件下更易发生雷击火。NDVI(归一化植被指数)、GPP(总初级生产力)、Et(蒸散量)、NPP(净初级生产力)大小与雷击火发生呈正相关。雷击火在海拔300~900 m、坡度0~12°范围内出现次数较多,坡向对雷击火发生影响不大。特征选择后剩余13个特征参量分别参与3种集成学习算法模型构建,其中,ERT模型预测能力最好,其AUC达0.97,查准率、查全率和F1 Score均高于GBDT和RF模型。ERT模型预测的高风险区与实际雷击火点分布区的空间一致性很好。结论: 利用多源大数据,尤其是卫星观测数据,获取更多与雷击火发生有关的潜在驱动因子,并依靠机器学习,能够较好体现因子间非线性关系及自行学习参数间复杂关系能力的优势,本研究构建的雷击火火险预测模型具备很好的泛化性、自适应性和较高的空间分辨率。

关键词: 雷击火, 火险预测, 机器学习, 遥感, 闪电

Abstract:

Objective: Due to the complexity and strong concealment of lightning-caused forest fire occurrence, it is difficult to monitor and early warning. For most available forest fire forecasting models, although main meteorological factors are taken into account the models, their adaptability and precision are still relatively low. In this study, based on the long-term multi-sources data, the driving factors of lightning-caused forest fires were analyzed, and a dynamic lightning-caused fire forecasting model with high spatial resolution was built by using machine learning method, to provide support for the fire prevention and control. Method: The spatial and temporal distribution of lightning-caused fires from 2010 to 2020 was analyzed. Multi-source data such as ground-based lightning observations, satellite data, meteorological reanalysis data and DEM data were used to extract 18 driving factors from 4 categories (i.e., lightning, meteorology, vegetation and terrain). The characteristics of each driving factor and the relationship with lightning-caused fires were studied. The driving factors of forest lightning-caused fire records and randomly generated non-lightning fire spots were extracted to establish our initial training sample dataset. Driving factors were selected by calculating feature importance and correlation matrix. Based on the optimized training sample dataset, three integrated learning models, namely, gradient ascending decision tree (GBDT), random forest (RF) and extreme random tree (ERT) were trained and evaluated, respectively. The model with the best performance was used to forecast forest lightning-caused fires in Daxing’ anling Mountains. Result: The results showed that from 2010 to 2020, the maximum and minimum number of lightning fires occurred in 2015 and 2012, respectively, mainly in May, June and July, and the occurrence period was mainly concentrated from 10:00 to 17:00. The areas with high lightning-caused fire density appeared in Mohe County, Tahe County, Xinlin District and Huzhong District. The spatial distribution of lightning was consistent with that of lightning-caused fires to a certain extent, but the more lightning number might not lead more lightning-caused fires. In 2011, the maximum number of lightning occurrences was 114 632, but only 11 lightning-caused fires. Under the following conditions: lightning intensity in –20– –40 kA, steepness ranged of – 4– –8 kA·μs?1, relative humidity less than 40%, precipitation less than 4 mm, temperature more than 29 ℃, atmospheric pressure in 91–95 kPa and wind speed in 1–3 m?s–1, lightning fires were more prone to occur. NDVI (normalized vegetation index), GPP (total primary productivity), Et (evapotranspiration), and NPP (net primary productivity) were positively correlated with the occurrence of lightning-caused fires. Lightning-caused fires occurred more frequently at altitude of 300–900 m and slope of 0–12°, and aspect had little effect on the occurrence of lightning fires. After feature selection, the remaining 13 features were involved in the model construction of the three ensemble learning algorithms, and the comparison showed that the ERT prediction effect was the best. The AUC value of ERT model reached 0.97, and the precision, recall and F1 score values were all higher than GBDT and RF models. The high-risk region of lighting fires predicted by ERT model had a good agreement with the location of the actual lightning fire spots. Conclusion: The multi-sources big data, especially satellite observation data, are used to obtain more potential driving factors related to the occurrence of lightning-caused fires. Combined with the advantages of machine learning method, our forecasting model of forest lightning-caused fires well performs on the high-risk region prediction of lightning-caused fires in Daxing’ anling Mountains, which has good generalization ability, good adaptability and high spatial resolution.

Key words: lightning-caused fire, fire forecasting, machine learning, remote sensing, lightning

中图分类号: