欢迎访问林业科学,今天是

林业科学 ›› 2024, Vol. 60 ›› Issue (2): 97-105.doi: 10.11707/j.1001-7488.LYKX20220152

• 研究论文 • 上一篇    下一篇

基于生成对抗网络的树种识别方法

苏彤1,2, 许杰1   

  1. 1. 黑龙江八一农垦大学信息工程学院 大庆 163319;
    2. 山东华宇工学院信息工程学院 德州 253034
  • 收稿日期:2022-03-21 修回日期:2022-05-04 发布日期:2024-03-13
  • 通讯作者: 许杰
  • 基金资助:
    国家自然科学基金项目(31170518);黑龙江省科技攻关项目(GC01KC156);黑龙江省教育规划课题(GJC1319071)。

Tree Species Identification Method Based on Generative Adversarial Network

Su Tong1,2, Xu Jie1   

  1. 1. College of Information Engineering, Heilongjiang Bayi Agricultural Reclamation University Daqing 163319;
    2. College of Information Engineering, Shandong Huayu University of Technology Dezhou 253034
  • Received:2022-03-21 Revised:2022-05-04 Published:2024-03-13

摘要: 目的 利用卷积神经网络模型进行图像自动识别时,为防止模型过拟合通常需要大量训练样本。本研究为提高树种识别准确率,在原有叶片图像基础上进行图像样本扩充来保证训练质量,提出一种融合生成对抗网络与卷积神经网络的树种识别方法。方法 在Pytorch框架下,采集10种常见树种(山杨、梣叶槭、榆、刺槐、紫丁香、杜仲、火炬树、山荆子、水曲柳、红端木)叶片图像作为研究对象。首先,采用均值滤波去噪和尺寸归一化对图像进行预处理。其次,以生成对抗网络生成的图像扩充数据集,其中,以深度卷积生成对抗网络(DCGAN)模型为基础并对其进行改进,建立残差条件深度卷积生成对抗网络(RC-DCGAN)模型,将随机噪声和类别标签作为生成器的输入,以控制样本生成过程;在生成器中嵌入残差结构,使生成模型学习更多特征信息,以提高生成图像质量。然后,将原始图像和扩充图像作为卷积神经网络(CNN)的训练集,一方面,使用RC-DCGAN模型和旋转、镜像、改变对比度等传统图像扩充方法,扩充图像11 400幅;另一方面,将原始图像与生成图像、原始图像与传统扩充图像,分别输入至CNN中进行训练,并在原始图像的每个类别中随机挑选50幅对模型进行测试,以验证生成对抗网络对提升识别准确率的可行性。最后,确定适合试验要求的CNN分类模型,并与AlexNet模型、VGG-16模型、VGG-19模型、ResNet18模型的识别效果进行对比,以检验本研究方法的可行性。结果 RC-DCGAN模型比DCGAN模型生成的图像质量更高,贴合真实图像;利用生成对抗网络扩充图像的方法与ResNet30树种识别模型,训练准确率为99.03%,平均验证识别准确率为97.20%;而在相同树种识别模型下,传统图像扩充方法的识别率为95.50%;在相同数据集下,AlexNet模型、VGG-16模型、VGG-19模型、ResNet18模型所获得的识别率分别为86.52%、87.57%、91.43%、93.25%,均低于本研究模型的识别率。结论 联合生成对抗网络和卷积神经网络的方法对本研究10种树种叶片图像的识别准确率最高,且克服了使用传统图像处理扩充方法使模型泛化能力下降的问题,说明利用生成对抗网络对图像扩充的方法具有可行性和有效性,可为相关研究工作提供借鉴。

关键词: 卷积神经网络, 树种识别, 生成对抗网络, 残差结构

Abstract: Objective When using convolutional neural network models for automatic image recognition, a large number of training samples are usually required to prevent model overfitting. In this study, we propose a tree species recognition method that fuses generative adversarial networks with convolutional neural networks to improve the accuracy of tree species recognition and expand the image samples based on the original leaf images to ensure the training quality.Method Under the framework of Pytorch, images of the leaves of 10 common tree species (aspen, ash maple, house elm, locust, clove, eucommia, torch tree, mountain wattle, ash willow, red endosperm) were collected as research objects. First, the images were pre-processed using mean filter denoising and size normalization. Second, the datasets are expanded with images generated by generative adversarial network, in which the residual conditional deep convolutional generative adversarial network (RC-DCGAN) model is built based on and improved by the deep convolutional generative adversarial network (DCGAN) model, which takes random noise and category labels as the input of the generator to control the sample generation process; the residual structure is embedded in the generator so that the generative model learn more feature information to improve the quality of the generated images. Then, the original and augmented images are used as the training set of the convolutional neural network (CNN). On the one hand, 11 400 images were expanded using the RC-DCGAN model and traditional image expansion methods such as rotation, mirroring, and changing contrast, respectively. On the other hand, the original and generated images, the original and traditional expanded images were inputted to the CNN for training, and 50 images in each category of the original images were randomly selected to test the model to verify the feasibility of the generative adversarial networks to improve the recognition accuracy. Finally, the CNN classification model suitable for the experimental requirements are determined and compared with the recognition effect of AlexNet model, VGG-16 model, VGG-19 model and ResNet18 model to test the feasibility of the method in this study.Result The RC-DCGAN model generates higher quality images than the DCGAN model, which fit the real image; the training accuracy is 99.03% and the average validation recognition accuracy is 97.20% using the generative adversarial network augmented image method with the ResNet30 tree recognition model; while the recognition rate of the traditional image augmentation method is 95.50% under the same tree recognition model; under the same dataset, the recognition rates obtained by AlexNet model, VGG-16 model, VGG-19 model, and ResNet18 model were 86.52%, 87.57%, 91.43%, and 93.25%, respectively, which were lower than that of the models in this study.Conclusion The method of combining generative adversarial network and convolutional neural network has the highest recognition accuracy for the leaf images of 10 tree species in this study, which overcomes the problem of decreasing the generalization ability of the model by using the traditional image processing augmentation method, and shows that the method of image amplification by generating adversarial network is feasible and effective, which can provide reference significance for related research work.

Key words: convolutional neural network, tree species identification, generative adversarial network, residual structure

中图分类号: