Tree Species Identification Method Based on Generative Adversarial Network

doi:10.11707/j.1001-7488.LYKX20220152

Abstract

Objective: Convolutional neural network (CNN) models for automatic image recognition usually need a large number of training samples to avoid overfitting. To improve tree species recognition accuracy, this study augments the original leaf image set to maintain training quality and proposes a tree species recognition method that combines a generative adversarial network (GAN) with a CNN.

Method: Working in the PyTorch framework, leaf images of 10 common tree species were collected as the study material: aspen (山杨), boxelder (梣叶槭), elm (榆), black locust (刺槐), lilac (紫丁香), eucommia (杜仲), staghorn sumac (火炬树), Siberian crabapple (山荆子), Manchurian ash (水曲柳), and red dogwood (红端木). First, the images were preprocessed by mean-filter denoising and size normalization. Second, the dataset was expanded with GAN-generated images. The deep convolutional generative adversarial network (DCGAN) was taken as the base model and improved into a residual conditional deep convolutional generative adversarial network (RC-DCGAN): random noise and class labels are both fed to the generator to control the sample generation process, and a residual structure is embedded in the generator so that it learns more feature information and produces higher-quality images. Then, the original and augmented images were used as the CNN training set. The RC-DCGAN model and traditional augmentation methods (rotation, mirroring, and contrast adjustment) were each used to produce 11 400 additional images. The original images combined with the generated images, and the original images combined with the traditionally augmented images, were used separately to train the CNN, and 50 images randomly selected from each class of the original images were used to test the model, in order to verify whether GAN-based augmentation improves recognition accuracy. Finally, a CNN classification model suited to the experiment was selected and compared with the AlexNet, VGG-16, VGG-19, and ResNet18 models to assess the proposed method.

Result: The RC-DCGAN model generated higher-quality images than the DCGAN model, and its images more closely resembled real images. With GAN-based augmentation and the ResNet30 tree species recognition model, the training accuracy was 99.03% and the average validation accuracy was 97.20%, whereas traditional augmentation with the same recognition model gave a recognition rate of 95.50%. On the same dataset, the AlexNet, VGG-16, VGG-19, and ResNet18 models achieved recognition rates of 86.52%, 87.57%, 91.43%, and 93.25%, respectively, all lower than that of the proposed model.

Conclusion: Among the methods compared, combining a GAN with a CNN gave the highest recognition accuracy on the leaf images of the 10 tree species in this study and overcame the loss of generalization ability associated with traditional image augmentation. This indicates that GAN-based image augmentation is feasible and effective and can serve as a reference for related research.
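The preprocessing and traditional augmentation steps named in the Method part of the abstract can be illustrated with a minimal sketch. It is not the authors' code: it uses plain Python on small grayscale images stored as lists of rows, and the 3×3 filter window, the 90° rotation angle, and the contrast factor of 1.5 are assumptions chosen for illustration, since the abstract does not state these values. Size normalization (resizing) is omitted.

```python
# Illustrative sketch only. Images are grayscale, stored as lists of rows
# with pixel values in 0..255. Window size, rotation angle and contrast
# factor are assumptions, not values taken from the paper.

def mean_filter(img, radius=1):
    """Replace each pixel with the mean of its (2*radius+1)^2 neighbourhood.
    The window is clipped at the image border."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            window = [
                img[j][i]
                for j in range(max(0, y - radius), min(h, y + radius + 1))
                for i in range(max(0, x - radius), min(w, x + radius + 1))
            ]
            row.append(sum(window) / len(window))
        out.append(row)
    return out


def mirror(img):
    """Flip the image horizontally."""
    return [row[::-1] for row in img]


def rotate90(img):
    """Rotate the image 90 degrees clockwise."""
    return [list(col) for col in zip(*img[::-1])]


def adjust_contrast(img, factor):
    """Scale each pixel's distance from mid-grey (128), clamped to 0..255."""
    return [
        [min(255.0, max(0.0, 128 + factor * (p - 128))) for p in row]
        for row in img
    ]


def augment(img):
    """Return three traditional variants of one preprocessed image."""
    return [mirror(img), rotate90(img), adjust_contrast(img, 1.5)]
```

In practice an image library would do this work on full-colour images; the point here is only that each traditional variant is a deterministic transform of an existing image, whereas the GAN route samples new images.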

Key words: convolutional neural network, tree species identification, generative adversarial network, residual structure
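The two ideas behind RC-DCGAN mentioned in the abstract, conditioning the generator on a class label and adding a residual structure, can be sketched in a framework-free way. This is a hypothetical illustration, not the paper's architecture: the abstract does not say how the label is combined with the noise, so one-hot concatenation is an assumption, as are the noise length of 100 and the helper names `one_hot`, `generator_input`, and `residual`.

```python
import random

# Illustrative sketch only. One-hot concatenation and noise_dim=100 are
# assumptions; the abstract does not specify how labels enter the generator.

def one_hot(label, num_classes):
    """Encode a class index as a one-hot vector."""
    return [1.0 if i == label else 0.0 for i in range(num_classes)]


def generator_input(label, num_classes=10, noise_dim=100, rng=None):
    """Build a conditional generator input: Gaussian noise followed by the
    one-hot class label, so the requested class steers what is generated."""
    rng = rng or random.Random()
    noise = [rng.gauss(0.0, 1.0) for _ in range(noise_dim)]
    return noise + one_hot(label, num_classes)


def residual(block, x):
    """Apply a residual connection: return x + block(x), element-wise.
    `block` stands in for the convolutional layers inside the generator."""
    fx = block(x)
    return [a + b for a, b in zip(x, fx)]
```

For example, `generator_input(3)` returns a vector of length 110 whose last ten entries mark class 3, and `residual` leaves its input unchanged whenever the wrapped block outputs zeros, which is the identity shortcut that residual structures rely on.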

CLC number: TP391.4

Su Tong, Xu Jie. Tree Species Identification Method Based on Generative Adversarial Network[J]. Scientia Silvae Sinicae, 2024, 60(2): 97-105.

