欢迎访问林业科学,今天是

林业科学 ›› 2018, Vol. 54 ›› Issue (5): 78-86.doi: 10.11707/j.1001-7488.20180509

• 论文与研究报告 • 上一篇    下一篇

基于自学习特征的林业业务图像分类方法

李英杰1, 张广群2, 汪杭军1   

  1. 1. 浙江农林大学暨阳学院 诸暨 311800;
    2. 浙江农林大学信息工程学院 临安 311300
  • 收稿日期:2016-09-05 修回日期:2017-02-22 出版日期:2018-05-25 发布日期:2018-06-05
  • 基金资助:
    浙江省自然科学基金项目(LY16C160007,LY14C130013)。

A Method for Forestry Business Images Classification Based on Auto-Learning Features

Li Yingjie1, Zhang Guangqun2, Wang Hangjun1   

  1. 1. Jiyang College, Zhejiang A & F University Zhuji 311800;
    2. School of Information Engineering, Zhejiang A & F University Lin'an 311300
  • Received:2016-09-05 Revised:2017-02-22 Online:2018-05-25 Published:2018-06-05

摘要: [目的]现代电子和通讯设备的普遍使用,使得林业管理部门可以及时获取和累积大量林业业务图像。对林业业务图像进行自动分类,可使林业各个管理部门,包括林业执法机构和执法队伍能够全面配合、相互协调,增强决策支持和应急处理能力,从而实现有效的森林资源监管。[方法]提出一种基于自学习特征的林业业务图像自动分类方法,该方法采用3层结构来获取图像局部特征的全局语义,并对全局语义进行建模。首先,将图像分块,对图像块采用线性稀疏自动编码器进行自动学习,获取局部特征的权值矩阵;之后,利用权值矩阵对图像进行卷积,获取各种局部特征在图像中的映射,再进行池化与连接形成特征向量;最后,利用特征向量,采用softmax分类方法进行图像类别建模与识别。[结果]确立了林业业务图像的4个类别:动物死亡、森林火灾、采伐和森林病虫害。收集4类图像355幅,每类图像数目不等,并裁剪为统一大小,建立试验用林业业务图像数据集。利用留一法进行多次交叉试验,识别精度达到80%。线性稀疏自动编码器有效地提取出了图像块中的色彩特征、色彩变化特征及不同方向和不同位置的梯度特征等;利用自动学习到的局部权值矩阵卷积整个图像,在每幅特征图像中激活了原图的不同特征,且这些特征是零散的;当被识别图像与同类图像有相似或者局部相似的特征时,其会被正确识别,反映了基于自学习特征的林业业务图像分类方法能联合图像中的局部特征识别全局语义。[结论]林业业务图像分类属于图像场景分类问题,其同一类图像的类内相似性较弱,利用传统的"特征提取+分类建模"方法进行分类,其分类特征的选取难度较大。相较而言,基于自学习特征的林业业务图像分类方法的局部特征是从图像中自动学习得到的,方法泛化性强;而且该方法只有3层,没有全部层的反馈学习过程,比目前流行的基于深度卷积神经网络的图像分类方法效率高;当累积的样本图像增多、更加全面时,可以期望获得更高的识别精度。

关键词: 林业业务图像, 线性稀疏自动编码器, 卷积, 池化, softmax

Abstract: [Objective] With the popularization of modern electronic and communication equipment, large numbers of forestry business images can be gotten in time and be accumulated in forestry administration. Automatic classification of forestry business images can achieve effective supervision of forest resources because it is the core of decision support, rapid reaction and cooperation among all forestry management departments, including forestry law enforcement agencies and law enforcement teams.[Method] A new method for forestry business image classification, which was based on auto-learning local features was proposed. Three layers of networks are constructed in the method. They are used to get the local features of images, to form global semantic by combine these local features, and to model classifiers. Firstly, patches are extracted from images and their features are learned automatically using linear sparse auto-encoder. The features are local. Then, the images are convoluted with the local features to get the global activation feature maps. The maps are then pooled and combined to form feature vectors. Finally, softmax is used to model and classify on the vectors.[Result] Four categories are established for forestry business images. They are animal death, forest fire, logging and forest pest. 355 images are gathered for the four categories with different numbers of images in different categories. They are cropped to the same size and constitute the experimental dataset. Leave-one-out cross validation is done on the dataset and we get the 80% classification accuracy. And, it can be seen that different color features, color variation features, gradient features and other features are extracted automatically from image patches by linear sparse auto-encoder automatically. We also can see that different features are activated in different feature maps which are gained by convoluting images with the weight matrices. Also, the active features are scattered in feature maps. An image will be identified correctly, if the features, which are the same or similar with those in other images of the same category, are included in this image. This illustrate that our method performs well.[Conclusion] The issue of forestry business images classification is a kind of scene classification problem. The images resemblance in the same category is weak. To classify them with traditional "feature extraction and classification modeling" method,the appropriate classification features are difficult to find. By comparison, our method learns the local features automatically and it is more general. Also, only three network layers are included in this method, and there is no feedback learning crossing all these three layers. So, it is more efficient than the popular methods which are based on deep convolutional neural networks. Higher accuracy can be expected while the accumulating images are more and comprehensive.

Key words: forestry business images, linear sparse auto-encoder, convolution, pooling, softmax

中图分类号: