Welcome to visit Scientia Silvae Sinicae,Today is

Scientia Silvae Sinicae ›› 2023, Vol. 59 ›› Issue (8): 112-122.doi: 10.11707/j.1001-7488.LYKX20220378

• Research papers • Previous Articles     Next Articles

Wildlife Image Recognition in Miyun District Based on BS-ResNeXt-50

Jiandong Qi1,2,Zhongtian Ma1,Dehuai Zhang3,Yun Tian4   

  1. 1. College of Information, Beijing Forestry University Beijing 100083
    2. Engineering Research Center for Forestry-Oriented Intelligent Information Processing of National Forestry and Grassland Administration Beijing 100083
    3. Management Office of Wulingshan Mountain Nature Reserve Beijing 101506
    4. School of Soil and Water Conservation, Beijing Forestry University Beijing 100083
  • Received:2022-05-31 Online:2023-08-25 Published:2023-10-16

Abstract:

Objective: In the wild environment, the background of wildlife images captured by camera traps is complex, which poses a challenge for identifying wild animals in images with a large number of images and a wide variety of wildlife species. Based on convolutional neural network, this research aims to improve the existing structure and so as to implement the automatic recognition for wildlife images. Method: In this study, 2 712 wildlife images of 8 categories were taken from Wuling Mountain Beijing Nature Reserve, Miyun Districts, Beijing. The Auto Augment policy was randomly selected from 14 augmentation policies to add noise to the images. SENet and BlurPool were used to construct an improved network based on ResNeXt-50: SE-ResNeXt-50 for enhancement feature extraction, BP-ResNeXt-50 for Shift-invariance maintenance, and BS-ResNeXt-50 for both. The influences of fixed learning rate, segmented learning rate, and cosine annealing learning rate on the accuracy of the BS ResNeXt-50 network were tested on the self-built dataset. VGG16, ResNeXt-50, EfficientNet-B0, InceptionV3, DenseNet-121, and BS-ResNeXt-50 were used to train on 16 common categories of images in CCT public wildlife dataset, and the recognition accuracy of single species was compared.e influences of fixed learning rate, segmented learning rate, and cosine annealing learning rate on the accuracy of the BS ResNeXt-50 network were tested on the self-built dataset. VGG16, ResNeXt-50, EfficientNet-B0, InceptionV3, DenseNet-121, and BS-ResNeXt-50 were used to train on 16 common categories of images in CCT public wildlife dataset, and the recognition accuracy of single species was compared.eXt-50 is used to test influence of different learning rate include fixed and CosineAnnealing learning rate on collected dataset. VGG16, ResNeXt-50, EfficientNet-B0, InceptionV3, DenseNet-121, BS-ResNeXt-50 were used for training on CCT dataset, and the recognition accuracy of single species was compared. on ResNeXt-50: SE-ResNeXt-50 for enhancement feature extraction, BP-ResNeXt-50 for Shift-invariance maintenance, and BS-ResNeXt-50 for both. The influences of fixed learning rate, segmented learning rate, and cosine annealing learning rate on the accuracy of the BS ResNeXt-50 network were tested on the self-built dataset. VGG16, ResNeXt-50, EfficientNet-B0, InceptionV3, DenseNet-121, and BS-ResNeXt-50 were used to train on 16 common categories of images in CCT public wildlife dataset, and the recognition accuracy of single species was compared.eXt-50 is used to test influence of different learning rate include fixed and CosineAnnealing learning rate on collected dataset. VGG16, ResNeXt-50, EfficientNet-B0, InceptionV3, DenseNet-121, BS-ResNeXt-50 were used for training on CCT dataset, and the recognition accuracy of single species was compared. Result: The accuracy of SE-ResNeXt-50 and BP-ResNeXt-50 reached 75.16%±0.14% and 73.74%±0.13%, respectively. The enhanced scheme BS-ResNeXt-50, which integrated SENet and BlurPool, achieved an accuracy of 78.04%±0.11% when tested on a self built dataset, which was the best improved scheme. When the cosine annealing learning rate is used, the accuracy of BS-ResNeXt-50 was improved to 81.54%, which was 3.5% higher than that with the constant learning rate. The step decay learning rate achieved 79.3% accuracy, which was 2.24% less than the cosine annealing learning rate. The classification accuracy of BS-ResNeXt-50 was able to reach 95.07%, which was 1.95% higher than that of ResNeXt-50 on CCT dataset. At the same time, it was also 85.5% higher than that of VGG16, 91.38% higher than that of EfficientNet-B0, 91.38% higher than that of InceptionV3 and 93.3% higher than that of DenseNet-121. The prediction accuracy of each single category was also higher than that of the above model. In the recognition of a single category, except for the least one category, the accuracy of BS-ResNeXt-50 was 90% higher than that in other categories, and the highest category accuracy was 98.6%. Conclusion: The BS-ResNeXt-50 can more accurately complete the recognition task, and also has good generalization ability on different datasets.

Key words: wildlife images, species recognition, deep learning, convolutional neural network

CLC Number: