WANG Ji-wu, LUO Hai-bao, YU Peng-fei, LI Chen-yang
(School of Mechanical, Electronic and Control Engineering, Beijing Jiaotong University, Beijing 100044, China)
Abstract: In order to solve the problem of small objects detection in unmanned aerial vehicle (UAV) aerial images with complex background, a general detection method for multi-scale small objects based on Faster region-based convolutional neural network (Faster R-CNN) is proposed. The bird’s nest on the high-voltage tower is taken as the research object. Firstly, we use the improved convolutional neural network ResNet101 to extract object features, and then use multi-scale sliding windows to obtain the object region proposals on the convolution feature maps with different resolutions. Finally, a deconvolution operation is added to further enhance the selected feature map with higher resolution, and then it taken as a feature mapping layer of the region proposals passing to the object detection sub-network. The detection results of the bird’s nest in UAV aerial images show that the proposed method can precisely detect small objects in aerial images.
Key words: Faster region-based convolutional neural network (Faster R-CNN); ResNet101; unmanned aerial vehicle (UAV); small objects detection; bird’s nest
CLD number: TP183 doi: 10.3969/j.issn.1674-8042.2020.01.002
References
[1]Ren S, Girshick R, Girshick R, et al. Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017, 39(6): 1137-1149.
[2]Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector. In: Proceedings of the 14th European Conference on Computer Vision, Springer Verlag, 2016: 21-37.
[3]Redmon J, Divvals S, Grishick R, et al. You only look once: unified, real-time object detection. In: Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 2016: 779-788.
[4]He K, Zhang X, Ren S, et al. Deep residual learning for image recognition. In: Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, 2016: 770-778.
[5]LeCun Y, Botton L, Bengio Y, et al. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998, 86(11): 2278-2324.
[6]Hinton G E, Salakhutdinov R R. Reducing the dimensionality of data with neural networks. Science, 2006, 313(5786): 504-507.
[7]Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks. In: Proceedings of International Conference on Neural Information Processing Systems, Nevada, 2012: 1097-1105.
[8]Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. Computer Science, 2014, 1409.1556: 1-8.
[9]Szegedy C, Liu W, Jia Y, et al. Going deeper with convolutions. In: Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition, Boston, 2015: 1-9.
[10]Hosang J, Benenson R, Dollár P, et al. What makes for effective detection proposals?. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2015, 38(4): 814.
基于Faster R-CNN的无人机航拍图像小目标检测
王纪武, 罗海保, 鱼鹏飞, 李晨阳
(北京交通大学 机械与电子控制工程学院, 北京 100044)
摘 要:为了解决复杂图像背景下无人机航拍图像小目标检测问题, 提出了一种基于Faster R-CNN的多尺度小目标检测方法。 以高压塔上的鸟巢为检测对象, 首先通过改进卷积神经网络ResNet101对目标进行特征提取, 然后采用多尺度滑动窗口方式在不同分辨率卷积特征图上获取目标初始建议区域, 最后在选取的分辨率较高的卷积特征图上增加一个反卷积操作进一步对特征图的分辨率进行提升, 并作为建议窗口的特征映射层传入目标检测子网络中。 通过对无人机实际航拍图像中鸟巢的检测结果表明, 所提出的算法可以实现对航拍图像中小目标的精确检测。
关键词: Faster R-CNN; ResNet101; 无人机; 小目标检测; 鸟巢
引用格式: WANG Ji-wu, LUO Hai-bao, Yu Peng-fei, et al. Small objects detection in UAV aerial images based on improved Faster R-CNN. Journal of Measurement Science and Instrumentation, 2020, 11(1): 11-16. [doi: 10.3969/j.issn.1674-8042.2020.01.002]
[full text view]