Improved Kiwifruit Detection Using Pre-Trained VGG16 With RGB and NIR Information Fusion

被引:123
作者
Liu, Zhihao [1 ]
Wu, Jingzhu [2 ]
Fu, Longsheng [1 ,3 ,4 ,5 ]
Majeed, Yaqoob [5 ]
Feng, Yali [6 ]
Li, Rui [1 ]
Cui, Yongjie [1 ]
机构
[1] Northwest A&F Univ, Coll Mech & Elect Engn, Yangling 712100, Shaanxi, Peoples R China
[2] Beijing Technol & Business Univ, Beijing Key Lab Big Data Technol Food Safety, Beijing 100048, Peoples R China
[3] Minist Agr & Rural Affairs, Key Lab Agr Internet Things, Yangling 712100, Shaanxi, Peoples R China
[4] Shaanxi Key Lab Agr Informat Percept & Intelligen, Yangling 712100, Shaanxi, Peoples R China
[5] Washington State Univ, Ctr Precis & Automated Agr Syst, Prosser, WA 99350 USA
[6] Shanxi Agr Univ, Coll Engn, Jinzhong 030801, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Fruit detection; image alignment; information fusion; multi-modality faster R-CNN; RGB-D sensor; CLASSIFICATION; NETWORKS; FRUITS;
D O I
10.1109/ACCESS.2019.2962513
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study presents a novel method to apply the RGB-D (Red Green Blue-Depth) sensors and fuse aligned RGB and NIR images with deep convolutional neural networks (CNN) for fruit detection. It aims to build a more accurate, faster, and more reliable fruit detection system, which is a vital element for fruit yield estimation and automated harvesting. Recent work in deep neural networks has led to the development of a state-of-the-art object detector termed Faster Region-based CNN (Faster R-CNN). A common Faster R-CNN network VGG16 was adopted through transfer learning, for the task of kiwifruit detection using imagery obtained from two modalities: RGB (red, green, blue) and Near-Infrared (NIR) images. Kinect v2 was used to take a bottom view of the kiwifruit canopy's NIR and RGB images. The NIR (1 channel) and RGB images (3 channels) were aligned and arranged side by side into a 6-channel image. The input layer of the VGG16 was modified to receive the 6-channel image. Two different fusion methods were used to extract features: Image-Fusion (fusion of the RGB and NIR images on input layer) and Feature-Fusion (fusion of feature maps of two VGG16 networks where the RGB and NIR images were input respectively). The improved networks were trained end-to-end using back-propagation and stochastic gradient descent techniques and compared to original VGG16 networks with RGB and NIR image input only. Results showed that the average precision (APs) of the original VGG16 with RGB and NIR image input only were 88.4% and 89.2% respectively, the 6-channel VGG16 using the Feature-Fusion method reached 90.5%, while that using the Image-Fusion method reached the highest AP of 90.7% and the fastest detection speed of 0.134 s/image. The results indicated that the proposed kiwifruit detection approach shows a potential for better fruit detection.
引用
收藏
页码:2327 / 2336
页数:10
相关论文
共 40 条
[1]   A practical approach for detection and classification of traffic signs using Convolutional Neural Networks [J].
Aghdam, Hamed Habibi ;
Heravi, Elnaz Jahani ;
Puig, Domenec .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 84 :97-112
[2]   基于机器视觉的猕猴桃特征参数提取附视频 [J].
慕军营 ;
陈军 ;
孙高杰 ;
刘斐 ;
马阳 ;
王峰霞 .
农机化研究, 2014, (06) :138-142
[3]  
[Anonymous], P IEEE 59 INT MIDW S
[4]   A Streampath-Based RCNN Approach to Ocean Eddy Detection [J].
Bai, Xue ;
Wang, Changbo ;
Li, Chenhui .
IEEE ACCESS, 2019, 7 :106336-106345
[5]   Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards [J].
Bargoti, Suchet ;
Underwood, James P. .
JOURNAL OF FIELD ROBOTICS, 2017, 34 (06) :1039-1060
[6]   Agricultural robots for field operations: Concepts and components [J].
Bechar, Avital ;
Vigneault, Clement .
BIOSYSTEMS ENGINEERING, 2016, 149 :94-111
[7]   The Food and Agriculture Organization of the UN and Asian LMEs: A commentary [J].
Brown, David ;
Hermes, Rudolf .
DEEP-SEA RESEARCH PART II-TOPICAL STUDIES IN OCEANOGRAPHY, 2019, 163 :124-126
[8]  
Cui YongJie Cui YongJie, 2013, Nongye Jixie Xuebao = Transactions of the Chinese Society for Agricultural Machinery, V44, P247
[9]  
Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036
[10]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848