中国农业科技导报 ›› 2022, Vol. 24 ›› Issue (9): 96-105.DOI: 10.13304/j.nykjdb.2021.0612

• 智慧农业 农机装备 • 上一篇    下一篇

基于深度学习的高分辨率麦穗图像检测方法

赵越1(), 卫勇1, 单慧勇1, 穆志民2, 张健欣1, 吴海云1, 赵辉1(), 胡建龙3   

  1. 1.天津农学院工程技术学院, 天津 300384
    2.天津农学院基础科学学院, 天津 300384
    3.科芯(天津)生态农业科技有限公司, 天津 300450
  • 收稿日期:2021-07-24 接受日期:2021-11-22 出版日期:2022-09-15 发布日期:2022-10-11
  • 通讯作者: 赵辉
  • 作者简介:赵越 E-mail:119788584@qq.com
  • 基金资助:
    天津市科技计划项目(19YFZCSN00360);天津市教委科研计划项目(2017KJ180);天津市企业科技特派员项目(20YDTPJC01340)

Wheat Ear Detection Method Based on Deep Learning

Yue ZHAO1(), Yong WEI1, Huiyong SHAN1, Zhimin MU2, ZHANGJianxin1, Haiyun WU1, Hui ZHAO1(), Jianlong HU3   

  1. 1.College of Engineering and Technology, Tianjin Agricultural University, Tianjin 300384, China
    2.College of Basic Sciences, Tianjin Agricultural University, Tianjin 300384, China
    3.Coresense (Tianjin) EC0-Agri Technology Co. , Ltd, Tianjin 300450, China
  • Received:2021-07-24 Accepted:2021-11-22 Online:2022-09-15 Published:2022-10-11
  • Contact: Hui ZHAO

摘要:

小麦是重要的粮食作物之一,针对人工田间麦穗计数及产量预测效率低的问题,基于深度学习提出了一种高分辨率的小密集麦穗实时检测方法。对麦穗图像数据集进行图像分割、标注、增强预处理,基于Tensorflow搭建YOLOv4网络模型,调整改进后对其进行迁移学习;与YOLOv3、YOLOv4-tiny、Faster R-CNN训练模型进行对比,对改进模型的实用性与局限性进行分析;重点分析影响麦穗检测模型性能的关键因素。通过图像分割的方式,证明了通过改变图像分辨率确定麦穗所占图像最优像素比,可以提高前景与背景差异,对小密集麦穗有显著效果。通过对改进模型的测试,表明该模型检测精度高,鲁棒性强。不同分辨率、不同品种、不同时期的麦穗图像均类平均精度(mAP)为93.7%,单张图片的检测速度为52帧·s-1,满足了麦穗的高精度实时检测。该研究结果为田间麦穗计数以及产量预测提供技术支持。

关键词: 深度学习, 目标检测, 麦穗, YOLOv4, 实时检测

Abstract:

Wheat is one of the most important food crops in the world. In order to solve the problem of low efficiency of artificial field wheat ears counting and yield prediction, this paper proposed a real-time detection method based on deep learning for high resolution small and dense objects. The wheat ear images data set was preprocessed, including image segmentation, marking and enhancement, and the YOLOv4 network model was built based on TensorFlow to improve the transfer learning after adjusted; different models of YOLOv3, YOLOv4-Tiny and Faster R-CNN were trained and used compare to the practicability and limitations with the improved model; the key factors affecting the performance of the wheat ears detection model were discussed. By means of image segmentation, this paper studied the influence of images with different resolutions on model detection. The results showed that the difference between foreground and background could be improved by changing the image resolution, and the optimal pixel ratio of the target could be determined, which had a significant effect on small dense wheat ears and effectively improved the accuracy of the model. By testing the results of the improved model and evaluating the precision recall performance of different models, the model showed high detection accuracy and strong robustness. The mean average precision accuracy of wheat ear images with different resolutions, different varieties and different seasons was 93.7%, and the detection speed of single image was 52 frame·s-1, which met the real-time detection of wheat ears. The results provided a technical guarantee for wheat spike counting and yield prediction in the field.

Key words: deep learning, object detection, grain, YOLOv4, real-time detection

中图分类号: