Your browser doesn't support javascript.
loading
A wheat spike detection method based on Transformer.
Zhou, Qiong; Huang, Ziliang; Zheng, Shijian; Jiao, Lin; Wang, Liusan; Wang, Rujing.
Afiliação
  • Zhou Q; Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China.
  • Huang Z; Science Island Branch, University of Science and Technology of China, Hefei, China.
  • Zheng S; College of Information and Computer, Anhui Agricultural University, Hefei, China.
  • Jiao L; Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China.
  • Wang L; Science Island Branch, University of Science and Technology of China, Hefei, China.
  • Wang R; Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China.
Front Plant Sci ; 13: 1023924, 2022.
Article em En | MEDLINE | ID: mdl-36340370
ABSTRACT
Wheat spike detection has important research significance for production estimation and crop field management. With the development of deep learning-based algorithms, researchers tend to solve the detection task by convolutional neural networks (CNNs). However, traditional CNNs equip with the inductive bias of locality and scale-invariance, which makes it hard to extract global and long-range dependency. In this paper, we propose a Transformer-based network named Multi-Window Swin Transformer (MW-Swin Transformer). Technically, MW-Swin Transformer introduces the ability of feature pyramid network to extract multi-scale features and inherits the characteristic of Swin Transformer that performs self-attention mechanism by window strategy. Moreover, bounding box regression is a crucial step in detection. We propose a Wheat Intersection over Union loss by incorporating the Euclidean distance, area overlapping, and aspect ratio, thereby leading to better detection accuracy. We merge the proposed network and regression loss into a popular detection architecture, fully convolutional one-stage object detection, and name the unified model WheatFormer. Finally, we construct a wheat spike detection dataset (WSD-2022) to evaluate the performance of the proposed methods. The experimental results show that the proposed network outperforms those state-of-the-art algorithms with 0.459 mAP (mean average precision) and 0.918 AP50. It has been proved that our Transformer-based method is effective to handle wheat spike detection under complex field conditions.
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies / Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies / Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article