A novel fish individual recognition method for precision farming based on knowledge distillation strategy and the range of the receptive field.
Yin, Jianhao; Wu, Junfeng; Gao, Chunqi; Yu, Hong; Liu, Liang; Guo, Shihao.
Affiliation
  • Yin J; College of Information Engineering, Dalian Ocean University, Dalian, China.
  • Wu J; Dalian Key Laboratory of Smart Fisheries, Dalian Ocean University, Dalian, China.
  • Gao C; Key Laboratory of Environment Controlled Aquaculture (Dalian Ocean University), Ministry of Education, Dalian Ocean University, Dalian, China.
  • Yu H; College of Information Engineering, Dalian Ocean University, Dalian, China.
  • Liu L; Dalian Key Laboratory of Smart Fisheries, Dalian Ocean University, Dalian, China.
  • Guo S; Key Laboratory of Environment Controlled Aquaculture (Dalian Ocean University), Ministry of Education, Dalian Ocean University, Dalian, China.
J Fish Biol; 2024 Jun 09.
Article in En | MEDLINE | ID: mdl-38852608
ABSTRACT
The continuous development of green, high-quality aquaculture technology has advanced the industrialization of aquaculture, and automation, intelligence, and precision have become the development trend of the industry. Fish individual recognition goes beyond determining the category of a fish to distinguishing individual fish, providing basic support for fish disease analysis, bait feeding, and precision aquaculture. However, the high similarity between individual fish and the complexity of the underwater environment present great challenges to fish individual recognition. To address these problems, we propose a novel fish individual recognition method for precision farming that rethinks the knowledge distillation strategy and the chunking method in the vision transformer. The method uses a traditional convolutional neural network as the teacher model and introduces a teacher token to guide the student model in learning fish texture features. We propose stride patch embedding, which expands the range of the receptive field and thus enhances the local continuity of the image, and self-attention-pruning, which discards unimportant tokens and reduces model computation. Experimental results on the DlouFish dataset show that the proposed method reaches an accuracy of 93.19%, an improvement of 3.25% over ECA Resnet152, and also outperforms other vision transformer models.
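The two mechanisms named in the abstract, stride patch embedding and token pruning, can be illustrated with a minimal sketch. The abstract does not give the patch size, the stride, or the exact pruning criterion, so everything below is an assumption for illustration: the function names are hypothetical, the stride is taken to be smaller than the patch size so that neighbouring patches overlap, and pruning is taken to mean keeping the tokens with the highest attention scores. It is not the authors' implementation.

```python
def num_patches(image_size: int, patch_size: int, stride: int) -> int:
    """Patches per side when a patch_size window slides with the given stride (no padding)."""
    if image_size < patch_size:
        raise ValueError("image must be at least as large as the patch")
    return (image_size - patch_size) // stride + 1


def extract_patches(image, patch_size, stride):
    """Cut a 2D image (list of rows) into square patches.

    With stride < patch_size the patches overlap, which is the assumed
    reading of "stride patch embedding" in this sketch.
    """
    height, width = len(image), len(image[0])
    patches = []
    for top in range(0, height - patch_size + 1, stride):
        for left in range(0, width - patch_size + 1, stride):
            rows = image[top:top + patch_size]
            patches.append([row[left:left + patch_size] for row in rows])
    return patches


def prune_tokens(tokens, scores, keep):
    """Keep the `keep` tokens with the highest scores, in their original order.

    `scores` stands in for per-token attention weights (an assumption;
    the abstract does not state how importance is measured).
    """
    ranked = sorted(range(len(tokens)), key=lambda i: scores[i], reverse=True)
    kept = sorted(ranked[:keep])
    return [tokens[i] for i in kept]


if __name__ == "__main__":
    # Non-overlapping vs. overlapping patch grid on a 224-pixel side
    # (224, 16, and 12 are illustrative values, not taken from the paper).
    print(num_patches(224, 16, 16), num_patches(224, 16, 12))
    image = [[r * 4 + c for c in range(4)] for r in range(4)]
    print(len(extract_patches(image, 2, 1)))
    print(prune_tokens(["a", "b", "c", "d"], [0.1, 0.7, 0.05, 0.15], 2))
```

Overlapping patches produce more tokens than a non-overlapping split of the same image, which raises the cost of self-attention; discarding low-scoring tokens is one way to offset that cost, and this is consistent with the abstract pairing the two ideas.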
Full text: 1 Collections: 01-international Database: MEDLINE Language: En Journal: J Fish Biol Year of publication: 2024 Document type: Article Country of affiliation: China
