RESUMO
Despite advances in deep learning for plant leaf disease recognition, accurately distinguishing morphological features under varying environmental conditions continues to pose significant challenges. Traditional deep learning models often fail to effectively merge local and global information, especially in small-scale datasets, impairing performance and elevating training costs. Focusing on citrus diseases, we propose an improved FasterViT Model, an advanced hybrid CNN-ViT framework that builds upon the FasterViT model. The proposed model seamlessly integrates CNN's rapid local learning capabilities with ViT's global information processing strength, thereby effectively extracting complex textures and morphological features from images. Cross-stage alternating Mixup and Cutout methods are strategically employed to enhance model robustness and generalization capabilities, particularly valuable for fast learning on small-scale datasets by simulating a more diverse training environment. Triplet Attention and AdaptiveAvgPool mechanisms are utilized to reduce training costs and optimize training performance. The proposed model is tested on both our specially constructed small-scale citrus disease dataset called in-field small dataset and the comprehensive PlantVillage dataset. The experimental results demonstrated that the model exhibits the capability of fast learning and adaptation to small sample training in plant disease detection tasks, and demonstrates the effectiveness of our improvement approach in improving model accuracy and reducing training costs. Additionally, its exemplary performance in transfer learning scenarios underscores its adaptability and broad applicability. This study not only highlights the efficacy of the improved FasterViT model in addressing the complexities of plant disease image recognition but also pioneers a new paradigm for developing efficient, scalable, and robust classification systems.
RESUMO
Problems: Plant Disease diagnosis based on deep learning mechanisms has been extensively studied and applied. However, the complex and dynamic agricultural growth environment results in significant variations in the distribution of state samples, and the lack of sufficient real disease databases weakens the information carried by the samples, posing challenges for accurately training models. Aim: This paper aims to test the feasibility and effectiveness of Denoising Diffusion Probabilistic Models (DDPM), Swin Transformer model, and Transfer Learning in diagnosing citrus diseases with a small sample. Methods: Two training methods are proposed: The Method 1 employs the DDPM to generate synthetic images for data augmentation. The Swin Transformer model is then used for pre-training on the synthetic dataset produced by DDPM, followed by fine-tuning on the original citrus leaf images for disease classification through transfer learning. The Method 2 utilizes the pre-trained Swin Transformer model on the ImageNet dataset and fine-tunes it on the augmented dataset composed of the original and DDPM synthetic images. Results and conclusion: The test results indicate that Method 1 achieved a validation accuracy of 96.3%, while Method 2 achieved a validation accuracy of 99.8%. Both methods effectively addressed the issue of model overfitting when dealing with a small dataset. Additionally, when compared with VGG16, EfficientNet, ShuffleNet, MobileNetV2, and DenseNet121 in citrus disease classification, the experimental results demonstrate the superiority of the proposed methods over existing approaches to a certain extent.