Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 89
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Ano de publicação
Tipo de documento
Intervalo de ano de publicação
1.
New Phytol ; 243(2): 797-810, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38807290

RESUMO

Automated pollen analysis is not yet efficient on environmental samples containing many pollen taxa and debris, which are typical in most pollen-based studies. Contrary to classification, detection remains overlooked although it is the first step from which errors can propagate. Here, we investigated a simple but efficient method to automate pollen detection for environmental samples, optimizing workload and performance. We applied the YOLOv5 algorithm on samples containing debris and c. 40 Mediterranean plant taxa, designed and tested several strategies for annotation, and analyzed variation in detection errors. About 5% of pollen grains were left undetected, while 5% of debris were falsely detected as pollen. Undetected pollen was mainly in poor-quality images, or of rare and irregular morphology. Pollen detection remained effective when applied to samples never seen by the algorithm, and was not improved by spending time to provide taxonomic details. Pollen detection of a single model taxon reduced annotation workload, but was only efficient for morphologically differentiated taxa. We offer guidelines to plant scientists to analyze automatically any pollen sample, providing sound criteria to apply for detection while using common and user-friendly tools. Our method contributes to enhance the efficiency and replicability of pollen-based studies.


Assuntos
Algoritmos , Pólen , Automação , Meio Ambiente , Processamento de Imagem Assistida por Computador/métodos
2.
BMC Med Imaging ; 24(1): 152, 2024 Jun 18.
Artigo em Inglês | MEDLINE | ID: mdl-38890604

RESUMO

BACKGROUND: Leishmaniasis is a vector-born neglected parasitic disease belonging to the genus Leishmania. Out of the 30 Leishmania species, 21 species cause human infection that affect the skin and the internal organs. Around, 700,000 to 1,000,000 of the newly infected cases and 26,000 to 65,000 deaths are reported worldwide annually. The disease exhibits three clinical presentations, namely, the cutaneous, muco-cutaneous and visceral Leishmaniasis which affects the skin, mucosal membrane and the internal organs, respectively. The relapsing behavior of the disease limits its diagnosis and treatment efficiency. The common diagnostic approaches follow subjective, error-prone, repetitive processes. Despite, an ever pressing need for an accurate detection of Leishmaniasis, the research conducted so far is scarce. In this regard, the main aim of the current research is to develop an artificial intelligence based detection tool for the Leishmaniasis from the Geimsa-stained microscopic images using deep learning method. METHODS: Stained microscopic images were acquired locally and labeled by experts. The images were augmented using different methods to prevent overfitting and improve the generalizability of the system. Fine-tuned Faster RCNN, SSD, and YOLOV5 models were used for object detection. Mean average precision (MAP), precision, and Recall were calculated to evaluate and compare the performance of the models. RESULTS: The fine-tuned YOLOV5 outperformed the other models such as Faster RCNN and SSD, with the MAP scores, of 73%, 54% and 57%, respectively. CONCLUSION: The currently developed YOLOV5 model can be tested in the clinics to assist the laboratorists in diagnosing Leishmaniasis from the microscopic images. Particularly, in low-resourced healthcare facilities, with fewer qualified medical professionals or hematologists, our AI support system can assist in reducing the diagnosing time, workload, and misdiagnosis. Furthermore, the dataset collected by us will be shared with other researchers who seek to improve upon the detection system of the parasite. The current model detects the parasites even in the presence of the monocyte cells, but sometimes, the accuracy decreases due to the differences in the sizes of the parasite cells alongside the blood cells. The incorporation of cascaded networks in future and the quantification of the parasite load, shall overcome the limitations of the currently developed system.


Assuntos
Corantes Azur , Aprendizado Profundo , Microscopia , Humanos , Microscopia/métodos , Leishmaniose/diagnóstico por imagem , Leishmaniose/parasitologia , Leishmania/isolamento & purificação
3.
BMC Med Imaging ; 24(1): 187, 2024 Jul 25.
Artigo em Inglês | MEDLINE | ID: mdl-39054448

RESUMO

OBJECTIVE: There are two major issues in the MRI image diagnosis task for Parkinson's disease. Firstly, there are slight differences in MRI images between healthy individuals and Parkinson's patients, and the medical field has not yet established precise lesion localization standards, which poses a huge challenge for the effective prediction of Parkinson's disease through MRI images. Secondly, the early diagnosis of Parkinson's disease traditionally relies on the subjective judgment of doctors, which leads to insufficient accuracy and consistency. This article proposes an improved YOLOv5 detection algorithm based on deep learning for predicting and classifying Parkinson's images. METHODS: This article improves the YOLOv5s network as the basic framework. Firstly, the CA attention mechanism was introduced to enable the model to dynamically adjust attention based on local features of the image, significantly enhancing the sensitivity of the model to PD related small pathological features; Secondly, replace the dynamic full dimensional convolution module to optimize the multi-level extraction of image features; Finally, the coupling head strategy is adopted to improve the execution efficiency of classification and localization tasks separately. RESULTS: We validated the effectiveness of the proposed method using a dataset of 582 MRI images from 108 patients. The results show that the proposed method achieves 0.961, 0.974, and 0.986 in Precision, Recall, and mAP, respectively, and the experimental results are superior to other algorithms. CONSLUSION: The improved model has achieved high accuracy and detection accuracy, and can accurately detect and recognize complex Parkinson's MRI images. SIGNIFICANCE: This algorithm has shown good performance in the early diagnosis of Parkinson's disease and can provide clinical assistance for doctors in early diagnosis. It compensates for the limitations of traditional methods.


Assuntos
Aprendizado Profundo , Imageamento por Ressonância Magnética , Doença de Parkinson , Humanos , Doença de Parkinson/diagnóstico por imagem , Doença de Parkinson/classificação , Imageamento por Ressonância Magnética/métodos , Algoritmos , Feminino , Masculino , Interpretação de Imagem Assistida por Computador/métodos , Idoso , Pessoa de Meia-Idade , Diagnóstico Precoce
4.
Sensors (Basel) ; 24(8)2024 Apr 19.
Artigo em Inglês | MEDLINE | ID: mdl-38676234

RESUMO

To enhance aerial image detection in complex environments characterized by multiple small targets and mutual occlusion, we propose an aerial target detection algorithm based on an improved version of YOLOv5 in this paper. Firstly, we employ an improved Mosaic algorithm to address redundant boundaries arising from varying image scales and to augment the training sample size, thereby enhancing detection accuracy. Secondly, we integrate the constructed hybrid attention module into the backbone network to enhance the model's capability in extracting pertinent feature information. Subsequently, we incorporate feature fusion layer 7 and P2 fusion into the neck network, leading to a notable enhancement in the model's capability to detect small targets. Finally, we replace the original PAN + FPN network structure with the optimized BiFPN (Bidirectional Feature Pyramid Network) to enable the model to preserve deeper semantic information, thereby enhancing detection capabilities for dense objects. Experimental results indicate a substantial improvement in both the detection accuracy and speed of the enhanced algorithm compared to its original version. It is noteworthy that the enhanced algorithm exhibits a markedly improved detection performance for aerial images, particularly under real-time conditions.

5.
Sensors (Basel) ; 24(7)2024 Mar 25.
Artigo em Inglês | MEDLINE | ID: mdl-38610313

RESUMO

Simultaneous localisation and mapping (SLAM) is crucial in mobile robotics. Most visual SLAM systems assume that the environment is static. However, in real life, there are many dynamic objects, which affect the accuracy and robustness of these systems. To improve the performance of visual SLAM systems, this study proposes a dynamic visual SLAM (SEG-SLAM) system based on the orientated FAST and rotated BRIEF (ORB)-SLAM3 framework and you only look once (YOLO)v5 deep-learning method. First, based on the ORB-SLAM3 framework, the YOLOv5 deep-learning method is used to construct a fusion module for target detection and semantic segmentation. This module can effectively identify and extract prior information for obviously and potentially dynamic objects. Second, differentiated dynamic feature point rejection strategies are developed for different dynamic objects using the prior information, depth information, and epipolar geometry method. Thus, the localisation and mapping accuracy of the SEG-SLAM system is improved. Finally, the rejection results are fused with the depth information, and a static dense 3D mapping without dynamic objects is constructed using the Point Cloud Library. The SEG-SLAM system is evaluated using public TUM datasets and real-world scenarios. The proposed method is more accurate and robust than current dynamic visual SLAM algorithms.

6.
Sensors (Basel) ; 24(7)2024 Mar 26.
Artigo em Inglês | MEDLINE | ID: mdl-38610324

RESUMO

The traditional rearview mirror method cannot fully guarantee safety when driving trucks. RGB and infrared images collected by cameras are used for registration and recognition, so as to achieve the perception of surroundings and ensure safe driving. The traditional scale-invariant feature transform (SIFT) algorithm has a mismatching rate, and the YOLO algorithm has an optimization space in feature extraction. To address these issues, this paper proposes a truck surround sensing technique based on multi-features and an improved YOLOv5 algorithm. Firstly, the edge corner points and infrared features of the preset target region are extracted, and then a feature point set containing the improved SIFT algorithm is generated for registration. Finally, the YOLOv5 algorithm is improved by fusing infrared features and introducing a composite prediction mechanism at the prediction end. The simulation results show that, on average, the image stitching accuracy is improved by 17%, the time is reduced by 89%, and the target recognition accuracy is improved by 2.86%. The experimental results show that this method can effectively perceive the surroundings of trucks, accurately identify targets, and reduce the missed alarm rate and false alarm rate.

7.
Sensors (Basel) ; 24(7)2024 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-38610404

RESUMO

Due to the limited semantic information extraction with small objects and difficulty in distinguishing similar targets, it brings great challenges to target detection in remote sensing scenarios, which results in poor detection performance. This paper proposes an improved YOLOv5 remote sensing image target detection algorithm, SEB-YOLO (SPD-Conv + ECSPP + Bi-FPN + YOLOv5). Firstly, the space-to-depth (SPD) layer followed by a non-strided convolution (Conv) layer module (SPD-Conv) was used to reconstruct the backbone network, which retained the global features and reduced the feature loss. Meanwhile, the pooling module with the attention mechanism of the final layer of the backbone network was designed to help the network better identify and locate the target. Furthermore, a bidirectional feature pyramid network (Bi-FPN) with bilinear interpolation upsampling was added to improve bidirectional cross-scale connection and weighted feature fusion. Finally, the decoupled head is introduced to enhance the model convergence and solve the contradiction between the classification task and the regression task. Experimental results on NWPU VHR-10 and RSOD datasets show that the mAP of the proposed algorithm reaches 93.5% and 93.9%respectively, which is 4.0% and 5.3% higher than that of the original YOLOv5l algorithm. The proposed algorithm achieves better detection results for complex remote sensing images.

8.
Sensors (Basel) ; 24(12)2024 Jun 14.
Artigo em Inglês | MEDLINE | ID: mdl-38931645

RESUMO

The high-altitude real-time inspection of unmanned aerial vehicles (UAVs) has always been a very challenging task. Because high-altitude inspections are susceptible to interference from different weather conditions, interference from communication signals and a larger field of view result in a smaller object area to be identified. We adopted a method that combines a UAV system scheduling platform with artificial intelligence object detection to implement the UAV automatic inspection technology. We trained the YOLOv5s model on five different categories of vehicle data sets, in which mAP50 and mAP50-95 reached 93.2% and 71.7%, respectively. The YOLOv5s model size is only 13.76 MB, and the detection speed of a single inspection photo reaches 11.26 ms. It is a relatively lightweight model and is suitable for deployment on edge devices for real-time detection. In the original DeepStream framework, we set up the http communication protocol to start quickly to enable different users to call and use it at the same time. In addition, asynchronous sending of alarm frame interception function was added and the auxiliary services were set up to quickly resume video streaming after interruption. We deployed the trained YOLOv5s model on the improved DeepStream framework to implement automatic UAV inspection.

9.
Sensors (Basel) ; 24(13)2024 Jun 26.
Artigo em Inglês | MEDLINE | ID: mdl-39000937

RESUMO

Although existing 3D object-detection methods have achieved promising results on conventional datasets, it is still challenging to detect objects in data collected under adverse weather conditions. Data distortion from LiDAR and cameras in such conditions leads to poor performance of traditional single-sensor detection methods. Multi-modal data-fusion methods struggle with data distortion and low alignment accuracy, making accurate target detection difficult. To address this, we propose a multi-modal object-detection algorithm, Snow-CLOCs, specifically for snowy conditions. In image detection, we improved the YOLOv5 algorithm by integrating the InceptionNeXt network to enhance feature extraction and using the Wise-IoU algorithm to reduce dependency on high-quality data. For LiDAR point-cloud detection, we built upon the SECOND algorithm and employed the DROR filter to remove noise, enhancing detection accuracy. We combined the detection results from the camera and LiDAR into a unified detection set, represented using a sparse tensor, and extracted features through a 2D convolutional neural network to achieve object detection and localization. Snow-CLOCs achieved a detection accuracy of 86.61% for vehicle detection in snowy conditions.

10.
Sensors (Basel) ; 24(14)2024 Jul 11.
Artigo em Inglês | MEDLINE | ID: mdl-39065900

RESUMO

Traditionally, monitoring insect populations involved the use of externally placed sticky paper traps, which were periodically inspected by a human operator. To automate this process, a specialized sensing device and an accurate model for detecting and counting insect pests are essential. Despite considerable progress in insect pest detector models, their practical application is hindered by the shortage of insect trap images. To attenuate the "lack of data" issue, the literature proposes data augmentation. However, our knowledge about data augmentation is still quite limited, especially in the field of insect pest detection. The aim of this experimental study was to investigate the effect of several widely used augmentation techniques and their combinations on remote-sensed trap images with the YOLOv5 (small) object detector model. This study was carried out systematically on two different datasets starting from the single geometric and photometric transformation toward their combinations. Our results show that the model's mean average precision value (mAP50) could be increased from 0.844 to 0.992 and from 0.421 to 0.727 on the two datasets using the appropriate augmentation methods combination. In addition, this study also points out that the integration of photometric image transformations into the mosaic augmentation can be more efficient than the native combination of augmentation techniques because this approach further improved the model's mAP50 values to 0.999 and 0.756 on the two test sets, respectively.


Assuntos
Insetos , Tecnologia de Sensoriamento Remoto , Animais , Insetos/fisiologia , Tecnologia de Sensoriamento Remoto/métodos , Processamento de Imagem Assistida por Computador/métodos , Algoritmos , Humanos
11.
Sensors (Basel) ; 24(10)2024 May 09.
Artigo em Inglês | MEDLINE | ID: mdl-38793857

RESUMO

Bearings are crucial components of machinery and equipment, and it is essential to inspect them thoroughly to ensure a high pass rate. Currently, bearing scratch detection is primarily carried out manually, which cannot meet industrial demands. This study presents research on the detection of bearing surface scratches. An improved YOLOV5 network, named YOLOV5-CDG, is proposed for detecting bearing surface defects using scratch images as targets. The YOLOV5-CDG model is based on the YOLOV5 network model with the addition of a Coordinate Attention (CA) mechanism module, fusion of Deformable Convolutional Networks (DCNs), and a combination with the GhostNet lightweight network. To achieve bearing surface scratch detection, a machine vision-based bearing surface scratch sensor system is established, and a self-made bearing surface scratch dataset is produced as the basis. The scratch detection final Average Precision (AP) value is 97%, which is 3.4% higher than that of YOLOV5. Additionally, the model has an accuracy of 99.46% for detecting defective and qualified products. The average detection time per image is 263.4 ms on the CPU device and 12.2 ms on the GPU device, demonstrating excellent performance in terms of both speed and accuracy. Furthermore, this study analyzes and compares the detection results of various models, demonstrating that the proposed method satisfies the requirements for detecting scratches on bearing surfaces in industrial settings.

12.
Sensors (Basel) ; 24(2)2024 Jan 09.
Artigo em Inglês | MEDLINE | ID: mdl-38257487

RESUMO

Considering the high incidence of accidents at tunnel construction sites, using robots to replace humans in hazardous tasks can effectively safeguard their lives. However, most robots currently used in this field require manual control and lack autonomous obstacle avoidance capability. To address these issues, we propose a lightweight model based on an improved version of YOLOv5 for obstacle detection. Firstly, to enhance detection speed and reduce computational load, we modify the backbone network to the lightweight Shufflenet v2. Secondly, we introduce a coordinate attention mechanism to enhance the network's ability to learn feature representations. Subsequently, we replace the neck convolution block with GSConv to improve the model's efficiency. Finally, we modify the model's upsampling method to further enhance detection accuracy. Through comparative experiments on the model, the results demonstrate that our approach achieves an approximately 37% increase in detection speed with a minimal accuracy reduction of 1.5%. The frame rate has improved by about 54%, the parameter count has decreased by approximately 74%, and the model size has decreased by 2.5 MB. The experimental results indicate that our method can reduce hardware requirements for the model, striking a balance between detection speed and accuracy.

13.
Sensors (Basel) ; 24(15)2024 Aug 02.
Artigo em Inglês | MEDLINE | ID: mdl-39124057

RESUMO

With the increasing importance of subways in urban public transportation systems, pedestrian flow simulation for supporting station management and risk analysis becomes more necessary. There is a need to calibrate the simulation model parameters with real-world pedestrian flow data to achieve a simulation closer to the real situation. This study presents a calibration approach based on YOLOv5 for calibrating the simulation model parameters in the social force model inserted in Anylogic. This study compared the simulation results after model calibration with real data. The results show that (1) the parameters calibrated in this paper can reproduce the characteristics of pedestrian flow in the station; (2) the calibration model not only decreases global errors but also overcomes the common phenomenon of large differences between simulation and reality.

14.
Sensors (Basel) ; 24(5)2024 Feb 20.
Artigo em Inglês | MEDLINE | ID: mdl-38474909

RESUMO

Simultaneous Localization and Mapping (SLAM), as one of the core technologies in intelligent robotics, has gained substantial attention in recent years. Addressing the limitations of SLAM systems in dynamic environments, this research proposes a system specifically designed for plant factory transportation environments, named GY-SLAM. GY-SLAM incorporates a lightweight target detection network, GY, based on YOLOv5, which utilizes GhostNet as the backbone network. This integration is further enhanced with CoordConv coordinate convolution, CARAFE up-sampling operators, and the SE attention mechanism, leading to simultaneous improvements in detection accuracy and model complexity reduction. While mAP@0.5 increased by 0.514% to 95.364, the model simultaneously reduced the number of parameters by 43.976%, computational cost by 46.488%, and model size by 41.752%. Additionally, the system constructs pure static octree maps and grid maps. Tests conducted on the TUM dataset and a proprietary dataset demonstrate that GY-SLAM significantly outperforms ORB-SLAM3 in dynamic scenarios in terms of system localization accuracy and robustness. It shows a remarkable 92.59% improvement in RMSE for Absolute Trajectory Error (ATE), along with a 93.11% improvement in RMSE for the translational drift of Relative Pose Error (RPE) and a 92.89% improvement in RMSE for the rotational drift of RPE. Compared to YOLOv5s, the GY model brings a 41.5944% improvement in detection speed and a 17.7975% increase in SLAM operation speed to the system, indicating strong competitiveness and real-time capabilities. These results validate the effectiveness of GY-SLAM in dynamic environments and provide substantial support for the automation of logistics tasks by robots in specific contexts.

15.
Sensors (Basel) ; 24(5)2024 Feb 24.
Artigo em Inglês | MEDLINE | ID: mdl-38475010

RESUMO

This article presents the development of a vision system designed to enhance the autonomous navigation capabilities of robots in complex forest environments. Leveraging RGBD and thermic cameras, specifically the Intel RealSense 435i and FLIR ADK, the system integrates diverse visual sensors with advanced image processing algorithms. This integration enables robots to make real-time decisions, recognize obstacles, and dynamically adjust their trajectories during operation. The article focuses on the architectural aspects of the system, emphasizing the role of sensors and the formulation of algorithms crucial for ensuring safety during robot navigation in challenging forest terrains. Additionally, the article discusses the training of two datasets specifically tailored to forest environments, aiming to evaluate their impact on autonomous navigation. Tests conducted in real forest conditions affirm the effectiveness of the developed vision system. The results underscore the system's pivotal contribution to the autonomous navigation of robots in forest environments.


Assuntos
Dispositivos Ópticos , Robótica , Robótica/métodos , Agricultura Florestal , Algoritmos , Processamento de Imagem Assistida por Computador
16.
Sensors (Basel) ; 24(6)2024 Mar 20.
Artigo em Inglês | MEDLINE | ID: mdl-38544233

RESUMO

Kelp, often referred to as a "sea vegetable", holds substantial economic significance. Currently, the drying process for kelp in China primarily relies on outdoor sun-drying methods. Detecting kelp in the field presents challenges arising from issues such as overlapping and obstruction. To address these challenges, this study introduces a lightweight model, K-YOLOv5, specifically designed for the precise detection of sun-dried kelp. YOLOv5-n serves as the base model, with several enhancements implemented in this study: the addition of a detection head incorporating an upsampling layer and a convolution module to improve the recognition of small objects; the integration of an enhanced I-CBAM attention mechanism, focusing on key features to enhance the detection accuracy; the replacement of the CBS module in the neck network with GSConv to reduce the computational burden and accelerate the inference speed; and the optimization of the IoU algorithm to improve the identification of overlapping kelp. Utilizing drone-captured images of sun-dried kelp, a dataset comprising 2190 images is curated. Validation on this self-constructed dataset indicates that the improved K-YOLOv5 model significantly enhances the detection accuracy, achieving 88% precision and 78.4% recall. These values represent 6.8% and 8.6% improvements over the original model, respectively, meeting the requirements for the real-time recognition of sun-dried kelp.

17.
Sensors (Basel) ; 24(7)2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38610342

RESUMO

In the field of intelligent connected vehicles, the precise and real-time identification of speed bumps is critically important for the safety of autonomous driving. To address the issue that existing visual perception algorithms struggle to simultaneously maintain identification accuracy and real-time performance amidst image distortion and complex environmental conditions, this study proposes an enhanced lightweight neural network framework, YOLOv5-FPNet. This framework strengthens perception capabilities in two key phases: feature extraction and loss constraint. Firstly, FPNet, based on FasterNet and Dynamic Snake Convolution, is developed to adaptively extract structural features of distorted speed bumps with accuracy. Subsequently, the C3-SFC module is proposed to augment the adaptability of the neck and head components to distorted features. Furthermore, the SimAM attention mechanism is embedded within the backbone to enhance the ability of key feature extraction. Finally, an adaptive loss function, Inner-WiseIoU, based on a dynamic non-monotonic focusing mechanism, is designed to improve the generalization and fitting ability of bounding boxes. Experimental evaluations on a custom speed bumps dataset demonstrate the superior performance of FPNet, with significant improvements in key metrics such as the mAP, mAP50_95, and FPS by 38.76%, 143.15%, and 51.23%, respectively, compared to conventional lightweight neural networks. Ablation studies confirm the effectiveness of the proposed improvements. This research provides a fast and accurate speed bump detection solution for autonomous vehicles, offering theoretical insights for obstacle recognition in intelligent vehicle systems.

18.
Sensors (Basel) ; 24(7)2024 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-38610467

RESUMO

Lineament is a unique geological structure. The study of Lunar lineament structure has great significance on understanding its history and evolution of Lunar surface. However, the existing geographic feature extraction methods are not suitable for the extraction of Lunar lineament structure. In this paper, a new lineament extraction method is proposed based on improved-UNet++ and YOLOv5. Firstly, new lineament dataset is created containing lineaments structure based on CCD data from LROC. At same time the residual blocks are replaced with the VGG blocks in the down sample part of the UNet++ with adding the attention block between each layer. Secondly, the improved-UNet++ and YOLO networks are trained to execute the object detection and semantic segmentation of lineament structure respectively. Finally, a polygon-match strategy is proposed to combine the results of object detection and semantic segmentation. The experiment result indicate that this new method has relatively better and more stable performance compared with current mainstream networks and the original UNet++ network in the instance segmentation of lineament structure. Additionally, the polygon-match strategy is able to perform preciser edge detail in the instance segmentation of lineament structure result.

19.
Sensors (Basel) ; 24(13)2024 Jul 05.
Artigo em Inglês | MEDLINE | ID: mdl-39001149

RESUMO

The efficient and accurate identification of traffic signs is crucial to the safety and reliability of active driving assistance and driverless vehicles. However, the accurate detection of traffic signs under extreme cases remains challenging. Aiming at the problems of missing detection and false detection in traffic sign recognition in fog traffic scenes, this paper proposes a recognition algorithm for traffic signs based on pix2pixHD+YOLOv5-T. Firstly, the defogging model is generated by training the pix2pixHD network to meet the advanced visual task. Secondly, in order to better match the defogging algorithm with the target detection algorithm, the algorithm YOLOv5-Transformer is proposed by introducing a transformer module into the backbone of YOLOv5. Finally, the defogging algorithm pix2pixHD is combined with the improved YOLOv5 detection algorithm to complete the recognition of traffic signs in foggy environments. Comparative experiments proved that the traffic sign recognition algorithm proposed in this paper can effectively reduce the impact of a foggy environment on traffic sign recognition. Compared with the YOLOv5-T and YOLOv5 algorithms in moderate fog environments, the overall improvement of this algorithm is achieved. The precision of traffic sign recognition of the algorithm in the fog traffic scene reached 78.5%, the recall rate was 72.2%, and mAP@0.5 was 82.8%.

20.
Sensors (Basel) ; 24(9)2024 Apr 27.
Artigo em Inglês | MEDLINE | ID: mdl-38732896

RESUMO

Accurate and fast recognition of vehicle license plates from natural scene images is a crucial and challenging task. Existing methods can recognize license plates in simple scenarios, but their performance degrades significantly in complex environments. A novel license plate detection and recognition model YOLOv5-PDLPR is proposed, which employs YOLOv5 target detection algorithm in the license plate detection part and uses the PDLPR algorithm proposed in this paper in the license plate recognition part. The PDLPR algorithm is mainly designed as follows: (1) A Multi-Head Attention mechanism is used to accurately recognize individual characters. (2) A global feature extractor network is designed to improve the completeness of the network for feature extraction. (3) The latest parallel decoder architecture is adopted to improve the inference efficiency. The experimental results show that the proposed algorithm has better accuracy and speed than the comparison algorithms, can achieve real-time recognition, and has high efficiency and robustness in complex scenes.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA