Pesquisa | BVS Violência e Saúde

1.

Independence and synergy of spatial attention in the two visual systems of jumping spiders.

Loconsole, Maria; Ferrante, Federico; Giacomazzi, Davide; De Agrò, Massimo.

J Exp Biol ; 2024 Sep 26.

Artigo em Inglês | MEDLINE | ID: mdl-39324315

RESUMO

By selectively focusing on a specific portion of the environment, animals can solve the problem of information overload, toning down irrelevant inputs and concentrating only on the relevant ones. This may be of particular relevance for animals such as the jumping spider, which possess a wide visual field of almost 360° and thus could benefit from a low-cost system for sharpening attention. Jumping spiders have a modular visual system composed of four pairs of eyes, of which only the two frontal eyes (i.e., AMEs) are motile, whereas the other secondary pairs remain immobile. We hypothesized that jumping spiders can exploit both principal and secondary eyes for stimulus detection and attentional shift, with the two systems working synergistically. In Experiment 1 we investigated AMEs' attentional responses following a spatial cue presented to the secondary eyes. In Experiment 2, we tested for enhanced attention in the secondary eyes' visual field congruent with the direction of the AMEs' focus. In both experiments, we observed that animals were faster and more accurate in detecting a target when it appeared in a direction opposite to that of the initial cue. In contrast with our initial hypothesis, these results would suggest that attention is segregated across eyes, with each system working on compensating the other by attending to different spatial locations.

2.

Advancing primate surveillance with image recognition techniques from unmanned aerial vehicles.

He, Gang; Zhang, Xiao; Wang, Jie; Xu, Pengfei; Hou, Xiduo; Dong, Wei; Lei, Yinghu; Jin, Xuelin; Wang, Weifeng; Tian, Wenyong; Huang, Yan; Li, Desheng; Qin, Tianyu; Wang, Jing; Pan, Ruliang; Li, Baoguo; Guo, Songtao.

Am J Primatol ; : e23676, 2024 Aug 15.

Artigo em Inglês | MEDLINE | ID: mdl-39148233

RESUMO

Using unmanned aerial vehicles (UAVs) for surveys on thermostatic animals has gained prominence due to their ability to provide practical and precise dynamic censuses, contributing to developing and refining conservation strategies. However, the practical application of UAVs for animal monitoring necessitates the automation of image interpretation to enhance their effectiveness. Based on our past experiences, we present the Sichuan snub-nosed monkey (Rhinopithecus roxellana) as a case study to illustrate the effective use of thermal cameras mounted on UAVs for monitoring monkey populations in Qinling, a region characterized by magnificent biodiversity. We used the local contrast method for a small infrared target detection algorithm to collect the total population size. Through the experimental group, we determined the average optimal grayscale threshold, while the validation group confirmed that this threshold enables automatic detection and counting of target animals in similar datasets. The precision rate obtained from the experiments ranged from 85.14% to 97.60%. Our findings reveal a negative correlation between the minimum average distance between thermal spots and the count of detected individuals, indicating higher interference in images with closer thermal spots. We propose a formula for adjusting primate population estimates based on detection rates obtained from UAV surveys. Our results demonstrate the practical application of UAV-based thermal imagery and automated detection algorithms for primate monitoring, albeit with consideration of environmental factors and the need for data preprocessing. This study contributes to advancing the application of UAV technology in wildlife monitoring, with implications for conservation management and research.

3.

Dense Pedestrian Detection Based on GR-YOLO.

Li, Nianfeng; Bai, Xinlu; Shen, Xiangfeng; Xin, Peizeng; Tian, Jia; Chai, Tengfei; Wang, Zhenyan.

Sensors (Basel) ; 24(14)2024 Jul 22.

Artigo em Inglês | MEDLINE | ID: mdl-39066144

RESUMO

In large public places such as railway stations and airports, dense pedestrian detection is important for safety and security. Deep learning methods provide relatively effective solutions but still face problems such as feature extraction difficulties, image multi-scale variations, and high leakage detection rates, which bring great challenges to the research in this field. In this paper, we propose an improved dense pedestrian detection algorithm GR-yolo based on Yolov8. GR-yolo introduces the repc3 module to optimize the backbone network, which enhances the ability of feature extraction, adopts the aggregation-distribution mechanism to reconstruct the yolov8 neck structure, fuses multi-level information, achieves a more efficient exchange of information, and enhances the detection ability of the model. Meanwhile, the Giou loss calculation is used to help GR-yolo converge better, improve the detection accuracy of the target position, and reduce missed detection. Experiments show that GR-yolo has improved detection performance over yolov8, with a 3.1% improvement in detection means accuracy on the wider people dataset, 7.2% on the crowd human dataset, and 11.7% on the people detection images dataset. Therefore, the proposed GR-yolo algorithm is suitable for dense, multi-scale, and scene-variable pedestrian detection, and the improvement also provides a new idea to solve dense pedestrian detection in real scenes.

4.

Research on Lightweight Method of Insulator Target Detection Based on Improved SSD.

Zeng, Bing; Zhou, Yu; He, Dilin; Zhou, Zhihao; Hao, Shitao; Yi, Kexin; Li, Zhilong; Zhang, Wenhua; Xie, Yunmin.

Sensors (Basel) ; 24(18)2024 Sep 12.

Artigo em Inglês | MEDLINE | ID: mdl-39338655

RESUMO

Aiming at the problems of a large volume, slow processing speed, and difficult deployment in the edge terminal, this paper proposes a lightweight insulator detection algorithm based on an improved SSD. Firstly, the original feature extraction network VGG-16 is replaced by a lightweight Ghost Module network to initially achieve the lightweight model. A Feature Pyramid structure and Feature Pyramid Network (FPN+PAN) are integrated into the Neck part and a Simplified Spatial Pyramid Pooling Fast (SimSPPF) module is introduced to realize the integration of local features and global features. Secondly, multiple Spatial and Channel Squeeze-and-Excitation (scSE) attention mechanisms are introduced in the Neck part to make the model pay more attention to the channels containing important feature information. The original six detection heads are reduced to four to improve the inference speed of the network. In order to improve the recognition performance of occluded and overlapping targets, DIoU-NMS was used to replace the original non-maximum suppression (NMS). Furthermore, the channel pruning strategy is used to reduce the unimportant weight matrix of the model, and the knowledge distillation strategy is used to fine-adjust the network model after pruning, so as to ensure the detection accuracy. The experimental results show that the parameter number of the proposed model is reduced from 26.15 M to 0.61 M, the computational load is reduced from 118.95 G to 1.49 G, and the mAP is increased from 96.8% to 98%. Compared with other models, the proposed model not only guarantees the detection accuracy of the algorithm, but also greatly reduces the model volume, which provides support for the realization of visible light insulator target detection based on edge intelligence.

5.

A Study of Classroom Behavior Recognition Incorporating Super-Resolution and Target Detection.

Zhang, Xiaoli; Nie, Jialei; Wei, Shoulin; Zhu, Guifu; Dai, Wei; Yang, Can.

Sensors (Basel) ; 24(17)2024 Aug 30.

Artigo em Inglês | MEDLINE | ID: mdl-39275552

RESUMO

With the development of educational technology, machine learning and deep learning provide technical support for traditional classroom observation assessment. However, in real classroom scenarios, the technique faces challenges such as lack of clarity of raw images, complexity of datasets, multi-target detection errors, and complexity of character interactions. Based on the above problems, a student classroom behavior recognition network incorporating super-resolution and target detection is proposed. To cope with the problem of unclear original images in the classroom scenario, SRGAN (Super Resolution Generative Adversarial Network for Images) is used to improve the image resolution and thus the recognition accuracy. To address the dataset complexity and multi-targeting problems, feature extraction is optimized, and multi-scale feature recognition is enhanced by introducing AKConv and LASK attention mechanisms into the Backbone module of the YOLOv8s algorithm. To improve the character interaction complexity problem, the CBAM attention mechanism is integrated to enhance the recognition of important feature channels and spatial regions. Experiments show that it can detect six behaviors of students-raising their hands, reading, writing, playing on their cell phones, looking down, and leaning on the table-in high-definition images. And the accuracy and robustness of this network is verified. Compared with small-object detection algorithms such as Faster R-CNN, YOLOv5, and YOLOv8s, this network demonstrates good detection performance on low-resolution small objects, complex datasets with numerous targets, occlusion, and overlapping students.

6.

An MRS-YOLO Model for High-Precision Waste Detection and Classification.

Ren, Yuanming; Li, Yizhe; Gao, Xinya.

Sensors (Basel) ; 24(13)2024 Jul 04.

Artigo em Inglês | MEDLINE | ID: mdl-39001117

RESUMO

With the advancement in living standards, there has been a significant surge in the quantity and diversity of household waste. To safeguard the environment and optimize resource utilization, there is an urgent demand for effective and cost-efficient intelligent waste classification methodologies. This study presents MRS-YOLO (Multi-Resolution Strategy-YOLO), a waste detection and classification model. The paper introduces the SlideLoss_IOU technique for detecting small objects, integrates RepViT of the Transformer mechanism, and devises a novel feature extraction strategy by amalgamating multi-dimensional and dynamic convolution mechanisms. These enhancements not only elevate the detection accuracy and speed but also bolster the robustness of the current YOLO model. Validation conducted on a dataset comprising 12,072 samples across 10 categories, including recyclable metal and paper, reveals a 3.6% enhancement in mAP50% accuracy compared to YOLOv8, coupled with a 15.09% reduction in volume. Furthermore, the model demonstrates improved accuracy in detecting small targets and exhibits comprehensive detection capabilities across diverse scenarios. For transparency and to facilitate further research, the source code and related datasets used in this study have been made publicly available at GitHub.

7.

A Method for Real-Time Recognition of Safflower Filaments in Unstructured Environments Using the YOLO-SaFi Model.

Chen, Bangbang; Ding, Feng; Ma, Baojian; Wang, Liqiang; Ning, Shanping.

Sensors (Basel) ; 24(13)2024 Jul 08.

Artigo em Inglês | MEDLINE | ID: mdl-39001189

RESUMO

The identification of safflower filament targets and the precise localization of picking points are fundamental prerequisites for achieving automated filament retrieval. In light of challenges such as severe occlusion of targets, low recognition accuracy, and the considerable size of models in unstructured environments, this paper introduces a novel lightweight YOLO-SaFi model. The architectural design of this model features a Backbone layer incorporating the StarNet network; a Neck layer introducing a novel ELC convolution module to refine the C2f module; and a Head layer implementing a new lightweight shared convolution detection head, Detect_EL. Furthermore, the loss function is enhanced by upgrading CIoU to PIoUv2. These enhancements significantly augment the model's capability to perceive spatial information and facilitate multi-feature fusion, consequently enhancing detection performance and rendering the model more lightweight. Performance evaluations conducted via comparative experiments with the baseline model reveal that YOLO-SaFi achieved a reduction of parameters, computational load, and weight files by 50.0%, 40.7%, and 48.2%, respectively, compared to the YOLOv8 baseline model. Moreover, YOLO-SaFi demonstrated improvements in recall, mean average precision, and detection speed by 1.9%, 0.3%, and 88.4 frames per second, respectively. Finally, the deployment of the YOLO-SaFi model on the Jetson Orin Nano device corroborates the superior performance of the enhanced model, thereby establishing a robust visual detection framework for the advancement of intelligent safflower filament retrieval robots in unstructured environments.

8.

Dazzling Evaluation of the Impact of a High-Repetition-Rate CO₂ Pulsed Laser on Infrared Imaging Systems.

Zheng, Hanyu; Wang, Yunzhe; Liu, Yang; Sun, Tao; Shao, Junfeng.

Sensors (Basel) ; 24(6)2024 Mar 12.

Artigo em Inglês | MEDLINE | ID: mdl-38544089

RESUMO

This article utilizes the Canny edge extraction algorithm based on contour curvature and the cross-correlation template matching algorithm to extensively study the impact of a high-repetition-rate CO2 pulsed laser on the target extraction and tracking performance of an infrared imaging detector. It establishes a quantified dazzling pattern for lasers on infrared imaging systems. By conducting laser dazzling and damage experiments, a detailed analysis of the normalized correlation between the target and the dazzling images is performed to quantitatively describe the laser dazzling effects. Simultaneously, an evaluation system, including target distance and laser power evaluation factors, is established to determine the dazzling level and whether the target is recognizable. The research results reveal that the laser power and target position are crucial factors affecting the detection performance of infrared imaging detector systems under laser dazzling. Different laser powers are required to successfully interfere with the recognition algorithm of the infrared imaging detector at different distances. And laser dazzling produces a considerable quantity of false edge information, which seriously affects the performance of the pattern recognition algorithm. In laser damage experiments, the detector experienced functional damage, with a quarter of the image displaying as completely black. The energy density threshold required for the functional damage of the detector is approximately 3 J/cm2. The dazzling assessment conclusions also apply to the evaluation of the damage results. Finally, the proposed evaluation formula aligns with the experimental results, objectively reflecting the actual impact of laser dazzling on the target extraction and the tracking performance of infrared imaging systems. This study provides an in-depth and accurate analysis for understanding the influence of lasers on the performance of infrared imaging detectors.

9.

YOLOv8-C2f-Faster-EMA: An Improved Underwater Trash Detection Model Based on YOLOv8.

Zhu, Jin; Hu, Tao; Zheng, Linhan; Zhou, Nan; Ge, Huilin; Hong, Zhichao.

Sensors (Basel) ; 24(8)2024 Apr 12.

Artigo em Inglês | MEDLINE | ID: mdl-38676100

RESUMO

Anthropogenic waste deposition in aquatic environments precipitates a decline in water quality, engendering pollution that adversely impacts human health, ecological integrity, and economic endeavors. The evolution of underwater robotic technologies heralds a new era in the timely identification and extraction of submerged litter, offering a proactive measure against the scourge of water pollution. This study introduces a refined YOLOv8-based algorithm tailored for the enhanced detection of small-scale underwater debris, aiming to mitigate the prevalent challenges of high miss and false detection rates in aquatic settings. The research presents the YOLOv8-C2f-Faster-EMA algorithm, which optimizes the backbone, neck layer, and C2f module for underwater characteristics and incorporates an effective attention mechanism. This algorithm improves the accuracy of underwater litter detection while simplifying the computational model. Empirical evidence underscores the superiority of this method over the conventional YOLOv8n framework, manifesting in a significant uplift in detection performance. Notably, the proposed method realized a 6.7% increase in precision (P), a 4.1% surge in recall (R), and a 5% enhancement in mean average precision (mAP). Transcending its foundational utility in marine conservation, this methodology harbors potential for subsequent integration into remote sensing ventures. Such an adaptation could substantially enhance the precision of detection models, particularly in the realm of localized surveillance, thereby broadening the scope of its applicability and impact.

10.

Research on Human Posture Estimation Algorithm Based on YOLO-Pose.

Ding, Jing; Niu, Shanwei; Nie, Zhigang; Zhu, Wenyu.

Sensors (Basel) ; 24(10)2024 May 10.

Artigo em Inglês | MEDLINE | ID: mdl-38793891

RESUMO

In response to the numerous challenges faced by traditional human pose recognition methods in practical applications, such as dense targets, severe edge occlusion, limited application scenarios, complex backgrounds, and poor recognition accuracy when targets are occluded, this paper proposes a YOLO-Pose algorithm for human pose estimation. The specific improvements are divided into four parts. Firstly, in the Backbone section of the YOLO-Pose model, lightweight GhostNet modules are introduced to reduce the model's parameter count and computational requirements, making it suitable for deployment on unmanned aerial vehicles (UAVs). Secondly, the ACmix attention mechanism is integrated into the Neck section to improve detection speed during object judgment and localization. Furthermore, in the Head section, key points are optimized using coordinate attention mechanisms, significantly enhancing key point localization accuracy. Lastly, the paper improves the loss function and confidence function to enhance the model's robustness. Experimental results demonstrate that the improved model achieves a 95.58% improvement in mAP50 and a 69.54% improvement in mAP50-95 compared to the original model, with a reduction of 14.6 M parameters. The model achieves a detection speed of 19.9 ms per image, optimized by 30% and 39.5% compared to the original model. Comparisons with other algorithms such as Faster R-CNN, SSD, YOLOv4, and YOLOv7 demonstrate varying degrees of performance improvement.

Assuntos

Algoritmos , Postura , Humanos , Postura/fisiologia , Dispositivos Aéreos não Tripulados , Processamento de Imagem Assistida por Computador/métodos

11.

Enhanced Lightweight YOLOX for Small Object Wildfire Detection in UAV Imagery.

Luan, Tian; Zhou, Shixiong; Zhang, Guokang; Song, Zechun; Wu, Jiahui; Pan, Weijun.

Sensors (Basel) ; 24(9)2024 Apr 24.

Artigo em Inglês | MEDLINE | ID: mdl-38732816

RESUMO

Target detection technology based on unmanned aerial vehicle (UAV)-derived aerial imagery has been widely applied in the field of forest fire patrol and rescue. However, due to the specificity of UAV platforms, there are still significant issues to be resolved such as severe omission, low detection accuracy, and poor early warning effectiveness. In light of these issues, this paper proposes an improved YOLOX network for the rapid detection of forest fires in images captured by UAVs. Firstly, to enhance the network's feature-extraction capability in complex fire environments, a multi-level-feature-extraction structure, CSP-ML, is designed to improve the algorithm's detection accuracy for small-target fire areas. Additionally, a CBAM attention mechanism is embedded in the neck network to reduce interference caused by background noise and irrelevant information. Secondly, an adaptive-feature-extraction module is introduced in the YOLOX network's feature fusion part to prevent the loss of important feature information during the fusion process, thus enhancing the network's feature-learning capability. Lastly, the CIoU loss function is used to replace the original loss function, to address issues such as excessive optimization of negative samples and poor gradient-descent direction, thereby strengthening the network's effective recognition of positive samples. Experimental results show that the improved YOLOX network has better detection performance, with mAP@50 and mAP@50_95 increasing by 6.4% and 2.17%, respectively, compared to the traditional YOLOX network. In multi-target flame and small-target flame scenarios, the improved YOLO model achieved a mAP of 96.3%, outperforming deep learning algorithms such as FasterRCNN, SSD, and YOLOv5 by 33.5%, 7.7%, and 7%, respectively. It has a lower omission rate and higher detection accuracy, and it is capable of handling small-target detection tasks in complex fire environments. This can provide support for UAV patrol and rescue applications from a high-altitude perspective.

12.

EMA-YOLO: A Novel Target-Detection Algorithm for Immature Yellow Peach Based on YOLOv8.

Xu, Dandan; Xiong, Hao; Liao, Yue; Wang, Hongruo; Yuan, Zhizhang; Yin, Hua.

Sensors (Basel) ; 24(12)2024 Jun 11.

Artigo em Inglês | MEDLINE | ID: mdl-38931568

RESUMO

Accurate determination of the number and location of immature small yellow peaches is crucial for bagging, thinning, and estimating yield in modern orchards. However, traditional methods have faced challenges in accurately distinguishing immature yellow peaches due to their resemblance to leaves and susceptibility to variations in shooting angles and distance. To address these issues, we proposed an improved target-detection model (EMA-YOLO) based on YOLOv8. Firstly, the sample space was enhanced algorithmically to improve the diversity of samples. Secondly, an EMA attention-mechanism module was introduced to encode global information; this module could further aggregate pixel-level features through dimensional interaction and strengthen small-target-detection capability by incorporating a 160 × 160 detection head. Finally, EIoU was utilized as a loss function to reduce the incidence of missed detections and false detections of the target small yellow peaches under the condition of high density of yellow peaches. Experimental results show that compared with the original YOLOv8n model, the EMA-YOLO model improves mAP by 4.2%, Furthermore, compared with SDD, Objectbox, YOLOv5n, and YOLOv7n, this model's mAP was improved by 30.1%, 14.2%,15.6%, and 7.2%, respectively. In addition, the EMA-YOLO model achieved good results under different conditions of illumination and shooting distance and significantly reduced the number of missed detections. Therefore, this method can provide technical support for smart management of yellow-peach orchards.

13.

Real-Time Detection of Unauthorized Unmanned Aerial Vehicles Using SEB-YOLOv8s.

Fang, Ao; Feng, Song; Liang, Bo; Jiang, Ji.

Sensors (Basel) ; 24(12)2024 Jun 17.

Artigo em Inglês | MEDLINE | ID: mdl-38931699

RESUMO

Aiming at real-time detection of UAVs, small UAV targets are easily missed and difficult to detect in complex backgrounds. To maintain high detection performance while reducing memory and computational costs, this paper proposes the SEB-YOLOv8s detection method. Firstly, the YOLOv8 network structure is reconstructed using SPD-Conv to reduce the computational burden and accelerate the processing speed while retaining more shallow features of small targets. Secondly, we design the AttC2f module and replace the C2f module in the backbone of YOLOv8s with it, enhancing the model's ability to obtain accurate information and enriching the extracted relevant information. Finally, Bi-Level Routing Attention is introduced to optimize the Neck part of the network, reducing the model's attention to interfering information and filtering it out. The experimental results show that the mAP50 of the proposed method reaches 90.5% and the accuracy reaches 95.9%, which are improvements of 2.2% and 1.9%, respectively, compared with the original model. The mAP50-95 is improved by 2.7%, and the model's occupied memory size only increases by 2.5 MB, effectively achieving high-accuracy real-time detection with low memory consumption.

14.

SEB-YOLO: An Improved YOLOv5 Model for Remote Sensing Small Target Detection.

Hui, Yan; You, Shijie; Hu, Xiuhua; Yang, Panpan; Zhao, Jing.

Sensors (Basel) ; 24(7)2024 Mar 29.

Artigo em Inglês | MEDLINE | ID: mdl-38610404

RESUMO

Due to the limited semantic information extraction with small objects and difficulty in distinguishing similar targets, it brings great challenges to target detection in remote sensing scenarios, which results in poor detection performance. This paper proposes an improved YOLOv5 remote sensing image target detection algorithm, SEB-YOLO (SPD-Conv + ECSPP + Bi-FPN + YOLOv5). Firstly, the space-to-depth (SPD) layer followed by a non-strided convolution (Conv) layer module (SPD-Conv) was used to reconstruct the backbone network, which retained the global features and reduced the feature loss. Meanwhile, the pooling module with the attention mechanism of the final layer of the backbone network was designed to help the network better identify and locate the target. Furthermore, a bidirectional feature pyramid network (Bi-FPN) with bilinear interpolation upsampling was added to improve bidirectional cross-scale connection and weighted feature fusion. Finally, the decoupled head is introduced to enhance the model convergence and solve the contradiction between the classification task and the regression task. Experimental results on NWPU VHR-10 and RSOD datasets show that the mAP of the proposed algorithm reaches 93.5% and 93.9%respectively, which is 4.0% and 5.3% higher than that of the original YOLOv5l algorithm. The proposed algorithm achieves better detection results for complex remote sensing images.

15.

Underwater Rescue Target Detection Based on Acoustic Images.

Hu, Sufeng; Liu, Tao.

Sensors (Basel) ; 24(6)2024 Mar 10.

Artigo em Inglês | MEDLINE | ID: mdl-38544042

RESUMO

In order to effectively respond to floods and water emergencies that result in the drowning of missing persons, timely and effective search and rescue is a very critical step in underwater rescue. Due to the complex underwater environment and low visibility, unmanned underwater vehicles (UUVs) with sonar are more efficient than traditional manual search and rescue methods to conduct active searches using deep learning algorithms. In this paper, we constructed a sound-based rescue target dataset that encompasses both the source and target domains using deep transfer learning techniques. For the underwater acoustic rescue target detection of small targets, which lack image feature accuracy, this paper proposes a two-branch convolution module and improves the YOLOv5s algorithm model to design an acoustic rescue small target detection algorithm model. For an underwater rescue target dataset based on acoustic images with a small sample acoustic dataset, a direct fine-tuning using optical image pre-training lacks cross-domain adaptability due to the different statistical properties of optical and acoustic images. This paper therefore proposes a heterogeneous information hierarchical migration learning method. For the false detection of acoustic rescue targets in a complex underwater background, the network layer is frozen during the hierarchical migration of heterogeneous information to improve the detection accuracy. In addition, in order to be more applicable to the embedded devices carried by underwater UAVs, an underwater acoustic rescue target detection algorithm based on ShuffleNetv2 is proposed to improve the two-branch convolutional module and the backbone network of YOLOv5s algorithm, and to create a lightweight model based on hierarchical migration of heterogeneous information. Through extensive comparative experiments conducted on various acoustic images, we have thoroughly validated the feasibility and effectiveness of our method. Our approach has demonstrated state-of-the-art performance in underwater search and rescue target detection tasks.

16.

Research on Improved Algorithms for Cone Bucket Detection in Formula Unmanned Competition.

Li, Xu; Li, Gang; Zhang, Zhe; Sun, Haosen.

Sensors (Basel) ; 24(18)2024 Sep 13.

Artigo em Inglês | MEDLINE | ID: mdl-39338691

RESUMO

The model network based on YOLOv8 for detecting race cones and buckets in the Formula Unmanned Competition for Chinese university students needs help with problems with complex structure, redundant number of parameters, and computation, significantly affecting detection efficiency. A lightweight detection model based on YOLOv8 is proposed to address these problems. The model includes improving the backbone network, neck network, and detection head, as well as introducing knowledge distillation and other techniques to construct a lightweight model. The specific improvements are as follows: firstly, the backbone network for extracting features is improved by introducing the ADown module in YOLOv9 to replace the convolution module used for downsampling in the YOLOv8 network, and secondly, the FasterBlock in FasterNet network was introduced to replace the fusion module in YOLOv8 C2f, and then the self-developed lightweight detection head was introduced to improve the detection performance while achieving lightweight. Finally, the detection performance was further improved by knowledge distillation. The experimental results on the public dataset FSACOCO show that the improved model's accuracy, recall, and average precision are 92.7%, 84.6%, and 91%, respectively. Compared with the original YOLOv8n detection model, the recall and average precision increase by 2.7 and 1.2 percentage points, the memory is half the original, and the model computation is 51%. The model significantly reduces the misdetection and leakage of conical buckets in real-vehicle tests and, at the same time, ensures the detection speed to satisfy the deployment requirements on tiny devices. Satisfies all the requirements for deployment of tiny devices in the race car of the China University Student Driverless Formula Competition. The improved method in this paper can be applied to conebucket detection in complex scenarios, and the improved idea can be carried over to the detection of other small targets.

17.

RCRFNet: Enhancing Object Detection with Self-Supervised Radar-Camera Fusion and Open-Set Recognition.

Chen, Minwei; Liu, Yajun; Zhang, Zenghui; Guo, Weiwei.

Sensors (Basel) ; 24(15)2024 Jul 24.

Artigo em Inglês | MEDLINE | ID: mdl-39123850

RESUMO

Robust object detection in complex environments, poor visual conditions, and open scenarios presents significant technical challenges in autonomous driving. These challenges necessitate the development of advanced fusion methods for millimeter-wave (mmWave) radar point cloud data and visual images. To address these issues, this paper proposes a radar-camera robust fusion network (RCRFNet), which leverages self-supervised learning and open-set recognition to effectively utilise the complementary information from both sensors. Specifically, the network uses matched radar-camera data through a frustum association approach to generate self-supervised signals, enhancing network training. The integration of global and local depth consistencies between radar point clouds and visual images, along with image features, helps construct object class confidence levels for detecting unknown targets. Additionally, these techniques are combined with a multi-layer feature extraction backbone and a multimodal feature detection head to achieve robust object detection. Experiments on the nuScenes public dataset demonstrate that RCRFNet outperforms state-of-the-art (SOTA) methods, particularly in conditions of low visual visibility and when detecting unknown class objects.

18.

DS-SIAUG: A Self-Training Approach Using a Disrupted Student Model for Enhanced Side-Scan Sonar Image Augmentation.

Peng, Chengyang; Jin, Shaohua; Bian, Gang; Cui, Yang.

Sensors (Basel) ; 24(15)2024 Aug 05.

Artigo em Inglês | MEDLINE | ID: mdl-39124108

RESUMO

Side-scan sonar is a principal technique for subsea target detection, where the quantity of sonar images of seabed targets significantly influences the accuracy of intelligent target recognition. To expand the number of representative side-scan sonar target image samples, a novel augmentation method employing self-training with a Disrupted Student model is designed (DS-SIAUG). The process begins by inputting a dataset of side-scan sonar target images, followed by augmenting the samples through an adversarial network consisting of the DDPM (Denoising Diffusion Probabilistic Model) and the YOLO (You Only Look Once) detection model. Subsequently, the Disrupted Student model is used to filter out representative target images. These selected images are then reused as a new dataset to repeat the adversarial filtering process. Experimental results indicate that using the Disrupted Student model for selection achieves a target recognition accuracy comparable to manual selection, improving the accuracy of intelligent target recognition by approximately 5% over direct adversarial network augmentation.

19.

A New Multi-Branch Convolutional Neural Network and Feature Map Extraction Method for Traffic Congestion Detection.

Jiang, Shan; Feng, Yuming; Zhang, Wei; Liao, Xiaofeng; Dai, Xiangguang; Onasanya, Babatunde Oluwaseun.

Sensors (Basel) ; 24(13)2024 Jul 01.

Artigo em Inglês | MEDLINE | ID: mdl-39001052

RESUMO

With the continuous advancement of the economy and technology, the number of cars continues to increase, and the traffic congestion problem on some key roads is becoming increasingly serious. This paper proposes a new vehicle information feature map (VIFM) method and a multi-branch convolutional neural network (MBCNN) model and applies it to the problem of traffic congestion detection based on camera image data. The aim of this study is to build a deep learning model with traffic images as input and congestion detection results as output. It aims to provide a new method for automatic detection of traffic congestion. The deep learning-based method in this article can effectively utilize the existing massive camera network in the transportation system without requiring too much investment in hardware. This study first uses an object detection model to identify vehicles in images. Then, a method for extracting a VIFM is proposed. Finally, a traffic congestion detection model based on MBCNN is constructed. This paper verifies the application effect of this method in the Chinese City Traffic Image Database (CCTRIB). Compared to other convolutional neural networks, other deep learning models, and baseline models, the method proposed in this paper yields superior results. The method in this article obtained an F1 score of 98.61% and an accuracy of 98.62%. Experimental results show that this method effectively solves the problem of traffic congestion detection and provides a powerful tool for traffic management.

20.

MWIRGas-YOLO: Gas Leakage Detection Based on Mid-Wave Infrared Imaging.

Xu, Shiwei; Wang, Xia; Sun, Qiyang; Dong, Kangjun.

Sensors (Basel) ; 24(13)2024 Jul 04.

Artigo em Inglês | MEDLINE | ID: mdl-39001124

RESUMO

The integration of visual algorithms with infrared imaging technology has become an effective tool for industrial gas leak detection. However, existing research has mostly focused on simple scenarios where a gas plume is clearly visible, with limited studies on detecting gas in complex scenes where target contours are blurred and contrast is low. This paper uses a cooled mid-wave infrared (MWIR) system to provide high sensitivity and fast response imaging and proposes the MWIRGas-YOLO network for detecting gas leaks in mid-wave infrared imaging. This network effectively detects low-contrast gas leakage and segments the gas plume within the scene. In MWIRGas-YOLO, it utilizes the global attention mechanism (GAM) to fully focus on gas plume targets during feature fusion, adds a small target detection layer to enhance information on small-sized targets, and employs transfer learning of similar features from visible light smoke to provide the model with prior knowledge of infrared gas features. Using a cooled mid-wave infrared imager to collect gas leak images, the experimental results show that the proposed algorithm significantly improves the performance over the original model. The segment mean average precision reached 96.1% (mAP50) and 47.6% (mAP50:95), respectively, outperforming the other mainstream algorithms. This can provide an effective reference for research on infrared imaging for gas leak detection.

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA