Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 26
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Cereb Cortex ; 33(19): 10286-10302, 2023 09 26.
Artigo em Inglês | MEDLINE | ID: mdl-37536059

RESUMO

What are the dynamics of global feature-based and spatial attention, when deployed together? In an attentional shifting experiment, flanked by three control experiments, we investigated neural temporal dynamics of combined attentional shifts. For this purpose, orange- and blue-frequency-tagged spatially overlapping Random Dot Kinematograms were presented in the left and right visual hemifield to elicit continuous steady-state-visual-evoked-potentials. After being initially engaged in a fixation cross task, participants were at some point in time cued to shift attention to one of the Random Dot Kinematograms, to detect and respond to brief coherent motion events, while ignoring all such events in other Random Dot Kinematograms. The analysis of steady-state visual-evoked potentials allowed us to map time courses and dynamics of early sensory-gain modulations by attention. This revealed a time-invariant amplification of the to-be attended color both at the attended and the unattended side, followed by suppression for the to-be-ignored color at attended and unattended sides. Across all experiments, global and obligatory feature-based selection dominated early sensory gain modulations, whereas spatial attention played a minor modulatory role. However, analyses of behavior and neural markers such as alpha-band activity and event-related potentials to target- and distractor-event processing, revealed clear modulations by spatial attention.


Assuntos
Eletroencefalografia , Potenciais Evocados Visuais , Humanos , Tempo de Reação/fisiologia , Sinais (Psicologia) , Estimulação Luminosa
2.
BMC Health Serv Res ; 24(1): 194, 2024 Feb 13.
Artigo em Inglês | MEDLINE | ID: mdl-38351077

RESUMO

BACKGROUND: Family doctor contract policy is now run by the State Council as an important move to promote the hierarchical medical system. Whether the family doctor contract policy achieves the initial government's goal should be measured further from the perspective of patient visits between hospitals and community health centers, which are regarded as grass medical agencies. METHODS: The spatial feature measurement method is applied with ArcGIS 10.2 software to analyze the spatial aggregation effect of patient visits to hospitals or community health centers among 20 districts of one large city in China and analyze the family doctor contract policy published in those areas to compare the influence of visit tendencies. RESULTS: From year 2016-2020, visits to hospitals were in the high-high cluster, and the density was spatially overflow, while there was no such tendency in visits to community health centers. The analysis of different family doctor contract policy implementation times in 20 districts reflects that the family doctor contract policy has a very limited effect on the promotion of the hierarchical medical system, and the innovation of the family doctor contract policy needs to be considered. CONCLUSIONS: A brief summary and potential implications. A multi-integrated medical system along with family doctor contract policy needs to be established, especially integrated in leadership and governance, financing, workforce, and service delivery between hospitals and community health centers, to promote the hierarchical medical system.


Assuntos
Atenção à Saúde , Médicos de Família , Humanos , Aceitação pelo Paciente de Cuidados de Saúde , Serviços Contratados , Política de Saúde , China
3.
Sensors (Basel) ; 24(4)2024 Feb 16.
Artigo em Inglês | MEDLINE | ID: mdl-38400424

RESUMO

Car-sharing systems require accurate demand prediction to ensure efficient resource allocation and scheduling decisions. However, developing precise predictive models for vehicle demand remains a challenging problem due to the complex spatio-temporal relationships. This paper introduces USTIN, the Unified Spatio-Temporal Inference Prediction Network, a novel neural network architecture for demand prediction. The model consists of three key components: a temporal feature unit, a spatial feature unit, and a spatio-temporal feature unit. The temporal unit utilizes historical demand data and comprises four layers, each corresponding to a different time scale (hourly, daily, weekly, and monthly). Meanwhile, the spatial unit incorporates contextual points of interest data to capture geographic demand factors around parking stations. Additionally, the spatio-temporal unit incorporates weather data to model the meteorological impacts across locations and time. We conducted extensive experiments on real-world car-sharing data. The proposed USTIN model demonstrated its ability to effectively learn intricate temporal, spatial, and spatiotemporal relationships, and outperformed existing state-of-the-art approaches. Moreover, we employed negative binomial regression with uncertainty to identify the most influential factors affecting car usage.

4.
Sensors (Basel) ; 23(2)2023 Jan 12.
Artigo em Inglês | MEDLINE | ID: mdl-36679709

RESUMO

Land surface temperatures (LST) are vital parameters in land surface-atmosphere interactions. Constrained by technology and atmospheric interferences, LST retrievals from various satellite sensors usually return missing data, thus negatively impacting analyses. Reconstructing missing data is important for acquiring gap-free datasets. However, the current reconstruction methods are limited for maintaining spatial details and high accuracies. We developed a new gap-free algorithm termed the spatial feature-considered random forest regression (SFRFR) model; it builds stable nonlinear relationships to connect the LST with related parameters, including terrain elements, land coverage types, spectral indexes, surface reflectance data, and the spatial feature of the LST, to reconstruct the missing LST data. The SFRFR model reconstructed gap-free LST data retrieved from the Landsat 8 satellite on 27 July 2017 in Wuhan. The results show that the SFRFR model exhibits the best performance according to the various evaluation metrics among the SFRFR, random forest regression and spline interpolation, with a coefficient of determination (R2) reaching 0.96, root-mean-square error (RMSE) of 0.55, and mean absolute error (MAE) of 0.55. Then, we reconstructed gap-free LST data gathered in Wuhan from 2016 to 2021 to analyze urban thermal environment changes and found that 2020 presented the coolest temperatures. The SFRFR model still displayed satisfactory results, with an average R2 of 0.91 and an MAE of 0.63. We further discuss and discover the factors affecting the visual performance of SFRFR and identify the research priority to circumvent these disadvantages. Overall, this study provides a simple, practical method for acquiring gap-free LST data to help us better understand the spatiotemporal LST variation process.


Assuntos
Algoritmos , Monitoramento Ambiental , Temperatura , Monitoramento Ambiental/métodos , Cidades , China
5.
J Digit Imaging ; 36(3): 932-946, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-36720840

RESUMO

Breast cancer is one of the most dangerous and common cancers in women which leads to a major research topic in medical science. To assist physicians in pre-screening for breast cancer to reduce unnecessary biopsies, breast ultrasound and computer-aided diagnosis (CAD) have been used to distinguish between benign and malignant tumors. In this study, we proposed a CAD system for tumor diagnosis using a multi-channel fusion method and feature extraction structure based on multi-feature fusion on breast ultrasound (BUS) images. In the pre-processing stage, the multi-channel fusion method completed the color conversion of the BUS image to make it contain richer information. In the feature extraction stage, the pre-trained ResNet50 network was selected as the basic network, and three levels of features were combined based on adaptive spatial feature fusion (ASFF), and finally, the shallow local binary pattern (LBP) texture features were fused. Support vector machine (SVM) was used for comparative analysis. A retrospective analysis was carried out, and 1615 breast tumor images (572 benign and 1043 malignant) confirmed by pathological examinations were collected. After data processing and augmentation, for an independent test set consisting of 874 breast ultrasound images (457 benign and 417 malignant), the accuracy, precision, recall, specificity, F1 score, and AUC of our method were 96.91%, 98.75%, 94.72%, 98.91%, 0.97, and 0.991, respectively. The results show that the integration of shallow LBP texture features and multi-level depth features can more effectively improve the comprehensive performance of breast tumor diagnosis, and has strong clinical application value. Compared with the past methods, our proposed method is expected to realize the automatic diagnosis of breast tumors and provide an auxiliary tool for radiologists to accurately diagnose breast diseases.


Assuntos
Neoplasias da Mama , Mama , Feminino , Humanos , Estudos Retrospectivos , Mama/diagnóstico por imagem , Ultrassonografia , Neoplasias da Mama/diagnóstico por imagem , Neoplasias da Mama/patologia , Redes Neurais de Computação
6.
Sensors (Basel) ; 22(12)2022 Jun 09.
Artigo em Inglês | MEDLINE | ID: mdl-35746154

RESUMO

Scene text detection task aims to precisely localize text in natural environments. At present, the application scenarios of text detection topics have gradually shifted from plain document text to more complex natural scenarios. Objects with similar texture and text morphology in the complex background noise of natural scene images are prone to false recall and difficult to detect multi-scale texts, a multi-directional scene Uyghur text detection model based on fine-grained feature representation and spatial feature fusion is proposed, and feature extraction and feature fusion are improved to enhance the network's ability to represent multi-scale features. In this method, the multiple groups of 3 × 3 convolutional feature groups that are connected like the hierarchical residual to build a residual network for feature extraction, which captures the feature details and increases the receptive field of the network to adapt to multi-scale text and long glued dimensional font detection and suppress false positives of text-like objects. Secondly, an adaptive multi-level feature map fusion strategy is adopted to overcome the inconsistency of information in multi-scale feature map fusion. The proposed model achieves 93.94% and 84.92% F-measure on the self-built Uyghur dataset and the ICDAR2015 dataset, respectively, which improves the accuracy of Uyghur text detection and suppresses false positives.


Assuntos
Algoritmos
7.
Cereb Cortex ; 30(12): 6481-6489, 2020 11 03.
Artigo em Inglês | MEDLINE | ID: mdl-32770201

RESUMO

Our aim is to explore the spatial and temporal features of anorexia nervosa (AN) and obsessive-compulsive disorder (OCD) considering different brain regions and development stages. The gene sets related to 16 brain regions and nine development stages were obtained from a brain spatial and temporal transcriptomic dataset. Using the genome-wide association study data, transcriptome-wide association study (TWAS) was conducted to identify the genes whose imputed expressions were associated with AN and OCD, respectively. The mRNA expression profiles were analyzed by GEO2R to obtain differentially expressed genes. Gene set enrichment analysis was conducted to detect the spatial and temporal features related to AN and OCD using the TWAS and mRNA expression analysis results. We observed multiple common association signals shared by TWAS and mRNA expression analysis of AN, such as the primary auditory cortex vs. cerebellar cortex in fetal development and earlier vs. later fetal development in the somatosensory cortex. For OCD, we also detected multiple common association signals, such as medial prefrontal cortex vs. amygdala in adulthood and fetal development vs. infancy in mediodorsal nucleus of thalamus. Our study provides novel clues for describing the spatial and temporal features of brain development in the pathogenesis of AN and OCD.


Assuntos
Anorexia Nervosa/genética , Anorexia Nervosa/fisiopatologia , Encéfalo/crescimento & desenvolvimento , Regulação da Expressão Gênica no Desenvolvimento , Transtorno Obsessivo-Compulsivo/genética , Transtorno Obsessivo-Compulsivo/fisiopatologia , Estudo de Associação Genômica Ampla , Genômica , Humanos , Transcriptoma
8.
Sensors (Basel) ; 22(1)2021 Dec 28.
Artigo em Inglês | MEDLINE | ID: mdl-35009735

RESUMO

Achieving high-performance numerical weather prediction (NWP) is important for people's livelihoods and for socioeconomic development. However, NWP is obtained by solving differential equations with globally observed data without capturing enough local and spatial information at the observed station. To improve the forecasting performance, we propose a novel spatial lightGBM (Light Gradient Boosting Machine) model to correct the numerical forecast results at each observation station. By capturing the local spatial information of stations and using a single-station single-time strategy, the proposed method can incorporate the observed data and model data to achieve high-performance correction of medium-range predictions. Experimental results for temperature and wind prediction in Hainan Province show that the proposed correction method performs well compared with the ECWMF model and outperforms other competing methods.


Assuntos
Tempo (Meteorologia) , Vento , Previsões , Humanos , Temperatura , Tempo
9.
Sensors (Basel) ; 21(9)2021 Apr 29.
Artigo em Inglês | MEDLINE | ID: mdl-33946998

RESUMO

Research on the human activity recognition could be utilized for the monitoring of elderly people living alone to reduce the cost of home care. Video sensors can be easily deployed in the different zones of houses to achieve monitoring. The goal of this study is to employ a linear-map convolutional neural network (CNN) to perform action recognition with RGB videos. To reduce the amount of the training data, the posture information is represented by skeleton data extracted from the 300 frames of one film. The two-stream method was applied to increase the accuracy of recognition by using the spatial and motion features of skeleton sequences. The relations of adjacent skeletal joints were employed to build the direct acyclic graph (DAG) matrices, source matrix, and target matrix. Two features were transferred by DAG matrices and expanded as color texture images. The linear-map CNN had a two-dimensional linear map at the beginning of each layer to adjust the number of channels. A two-dimensional CNN was used to recognize the actions. We applied the RGB videos from the action recognition datasets of the NTU RGB+D database, which was established by the Rapid-Rich Object Search Lab, to execute model training and performance evaluation. The experimental results show that the obtained precision, recall, specificity, F1-score, and accuracy were 86.9%, 86.1%, 99.9%, 86.3%, and 99.5%, respectively, in the cross-subject source, and 94.8%, 94.7%, 99.9%, 94.7%, and 99.9%, respectively, in the cross-view source. An important contribution of this work is that by using the skeleton sequences to produce the spatial and motion features and the DAG matrix to enhance the relation of adjacent skeletal joints, the computation speed was faster than the traditional schemes that utilize single frame image convolution. Therefore, this work exhibits the practical potential of real-life action recognition.


Assuntos
Algoritmos , Redes Neurais de Computação , Idoso , Bases de Dados Factuais , Atividades Humanas , Humanos , Esqueleto
10.
Sensors (Basel) ; 22(1)2021 Dec 24.
Artigo em Inglês | MEDLINE | ID: mdl-35009657

RESUMO

Passive radars based on long-term evolution (LTE) signals suffer from sever interferences. The interferences are not only from the base station used as the illuminator of opportunity (BS-IoO), but also from the other co-channel base stations (CCBS) working at the same frequency with the BS-IoO. Because the reference signals of the co-channel interferences are difficult to obtain, cancellation performance degrades seriously when traditional interference suppression methods are applied in LTE-based passive radar. This paper proposes a cascaded cancellation method based on the spatial spectrum cognition of interference. It consists of several cancellation loops. In each loop, the spatial spectrum of strong interferences is first recognized by using the cyclostationary characteristic of LTE signal and the compressed sensing technique. A clean reference signal of each interference is then reconstructed according to the spatial spectrum previously obtained. With the reference signal, the interferences are cancelled. At the end of each loop, the energy of the interference residual is estimated. If the interference residual is still strong, then the cancellation loop continues; otherwise it terminates. The proposed method can get good cancellation performance with a small-sized antenna array. Theoretical and simulation results demonstrate the effectiveness of the proposed method.

11.
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi ; 38(1): 1-9, 2021 Feb 25.
Artigo em Zh | MEDLINE | ID: mdl-33899422

RESUMO

With the advantage of providing more natural and flexible control manner, brain-computer interface systems based on motor imagery electroencephalogram (EEG) have been widely used in the field of human-machine interaction. However, due to the lower signal-noise ratio and poor spatial resolution of EEG signals, the decoding accuracy is relative low. To solve this problem, a novel convolutional neural network based on temporal-spatial feature learning (TSCNN) was proposed for motor imagery EEG decoding. Firstly, for the EEG signals preprocessed by band-pass filtering, a temporal-wise convolution layer and a spatial-wise convolution layer were respectively designed, and temporal-spatial features of motor imagery EEG were constructed. Then, 2-layer two-dimensional convolutional structures were adopted to learn abstract features from the raw temporal-spatial features. Finally, the softmax layer combined with the fully connected layer were used to perform decoding task from the extracted abstract features. The experimental results of the proposed method on the open dataset showed that the average decoding accuracy was 80.09%, which is approximately 13.75% and 10.99% higher than that of the state-of-the-art common spatial pattern (CSP) + support vector machine (SVM) and filter bank CSP (FBCSP) + SVM recognition methods, respectively. This demonstrates that the proposed method can significantly improve the reliability of motor imagery EEG decoding.


Assuntos
Interfaces Cérebro-Computador , Imaginação , Algoritmos , Eletroencefalografia , Humanos , Redes Neurais de Computação , Reprodutibilidade dos Testes
12.
Sensors (Basel) ; 19(11)2019 May 30.
Artigo em Inglês | MEDLINE | ID: mdl-31151259

RESUMO

Synthetic Aperture Radar (SAR) scene classification is challenging but widely applied, in which deep learning can play a pivotal role because of its hierarchical feature learning ability. In the paper, we propose a new scene classification framework, named Feature Recalibration Network with Multi-scale Spatial Features (FRN-MSF), to achieve high accuracy in SAR-based scene classification. First, a Multi-Scale Omnidirectional Gaussian Derivative Filter (MSOGDF) is constructed. Then, Multi-scale Spatial Features (MSF) of SAR scenes are generated by weighting MSOGDF, a Gray Level Gradient Co-occurrence Matrix (GLGCM) and Gabor transformation. These features were processed by the Feature Recalibration Network (FRN) to learn high-level features. In the network, the Depthwise Separable Convolution (DSC), Squeeze-and-Excitation (SE) Block and Convolution Neural Network (CNN) are integrated. Finally, these learned features will be classified by the Softmax function. Eleven types of SAR scenes obtained from four systems combining different bands and resolutions were trained and tested, and a mean accuracy of 98.18% was obtained. To validate the generality of FRN-MSF, five types of SAR scenes sampled from two additional large-scale Gaofen-3 and TerraSAR-X images were evaluated for classification. The mean accuracy of the five types reached 94.56%; while the mean accuracy for the same five types of the former tested 11 types of scene was 96%. The high accuracy indicates that the FRN-MSF is promising for SAR scene classification without losing generality.

13.
Sensors (Basel) ; 18(6)2018 Jun 20.
Artigo em Inglês | MEDLINE | ID: mdl-29925817

RESUMO

Recently, image-filtering based hyperspectral image (HSI) feature extraction has been widely studied. However, due to limited spatial resolution and feature distribution complexity, the problems of cross-region mixing after filtering and spectral discriminative reduction still remain. To address these issues, this paper proposes a spectral-spatial propagation filter (PF) based HSI feature extraction method that can effectively address the above problems. The dimensionality/band of an HSI is typically high; therefore, principal component analysis (PCA) is first used to reduce the HSI dimensionality. Then, the principal components of the HSI are filtered with the PF. When cross-region mixture occurs in the image, the filter template reduces the weight assignments of the cross-region mixed pixels to handle the issue of cross-region mixed pixels simply and effectively. To validate the effectiveness of the proposed method, experiments are carried out on three common HSIs using support vector machine (SVM) classifiers with features learned by the PF. The experimental results demonstrate that the proposed method effectively extracts the spectral-spatial features of HSIs and significantly improves the accuracy of HSI classification.

14.
Environ Monit Assess ; 190(4): 260, 2018 Mar 30.
Artigo em Inglês | MEDLINE | ID: mdl-29603019

RESUMO

The analysis of a large number of multidimensional surface water monitoring data for extracting potential information plays an important role in water quality management. In this study, growing hierarchical self-organizing map (GHSOM) was applied to a water quality assessment of the Songhua River Basin in China using 22 water quality parameters monitored monthly from 13 monitoring sites from 2011 to 2015 (14,782 observations). The spatial and temporal features and correlation between the water quality parameters were explored, and the major contaminants were identified. The results showed that the downstream of the Second Songhua River had the worst water quality of the Songhua River Basin. The upstream and midstream of Nenjiang River and the Second Songhua River had the best. The major contaminants of the Songhua River were chemical oxygen demand (COD), ammonia nitrogen (NH3-N), total phosphorus (TP), and fecal coliform (FC). In the Songhua River, the water pollution at downstream has been gradually eased in years. However, FC and biochemical oxygen demand (BOD5) showed growth over time. The component planes showed that three sets of parameters had positive correlations with each other. GHSOM was found to have advantages over self-organizing maps and hierarchical clustering analysis as follows: (1) automatically generating the necessary neurons, (2) intuitively exhibiting the hierarchical inheritance relationship between the original data, and (3) depicting the boundaries of the classification much more clearly. Therefore, the application of GHSOM in water quality assessments, especially with large amounts of monitoring data, enables the extraction of more information and provides strong support for water quality management.


Assuntos
Monitoramento Ambiental/métodos , Rios/química , Poluentes Químicos da Água/análise , Análise da Demanda Biológica de Oxigênio , China , Análise por Conglomerados , Fósforo/análise , Poluição da Água/análise , Poluição da Água/estatística & dados numéricos , Qualidade da Água/normas
15.
Med Biol Eng Comput ; 62(2): 591-603, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-37953335

RESUMO

Decision-making plays a critical role in an individual's interpersonal interactions and cognitive processes. Due to the issue of strong subjectivity in the classification research of art design decisions, we utilize the relatively objective electroencephalogram (EEG) to explore design decision problems. However, different regions of the brain do not have the same influence on the design decision classification, so this paper proposes a spatial feature based convolutional neural network (space-CNN) to explore the problem of decision classification of EEG signals from different regions. We recruit 16 subjects to collect their EEG data while viewing four stimulation patterns. After noise reduction of the raw data by discrete wavelet transform (DWT), the EEG image is generated by combining it with the spatial features of the EEG signal, which is used as the input of CNN. Our experimental results show that the degree of influence of different brain regions on decision-making is parietal lobe > frontal lobe > occipital lobe > temporal lobe. In addition, the average accuracy of space-CNN reaches 86.13%, which is about 6% higher than similar studies.


Assuntos
Encéfalo , Redes Neurais de Computação , Humanos , Análise de Ondaletas , Eletroencefalografia/métodos , Lobo Occipital , Algoritmos
16.
Front Neurorobot ; 18: 1351939, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38352724

RESUMO

The transportation of hazardous chemicals on roadways has raised significant safety concerns. Incidents involving these substances often lead to severe and devastating consequences. Consequently, there is a pressing need for real-time detection systems tailored for hazardous material vehicles. However, existing detection methods face challenges in accurately identifying smaller targets and achieving high precision. This paper introduces a novel solution, HMV-YOLO, an enhancement of the YOLOv7-tiny model designed to address these challenges. Within this model, two innovative modules, CBSG and G-ELAN, are introduced. The CBSG module's mathematical model incorporates components such as Convolution (Conv2d), Batch Normalization (BN), SiLU activation, and Global Response Normalization (GRN) to mitigate feature collapse issues and enhance neuron activity. The G-ELAN module, building upon CBSG, further advances feature fusion. Experimental results showcase the superior performance of the enhanced model compared to the original one across various evaluation metrics. This advancement shows great promise for practical applications, particularly in the context of real-time monitoring systems for hazardous material vehicles.

17.
Mar Pollut Bull ; 198: 115874, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-38056290

RESUMO

The oil spill accidents on the sea surface pose a severe threat to the marine environment and human health. This paper proposes a novel Semantic Segmentation Network (SSN) for processing oil spill images so that low-contrast oil spills on the sea surface can be accurately identified. After the detection accuracy and real-time performance of the current SSNs are compared, the basic network architecture of DeeplabV3+ based target detection is analyzed. The standard convolution is replaced by the Omni-dimensional Dynamic Convolution (ODConv) in the Ghost Module Depth-Wise separable Convolution (DWConv) to further enhance the feature extraction ability of the network. Furthermore, a new DeeplabV3+ based network with ODGhostNetV2 is constructed as the main feature extraction module, and an Adaptive Triplet Attention (ATA) module is deployed in the encoder and decoder at the same time. This not only improves the richness of semantic features but also increases the following receptive fields of the network model. ATA integrates the Adaptively Spatial Feature Fusion (ASFF) module to optimize the weight assignment problem in the feature map fusion process. The ablation experiments are conducted to verify the proposed network which show high accuracy and good real-time performance for the oil spill detection.


Assuntos
Poluição por Petróleo , Humanos , Semântica , Oceanos e Mares
18.
Animals (Basel) ; 14(14)2024 Jul 21.
Artigo em Inglês | MEDLINE | ID: mdl-39061590

RESUMO

The cultivation of the Chinese mitten crab (Eriocheir sinensis) is an important component of China's aquaculture industry and also a field of concern worldwide. It focuses on the selection of high-quality, disease-free juvenile crabs. However, the early maturity rate of more than 18.2% and the mortality rate of more than 60% make it difficult to select suitable juveniles for adult culture. The juveniles exhibit subtle distinguishing features, and the methods for differentiating between sexes vary significantly; without training from professional breeders, it is challenging for laypersons to identify and select the appropriate juveniles. Therefore, we propose a task-aligned detection algorithm for identifying one-year-old precocious Chinese mitten crabs, named R-TNET. Initially, the required images were obtained by capturing key frames, and then they were annotated and preprocessed by professionals to build a training dataset. Subsequently, the ResNeXt network was selected as the backbone feature extraction network, with Convolutional Block Attention Modules (CBAMs) and a Deformable Convolution Network (DCN) embedded in its residual blocks to enhance its capability to extract complex features. Adaptive spatial feature fusion (ASFF) was then integrated into the feature fusion network to preserve the detailed features of small targets such as one-year-old precocious Chinese mitten crab juveniles. Finally, based on the detection head proposed by task-aligned one-stage object detection, the parameters of its anchor alignment metric were adjusted to detect, locate, and classify the crab juveniles. The experimental results showed that this method achieves a mean average precision (mAP) of 88.78% and an F1-score of 97.89%. This exceeded the best-performing mainstream object detection algorithm, YOLOv7, by 4.17% in mAP and 1.77% in the F1-score. Ultimately, in practical application scenarios, the algorithm effectively identified one-year-old precocious Chinese mitten crabs, providing technical support for the automated selection of high-quality crab juveniles in the cultivation process, thereby promoting the rapid development of aquaculture and agricultural intelligence in China.

19.
Neural Netw ; 161: 39-54, 2023 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-36735999

RESUMO

Spatial boundary effect can significantly reduce the performance of a learned discriminative correlation filter (DCF) model. A commonly used method to relieve this effect is to extract appearance features from a wider region of a target. However, this way would introduce unexpected features from background pixels and noises, which will lead to a decrease of the filter's discrimination power. To address this shortcoming, this paper proposes an innovative method called enhanced robust spatial feature selection and correlation filter Learning (EFSCF), which performs jointly sparse feature learning to handle boundary effects effectively while suppressing the influence of background pixels and noises. Unlike the ℓ2-norm-based tracking approaches that are prone to non-Gaussian noises, the proposed method imposes the ℓ2,1-norm on the loss term to enhance the robustness against the training outliers. To enhance the discrimination further, a jointly sparse feature selection scheme based on the ℓ2,1 -norm is designed to regularize the filter in rows and columns simultaneously. To the best of the authors' knowledge, this has been the first work exploring the structural sparsity in rows and columns of a learned filter simultaneously. The proposed model can be efficiently solved by an alternating direction multiplier method. The proposed EFSCF is verified by experiments on four challenging unmanned aerial vehicle datasets under severe noise and appearance changes, and the results show that the proposed method can achieve better tracking performance than the state-of-the-art trackers.


Assuntos
Conhecimento , Aprendizagem
20.
PeerJ Comput Sci ; 9: e1302, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37346580

RESUMO

The residual structure has an important influence on the design of the neural network model. The neural network model based on residual structure has excellent performance in computer vision tasks. However, the performance of classical residual networks is restricted by the size of receptive fields, channel information, spatial information and other factors. In this article, a novel residual structure is proposed. We modify the identity mapping and down-sampling block to get greater effective receptive field, and its excellent performance in channel information fusion and spatial feature extraction is verified by ablation studies. In order to further verify its feature extraction capability, a non-deep convolutional neural network (CNN) was designed and tested on Cifar10 and Cifar100 benchmark platforms using a naive training method. Our network model achieves better performance than other mainstream networks under the same training parameters, the accuracy we achieved is 3.08 percentage point higher than ResNet50 and 1.38 percentage points higher than ResNeXt50. Compared with SeResNet152, it is 0.29 percentage point higher in the case of 50 epochs less training.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA