Results 1 - 3 of 3
1.
J Imaging; 9(7), 2023 Jun 26.
Article in English | MEDLINE | ID: mdl-37504806

ABSTRACT

Rain can have a detrimental effect on optical components, leading to streaks and halos in images captured under rainy conditions. These visual distortions caused by rain and mist introduce significant noise that can compromise image quality. In this paper, we propose a novel approach for removing both streaks and halos from an image simultaneously to produce clear results. First, based on the principle of atmospheric scattering, a rain-and-mist model is proposed to initially remove the streaks and halos by reconstructing the image. The Deep Memory Block (DMB) selectively extracts the rain-layer transfer spectrum and the mist-layer transfer spectrum from the rainy image to separate these layers. Then, the Multi-scale Convolution Block (MCB) receives the reconstructed images and extracts both structural and detailed features to enhance the overall accuracy and robustness of the model. Finally, extensive experiments demonstrate that our proposed model, JDDN (Joint De-rain and De-mist Network), outperforms current state-of-the-art deep learning methods on both synthetic and real-world datasets, with an average improvement of 0.29 dB on the heavy-rain image dataset.
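
The abstract describes the reconstruction only at a high level. As a rough illustration of the general idea, the PyTorch sketch below combines an atmospheric-scattering formulation (rainy image O = J·t + A·(1 − t) + R, with transmission map t, atmospheric light A and rain layer R) with simple stand-ins for the DMB and MCB; the layer sizes, scales and internal structure are assumptions for illustration, not the published JDDN architecture.

```python
import torch
import torch.nn as nn


class LayerEstimator(nn.Module):
    """Stand-in for the Deep Memory Block: predicts the rain layer R and the
    mist transmission map t from the rainy input (architecture assumed)."""
    def __init__(self, channels=32):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, channels, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(inplace=True),
        )
        self.rain_head = nn.Conv2d(channels, 3, 3, padding=1)    # rain streak layer R
        self.trans_head = nn.Conv2d(channels, 1, 3, padding=1)   # transmission map t

    def forward(self, x):
        f = self.features(x)
        rain = self.rain_head(f)
        trans = torch.sigmoid(self.trans_head(f)).clamp(min=0.05)  # avoid division by ~0
        return rain, trans


class MultiScaleRefiner(nn.Module):
    """Stand-in for the Multi-scale Convolution Block: refines the coarse
    reconstruction with parallel 3x3/5x5/7x7 branches (scales assumed)."""
    def __init__(self, channels=16):
        super().__init__()
        self.branches = nn.ModuleList(
            nn.Conv2d(3, channels, k, padding=k // 2) for k in (3, 5, 7)
        )
        self.fuse = nn.Conv2d(3 * channels, 3, 1)

    def forward(self, x):
        feats = torch.cat([torch.relu(b(x)) for b in self.branches], dim=1)
        return x + self.fuse(feats)  # residual refinement


class JDDNSketch(nn.Module):
    """Very rough stand-in for the overall pipeline: estimate the layers,
    invert the imaging model, then refine (A is taken as a constant here)."""
    def __init__(self, atmospheric_light=1.0):
        super().__init__()
        self.estimator = LayerEstimator()
        self.refiner = MultiScaleRefiner()
        self.A = atmospheric_light

    def forward(self, rainy):
        rain, trans = self.estimator(rainy)
        coarse = (rainy - rain - self.A * (1 - trans)) / trans  # invert O = J*t + A*(1-t) + R
        return self.refiner(coarse)


if __name__ == "__main__":
    out = JDDNSketch()(torch.rand(1, 3, 64, 64))
    print(out.shape)  # torch.Size([1, 3, 64, 64])
```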

2.
Sensors (Basel); 20(18), 2020 Sep 15.
Article in English | MEDLINE | ID: mdl-32942720

ABSTRACT

Language-based person search retrieves images of a target person from a natural-language description and is a challenging fine-grained cross-modal retrieval task. A novel hybrid attention network is proposed for this task, comprising three components. First, a cubic attention mechanism for the person image combines cross-layer spatial attention with channel attention; it fully exploits both important mid-level details and key high-level semantics to obtain a more discriminative fine-grained feature representation of the person image. Second, a text attention network for the language description, based on a bidirectional LSTM (BiLSTM) and a self-attention mechanism, learns bidirectional semantic dependencies and captures the key words of sentences, so that the contextual information and key semantic features of the description are extracted more effectively and accurately. Third, a cross-modal attention mechanism and a joint loss function for cross-modal learning focus on the relevant parts shared by text and image features, better exploiting both cross-modal and intra-modal correlation and alleviating cross-modal heterogeneity. Extensive experiments conducted on the CUHK-PEDES dataset show that our approach outperforms state-of-the-art methods, demonstrating its advantage.


Subject(s)
Language; Machine Learning; Neural Networks, Computer; Humans; Natural Language Processing; Semantics
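
As a rough illustration of the kind of pipeline described in this abstract, the sketch below pairs a small CNN image branch with SE-style channel attention against a BiLSTM-plus-self-attention text branch, and aligns them in a shared embedding space with a bidirectional ranking loss. The backbone, embedding size and loss are illustrative assumptions, not the paper's actual network or joint loss function.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class ImageBranch(nn.Module):
    def __init__(self, dim=256):
        super().__init__()
        self.backbone = nn.Sequential(                 # tiny CNN stand-in
            nn.Conv2d(3, 64, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        self.channel_att = nn.Sequential(              # SE-style channel attention (assumed form)
            nn.AdaptiveAvgPool2d(1), nn.Conv2d(128, 128, 1), nn.Sigmoid()
        )
        self.proj = nn.Linear(128, dim)

    def forward(self, img):
        f = self.backbone(img)
        f = f * self.channel_att(f)                    # reweight informative channels
        return F.normalize(self.proj(f.mean(dim=(2, 3))), dim=-1)


class TextBranch(nn.Module):
    def __init__(self, vocab=10000, dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab, 128)
        self.bilstm = nn.LSTM(128, dim // 2, batch_first=True, bidirectional=True)
        self.att = nn.Linear(dim, 1)                   # self-attention over words
        self.proj = nn.Linear(dim, dim)

    def forward(self, tokens):
        h, _ = self.bilstm(self.embed(tokens))         # [B, T, dim]
        w = torch.softmax(self.att(h), dim=1)          # word importance weights
        return F.normalize(self.proj((w * h).sum(dim=1)), dim=-1)


def matching_loss(img_emb, txt_emb, margin=0.2):
    """Bidirectional triplet-style ranking loss (one common choice)."""
    sim = img_emb @ txt_emb.t()                        # cosine similarities
    pos = sim.diag().unsqueeze(1)
    cost_im = (margin + sim - pos).clamp(min=0)        # image -> wrong text
    cost_tx = (margin + sim - pos.t()).clamp(min=0)    # text -> wrong image
    eye = torch.eye(sim.size(0), dtype=torch.bool)
    return cost_im.masked_fill(eye, 0).mean() + cost_tx.masked_fill(eye, 0).mean()


if __name__ == "__main__":
    imgs, toks = torch.rand(4, 3, 128, 64), torch.randint(0, 10000, (4, 20))
    loss = matching_loss(ImageBranch()(imgs), TextBranch()(toks))
    print(float(loss))
```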
3.
Sensors (Basel); 20(3), 2020 Feb 03.
Article in English | MEDLINE | ID: mdl-32028568

ABSTRACT

Owing to its important role in video surveillance, pedestrian attribute recognition has become an attractive facet of computer vision research. Changes in viewpoint, illumination, resolution and occlusion make the task very challenging. Existing pedestrian attribute recognition methods perform unsatisfactorily because they ignore the correlation between pedestrian attributes and spatial information; to address this, the task is regarded in this paper as a spatiotemporal, sequential, multi-label image classification problem. An attention-based neural network consisting of a convolutional neural network (CNN), channel attention (CAtt) and convolutional long short-term memory (ConvLSTM) is proposed (CNN-CAtt-ConvLSTM). First, the salient and correlated visual features of pedestrian attributes are extracted by the pre-trained CNN and CAtt. Then, ConvLSTM is used to further extract spatial information and correlations among pedestrian attributes. Finally, pedestrian attributes are predicted in an optimized order based on attribute image-area size and importance. Extensive experiments are carried out on two common pedestrian attribute datasets, the PEdesTrian Attribute (PETA) dataset and the Richly Annotated Pedestrian (RAP) dataset, and higher performance than other state-of-the-art (SOTA) methods is achieved, which demonstrates the superiority and validity of our method.


Subject(s)
Attention/physiology; Pattern Recognition, Automated/methods; Pedestrians; Recognition, Psychology/physiology; Algorithms; Biometric Identification; Humans; Image Processing, Computer-Assisted; Memory, Long-Term/physiology; Neural Networks, Computer; Video Recording
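
To make the CNN-CAtt-ConvLSTM idea concrete, the sketch below chains a tiny CNN, an SE-style channel-attention module and a basic ConvLSTM cell that is stepped once per attribute in a fixed prediction order. The feature extractor, attribute count and cell sizes are illustrative assumptions, not the configuration evaluated on PETA and RAP.

```python
import torch
import torch.nn as nn


class ConvLSTMCell(nn.Module):
    """Basic ConvLSTM cell: an LSTM whose gates are computed by convolutions,
    so spatial structure is kept across recurrence steps."""
    def __init__(self, in_ch, hid_ch, k=3):
        super().__init__()
        self.hid_ch = hid_ch
        self.gates = nn.Conv2d(in_ch + hid_ch, 4 * hid_ch, k, padding=k // 2)

    def forward(self, x, state):
        h, c = state
        i, f, o, g = torch.chunk(self.gates(torch.cat([x, h], dim=1)), 4, dim=1)
        c = torch.sigmoid(f) * c + torch.sigmoid(i) * torch.tanh(g)
        h = torch.sigmoid(o) * torch.tanh(c)
        return h, c


class AttributeRecognizer(nn.Module):
    def __init__(self, num_attrs=8, feat_ch=64, hid_ch=64):
        super().__init__()
        self.cnn = nn.Sequential(              # stand-in for a pre-trained CNN backbone
            nn.Conv2d(3, feat_ch, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat_ch, feat_ch, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        self.channel_att = nn.Sequential(      # channel attention (CAtt), SE-style assumption
            nn.AdaptiveAvgPool2d(1), nn.Conv2d(feat_ch, feat_ch, 1), nn.Sigmoid()
        )
        self.cell = ConvLSTMCell(feat_ch, hid_ch)
        self.heads = nn.ModuleList(nn.Linear(hid_ch, 1) for _ in range(num_attrs))

    def forward(self, img):
        f = self.cnn(img)
        f = f * self.channel_att(f)            # emphasise informative channels
        b, _, hgt, wid = f.shape
        h = f.new_zeros(b, self.cell.hid_ch, hgt, wid)
        c = torch.zeros_like(h)
        logits = []
        for head in self.heads:                # one recurrence step per attribute,
            h, c = self.cell(f, (h, c))        # following a fixed prediction order
            logits.append(head(h.mean(dim=(2, 3))))
        return torch.cat(logits, dim=1)        # [B, num_attrs] attribute logits


if __name__ == "__main__":
    probs = torch.sigmoid(AttributeRecognizer()(torch.rand(2, 3, 128, 64)))
    print(probs.shape)  # torch.Size([2, 8])
```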