Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 2 de 2
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Opt Express ; 31(2): 1737-1754, 2023 Jan 16.
Artículo en Inglés | MEDLINE | ID: mdl-36785202

RESUMEN

Incomplete optical distortion correction in VR HMDs leads to spatial dynamic distortion, which is a potential cause of VIMS. A perception experiment is designed for the investigation with three spatial distortion levels, with the subjective SSQ, five-scale VIMS level rating, and objective postural instability adopted as the evaluation metrics. The results show that the factor of spatial distortion level has a significant effect on all metrics increments (p<0.05). As the spatial distortion level drops off, the increments of VIMS symptoms decrease. The study highlights the importance of perfect spatial distortion correction in VR HMDs for eliminating the potential VIMS aggravation effect.

2.
IEEE Trans Pattern Anal Mach Intell ; 44(3): 1670-1684, 2022 03.
Artículo en Inglés | MEDLINE | ID: mdl-32956036

RESUMEN

Visual grounding (VG) aims to locate the most relevant object or region in an image, based on a natural language query. Generally, it requires the machine to first understand the query, identify the key concepts in the image, and then locate the target object by specifying its bounding box. However, in many real-world visual grounding applications, we have to face with ambiguous queries and images with complicated scene structures. Identifying the target based on highly redundant and correlated information can be very challenging, and often leading to unsatisfactory performance. To tackle this, in this paper, we exploit an attention module for each kind of information to reduce internal redundancies. We then propose an accumulated attention (A-ATT) mechanism to reason among all the attention modules jointly. In this way, the relation among different kinds of information can be explicitly captured. Moreover, to improve the performance and robustness of our VG models, we additionally introduce some noises into the training procedure to bridge the distribution gap between the human-labeled training data and the real-world poor quality data. With this "noised" training strategy, we can further learn a bounding box regressor, which can be used to refine the bounding box of the target object. We evaluate the proposed methods on four popular datasets (namely ReferCOCO, ReferCOCO+, ReferCOCOg, and GuessWhat?!). The experimental results show that our methods significantly outperform all previous works on every dataset in terms of accuracy.


Asunto(s)
Algoritmos , Atención , Humanos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA