Omnidirectional image super-resolution via position attention network.

Wang, Xin; Wang, Shiqi; Li, Jinxing; Li, Mu; Li, Jinkai; Xu, Yong

Wang, Xin; Wang, Shiqi; Li, Jinxing; Li, Mu; Li, Jinkai; Xu, Yong.

Afiliação

Wang X; School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Shenzhen, 518055, China; Shenzhen Key Laboratory of Visual Object Detection and Recognition, Harbin Institute of Technology, Shenzhen, Shenzhen, 518055, China; Department of Computer Science, City University of Hong
Wang S; Department of Computer Science, City University of Hong Kong, Kowloon Tong, Hong Kong.
Li J; School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Shenzhen, 518055, China.
Li M; School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Shenzhen, 518055, China. Electronic address: limuhit@gmail.com.
Li J; School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Shenzhen, 518055, China; Shenzhen Key Laboratory of Visual Object Detection and Recognition, Harbin Institute of Technology, Shenzhen, Shenzhen, 518055, China.
Xu Y; School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Shenzhen, 518055, China; Shenzhen Key Laboratory of Visual Object Detection and Recognition, Harbin Institute of Technology, Shenzhen, Shenzhen, 518055, China. Electronic address: laterfall@hit.edu.cn.

Neural Netw ; 178: 106464, 2024 Oct.

Article em En | MEDLINE | ID: mdl-38968779

ABSTRACT

ABSTRACT

For convenient transmission, omnidirectional images (ODIs) usually follow the equirectangular projection (ERP) format and are low-resolution. To provide better immersive experience, omnidirectional image super resolution (ODISR) is essential. However, ERP ODIs suffer from serious geometric distortion and pixel stretching across latitudes, generating massive redundant information at high latitudes. This characteristic poses a huge challenge for the traditional SR methods, which can only obtain the suboptimal ODISR performance. To address this issue, we propose a novel position attention network (PAN) for ODISR in this paper. Specifically, a two-branch structure is introduced, in which the basic enhancement branch (BE) serves to achieve coarse deep feature enhancement for extracted shallow features. Meanwhile, the position attention enhancement branch (PAE) builds a positional attention mechanism to dynamically adjust the contribution of features at different latitudes in the ERP representation according to their positions and stretching degrees, which achieves the enhancement for the differentiated information, suppresses the redundant information, and modulate the deep features with spatial distortion. Subsequently, the features of two branches are fused effectively to achieve the further refinement and adapt the distortion characteristic of ODIs. After that, we exploit a long-term memory module (LM), promoting information interactions and fusions between the branches to enhance the perception of the distortion, aggregating the prior hierarchical features to keep the long-term memory and boosting the ODISR performance. Extensive results demonstrate the state-of-the-art performance and the high efficiency of our PAN in ODISR.

Assuntos

Redes Neurais de Computação; Humanos; Processamento de Imagem Assistida por Computador/métodos; Atenção/fisiologia; Algoritmos

Palavras-chave

Distortion characteristic; Omnidirectional image; Position attention; Super-resolution

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação Limite: Humans Idioma: En Revista: Neural Netw Ano de publicação: 2024 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google