Frequency compensated diffusion model for real-scene dehazing.

Wang, Jing; Wu, Songtao; Yuan, Zhiqiang; Tong, Qiang; Xu, Kuanhong

Wang, Jing; Wu, Songtao; Yuan, Zhiqiang; Tong, Qiang; Xu, Kuanhong.

Afiliação

Wang J; Sony Research and Development Center Beijing Lab, Chao-Yang District, Beijing, 100027, China.
Wu S; Sony Research and Development Center Beijing Lab, Chao-Yang District, Beijing, 100027, China. Electronic address: Songtao.Wu@sony.com.
Yuan Z; Aerospace Information Research Institute, Chinese Academy of Science, Hai-dian District, Beijing, 100094, China.
Tong Q; Sony Research and Development Center Beijing Lab, Chao-Yang District, Beijing, 100027, China.
Xu K; Sony Research and Development Center Beijing Lab, Chao-Yang District, Beijing, 100027, China.

Neural Netw ; 175: 106281, 2024 Jul.

Article em En | MEDLINE | ID: mdl-38579573

ABSTRACT

ABSTRACT

Due to distribution shift, deep learning based methods for image dehazing suffer from performance degradation when applied to real-world hazy images. In this paper, this study considers a dehazing framework based on conditional diffusion models for improved generalization to real haze. First, our work finds that optimizing the training objective of diffusion models, i.e., Gaussian noise vectors, is non-trivial. The spectral bias of deep networks hinders the higher frequency modes in Gaussian vectors from being learned and hence impairs the reconstruction of image details. To tackle this issue, this study designs a network unit, named Frequency Compensation block (FCB), with a bank of filters that jointly emphasize the mid-to-high frequencies of an input signal. Our work demonstrates that diffusion models with FCB achieve significant gains in both perceptual and distortion metrics. Second, to further boost the generalization performance, this study proposed a novel data synthesis pipeline, HazeAug, to augment haze in terms of degree and diversity. Within the framework, a solid baseline for blind dehazing is set up where models are trained on synthetic hazy-clean pairs, and directly generalize to real data. Extensive evaluations on real dehazing datasets demonstrate the superior performance of the proposed dehazing diffusion model in distortion metrics. Compared to recent methods pre-trained on large-scale, high-quality image datasets, our model achieves a significant PSNR improvement of over 1 dB on challenging databases such as Dense-Haze and Nh-Haze.

Assuntos

Aprendizado Profundo; Redes Neurais de Computação; Processamento de Imagem Assistida por Computador/métodos; Humanos; Algoritmos; Distribuição Normal

Palavras-chave

Data synthesis; Dehazing; Diffusion models; Frequency compensation

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação / Aprendizado Profundo Limite: Humans Idioma: En Revista: Neural Netw Assunto da revista: NEUROLOGIA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google