Search | BVS Bolivia

Self-Supervised Multimodal Learning: A Survey.

Zong, Yongshuo; Mac Aodha, Oisin; Hospedales, Timothy.

IEEE Trans Pattern Anal Mach Intell ; PP, 2024 Aug 07.

Article in English | MEDLINE | ID: mdl-39110564

ABSTRACT

Multimodal learning, which aims to understand and analyze information from multiple modalities, has achieved substantial progress in the supervised regime in recent years. However, the heavy dependence on data paired with expensive human annotations impedes scaling up models. Meanwhile, given the availability of large-scale unannotated data in the wild, self-supervised learning has become an attractive strategy to alleviate the annotation bottleneck. Building on these two directions, self-supervised multimodal learning (SSML) provides ways to learn from raw multimodal data. In this survey, we provide a comprehensive review of the state-of-the-art in SSML, in which we elucidate three major challenges intrinsic to self-supervised learning with multimodal data: (1) learning representations from multimodal data without labels, (2) fusion of different modalities, and (3) learning with unaligned data. We then detail existing solutions to these challenges. Specifically, we consider (1) objectives for learning from multimodal unlabeled data via self-supervision, (2) model architectures from the perspective of different multimodal fusion strategies, and (3) pair-free learning strategies for coarse-grained and fine-grained alignment. We also review real-world applications of SSML algorithms in diverse fields such as healthcare, remote sensing, and machine translation. Finally, we discuss challenges and future directions for SSML. A collection of related resources can be found at: https://github.com/ys-zong/awesome-self-supervised-multimodal-learning.

Magnetic Resonance Image Denoising Algorithm Based on Cartoon, Texture, and Residual Parts.

Zeng, Yanqiu; Zhang, Baocan; Zhao, Wei; Xiao, Shixiao; Zhang, Guokai; Ren, Haiping; Zhao, Wenbing; Peng, Yonghong; Xiao, Yutian; Lu, Yiwen; Zong, Yongshuo; Ding, Yimin.

Comput Math Methods Med ; 2020: 1405647, 2020.

Article in English | MEDLINE | ID: mdl-32411276

ABSTRACT

Magnetic resonance (MR) images are often contaminated by Gaussian noise, an electronic noise caused by the random thermal motion of electronic components, which reduces the quality and reliability of the images. This paper puts forward a hybrid denoising algorithm for MR images based on two sparsely represented morphological components and one residual part. First, a noisy MR image is decomposed into cartoon, texture, and residual parts by morphological component analysis (MCA); the three parts are then denoised with a Wiener filter, wavelet hard thresholding, and wavelet soft thresholding, respectively. Finally, the denoised subimages are stacked up to obtain the denoised MR image. The experimental results show that the proposed method performs significantly better in terms of mean square error and peak signal-to-noise ratio than each method alone.
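
The abstract names wavelet hard and soft thresholding and reports results as mean square error (MSE) and peak signal-to-noise ratio (PSNR) without defining them. The following is a minimal sketch of the standard textbook definitions in plain Python, operating on flat lists of values. The function names and the `peak=255.0` default (8-bit intensities) are assumptions made for illustration, not details from the paper, and the MCA decomposition and Wiener filter steps are not shown.

```python
import math


def hard_threshold(coeffs, t):
    """Keep coefficients whose magnitude exceeds t; set the rest to zero."""
    return [c if abs(c) > t else 0.0 for c in coeffs]


def soft_threshold(coeffs, t):
    """Zero coefficients with magnitude <= t and shrink the others toward zero by t."""
    return [math.copysign(max(abs(c) - t, 0.0), c) for c in coeffs]


def mse(reference, estimate):
    """Mean square error between two equally long sequences of pixel values."""
    return sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)


def psnr(reference, estimate, peak=255.0):
    """Peak signal-to-noise ratio in dB; `peak` is the assumed maximum intensity."""
    err = mse(reference, estimate)
    if err == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(peak ** 2 / err)
```

Hard thresholding leaves the surviving coefficients untouched, whereas soft thresholding also reduces their magnitude by the threshold, which generally gives smoother results at the cost of some attenuation.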

Subjects

Algorithms; Magnetic Resonance Imaging/statistics & numerical data; Brain/diagnostic imaging; Computational Biology; Computer Simulation; Databases, Factual; Humans; Image Interpretation, Computer-Assisted/statistics & numerical data; Neuroimaging/statistics & numerical data; Normal Distribution; Principal Component Analysis; Signal Processing, Computer-Assisted; Signal-To-Noise Ratio; Wavelet Analysis
