Pesquisa | BVS CLAP/SMR-OPAS/OMS

DynGAN: Solving Mode Collapse in GANs With Dynamic Clustering.

Luo, Yixin; Yang, Zhouwang.

IEEE Trans Pattern Anal Mach Intell ; 46(8): 5493-5503, 2024 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-38376961

RESUMO

Generative Adversarial Networks (GANs) are widely-used generative models for synthesizing complex and realistic data. However, mode collapse, where the diversity of generated samples is significantly lower than that of real samples, poses a major challenge for further applications. Our theoretical analysis demonstrates that the generator loss function is non-convex with respect to its parameters when there are multiple modes in real data. In particular, parameters that result in generated distributions with perfect partial mode coverage of the real distribution are the local minima of the generator loss function. To address mode collapse, we propose a unified framework called Dynamic GAN. This method detects collapsed samples in the generator by thresholding on observable discriminator outputs, divides the training set based on these collapsed samples, and trains a dynamic conditional model on the partitions. The theoretical outcome ensures progressive mode coverage and experiments on synthetic and real-world data sets demonstrate that our method surpasses several GAN variants. In conclusion, we examine the root cause of mode collapse and offer a novel approach to quantitatively detect and resolve it in GANs.

Multi-resolution auto-encoder for anomaly detection of retinal imaging.

Luo, Yixin; Ma, Yangling; Yang, Zhouwang.

Phys Eng Sci Med ; 47(2): 517-529, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38285270

RESUMO

Identifying unknown types of diseases is a crucial step in preceding retinal imaging classification for the sake of safety, which is known as anomaly detection of retinal imaging. However, the widely-used supervised learning algorithms are not suitable for this problem, since the data of the unknown category is unobtainable. Moreover, for retinal imaging with different types of anomalous regions, using a single-resolution input causes information loss. Therefore, we propose an unsupervised auto-encoder model with multi-resolution inputs and outputs. We provide a theoretical understanding of the effectiveness of reconstruction error and the improvement of self-supervised learning for anomaly detection. Our experiments on two widely-used retinal imaging datasets show that the proposed methods are superior to other methods, and further experiments verify the validity of each part of the proposed method.

Assuntos

Algoritmos , Processamento de Imagem Assistida por Computador , Retina , Humanos , Retina/diagnóstico por imagem

Ver mais detalhes

ENVIAR RESULTADO:

Exportar

Imprimir

RSS

XML

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA