Generating High-Resolution 3D Faces and Bodies Using VQ-VAE-2 with PixelSNAIL Networks on 2D Representations.

Gallucci, Alessio; Znamenskiy, Dmitry; Long, Yuxuan; Pezzotti, Nicola; Petkovic, Milan

Gallucci, Alessio; Znamenskiy, Dmitry; Long, Yuxuan; Pezzotti, Nicola; Petkovic, Milan.

Afiliação

Gallucci A; Philips Research, 5656 AE Eindhoven, The Netherlands.
Znamenskiy D; Department of Mathematics and Computer Science, Eindhoven University of Technology, 5612 AE Eindhoven, The Netherlands.
Long Y; Philips Research, 5656 AE Eindhoven, The Netherlands.
Pezzotti N; Philips Research, 5656 AE Eindhoven, The Netherlands.
Petkovic M; Philips Research, 5656 AE Eindhoven, The Netherlands.

Sensors (Basel) ; 23(3)2023 Jan 19.

Article em En | MEDLINE | ID: mdl-36772208

ABSTRACT

ABSTRACT

Modeling and representing 3D shapes of the human body and face is a prominent field due to its applications in the healthcare, clothes, and movie industry. In our work, we tackled the problem of 3D face and body synthesis by reducing 3D meshes to 2D image representations. We show that the face can naturally be modeled on a 2D grid. At the same time, for more challenging 3D body geometries, we proposed a novel non-bijective 3D-2D conversion method representing the 3D body mesh as a plurality of rendered projections on the 2D grid. Then, we trained a state-of-the-art vector-quantized variational autoencoder (VQ-VAE-2) to learn a latent representation of 2D images and fit a PixelSNAIL autoregressive model to sample novel synthetic meshes. We evaluated our method versus a classical one based on principal component analysis (PCA) by sampling from the empirical cumulative distribution of the PCA scores. We used the empirical distributions of two commonly used metrics, specificity and diversity, to quantitatively demonstrate that the synthetic faces generated with our method are statistically closer to real faces when compared with the PCA ones. Our experiment on the 3D body geometry requires further research to match the test set statistics but shows promising results.

Palavras-chave

2D regular representation; 3D body synthesis; 3D face synthesis; artificial neural networks; autoencoders; autoregressive models; generative modeling

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2023 Tipo de documento: Article