Voxel-Based 3D Object Reconstruction from Single 2D Image Using Variational Autoencoders

Tahir, Rohan; Sargano, Allah Bux; Habib, Zulfiqar

Voxel-Based 3D Object Reconstruction from Single 2D Image Using Variational Autoencoders

Rohan Tahir, Allah Bux Sargano and Zulfiqar Habib
Additional contact information
Rohan Tahir: Department of Computer Science, COMSATS University Islamabad, Lahore 54000, Pakistan
Allah Bux Sargano: Department of Computer Science, COMSATS University Islamabad, Lahore 54000, Pakistan
Zulfiqar Habib: Department of Computer Science, COMSATS University Islamabad, Lahore 54000, Pakistan

Mathematics, 2021, vol. 9, issue 18, 1-11

Abstract: In recent years, learning-based approaches for 3D reconstruction have gained much popularity due to their encouraging results. However, unlike 2D images, 3D cannot be represented in its canonical form to make it computationally lean and memory-efficient. Moreover, the generation of a 3D model directly from a single 2D image is even more challenging due to the limited details available from the image for 3D reconstruction. Existing learning-based techniques still lack the desired resolution, efficiency, and smoothness of the 3D models required for many practical applications. In this paper, we propose voxel-based 3D object reconstruction (V3DOR) from a single 2D image for better accuracy, one using autoencoders (AE) and another using variational autoencoders (VAE). The encoder part of both models is used to learn suitable compressed latent representation from a single 2D image, and a decoder generates a corresponding 3D model. Our contribution is twofold. First, to the best of the authors’ knowledge, it is the first time that variational autoencoders (VAE) have been employed for the 3D reconstruction problem. Second, the proposed models extract a discriminative set of features and generate a smoother and high-resolution 3D model. To evaluate the efficacy of the proposed method, experiments have been conducted on a benchmark ShapeNet data set. The results confirm that the proposed method outperforms state-of-the-art methods.

Keywords: voxels; geometric modeling; 3D surface reconstruction; variational autoencoders; deep learning (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2021
References: View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/2227-7390/9/18/2288/pdf (application/pdf)
https://www.mdpi.com/2227-7390/9/18/2288/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:9:y:2021:i:18:p:2288-:d:637358

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().