3D car shape reconstruction from a contour sketch using GAN and lazy learning-Reference-Cited by-同舟云学术

3D car shape reconstruction from a contour sketch using GAN and lazy learning

Published:2021-04-16 Issue: Volume: Page:
ISSN:0178-2789
Container-title:The Visual Computer
language:en
Short-container-title:Vis Comput

Author:

Nozawa Naoki,Shum Hubert P. H.^ORCID,Feng Qi,Ho Edmond S. L.^ORCID,Morishima Shigeo

Abstract

Abstract3D car models are heavily used in computer games, visual effects, and even automotive designs. As a result, producing such models with minimal labour costs is increasingly more important. To tackle the challenge, we propose a novel system to reconstruct a 3D car using a single sketch image. The system learns from a synthetic database of 3D car models and their corresponding 2D contour sketches and segmentation masks, allowing effective training with minimal data collection cost. The core of the system is a machine learning pipeline that combines the use of a generative adversarial network (GAN) and lazy learning. GAN, being a deep learning method, is capable of modelling complicated data distributions, enabling the effective modelling of a large variety of cars. Its major weakness is that as a global method, modelling the fine details in the local region is challenging. Lazy learning works well to preserve local features by generating a local subspace with relevant data samples. We demonstrate that the combined use of GAN and lazy learning produces is able to produce high-quality results, in which different types of cars with complicated local features can be generated effectively with a single sketch. Our method outperforms existing ones using other machine learning structures such as the variational autoencoder.

Funder

Royal Society

JST ACCEL

JST-Mirai Program

JSPS KAKENHI

Publisher

Springer Science and Business Media LLC

Subject

Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition,Software

Link

https://link.springer.com/content/pdf/10.1007/s00371-020-02024-y.pdf

Reference51 articles.

1. Blanz, V., Vetter, T.: A morphable model for the synthesis of 3d faces. In: Proceedings of the 26th annual conference on computer graphics and interactive techniques. SIGGRAPH ’99, pp 187–194. ACM Press/Addison-Wesley Publishing Co., New York (1999)

2. Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 6, 679–698 (1986)

3. Chai, J., Hodgins, J.K.: Performance animation from low-dimensional control signals. ACM Trans. Graph. 24(3), 686–696 (2005)

4. Chang, A.X., Funkhouser, T., Guibas, L., Hanrahan, P., Huang, Q., Li, Z., Savarese, S., Savva, M., Song, S., Su, H., Xiao, J., Yi, L., Yu, F.: ShapeNet: an information-rich 3D model repository. Technical report, Stanford University—Princeton University—Toyota Technological Institute at Chicago (2015) arXiv:1512.03012 [cs.GR]

5. Charles, R.Q., Su, H., Kaichun, M., Guibas, L.J.: Pointnet: deep learning on point sets for 3D classification and segmentation. In: Proceedings of the 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp 77–85 (2017)

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. GAN-based generation of realistic 3D volumetric data: A systematic review and taxonomy;Medical Image Analysis;2024-04

2. Large GAN Is All You Need;Lecture Notes in Computer Science;2024

3. Investigation on the Encoder-Decoder Application for Mesh Generation;Advances in Computer Graphics;2023-12-29

4. A fast-training GAN for coal–gangue image augmentation based on a few samples;The Visual Computer;2023-12-22

5. FFANet: dual attention-based flow field-aware network for wall identification;The Visual Computer;2023-12-13