Acorn-Reference-Cited by-同舟云学术

Acorn

Published:2021-08-31 Issue:4 Volume:40 Page:1-13
ISSN:0730-0301
Container-title:ACM Transactions on Graphics
language:en
Short-container-title:ACM Trans. Graph.

Author:

Martel Julien N. P.¹,Lindell David B.¹,Lin Connor Z.¹,Chan Eric R.¹,Monteiro Marco¹,Wetzstein Gordon¹

Affiliation:

1. Stanford University

Abstract

Neural representations have emerged as a new paradigm for applications in rendering, imaging, geometric modeling, and simulation. Compared to traditional representations such as meshes, point clouds, or volumes they can be flexibly incorporated into differentiable learning-based pipelines. While recent improvements to neural representations now make it possible to represent signals with fine details at moderate resolutions (e.g., for images and 3D shapes), adequately representing large-scale or complex scenes has proven a challenge. Current neural representations fail to accurately represent images at resolutions greater than a megapixel or 3D scenes with more than a few hundred thousand polygons. Here, we introduce a new hybrid implicit-explicit network architecture and training strategy that adaptively allocates resources during training and inference based on the local complexity of a signal of interest. Our approach uses a multiscale block-coordinate decomposition, similar to a quadtree or octree, that is optimized during training. The network architecture operates in two stages: using the bulk of the network parameters, a coordinate encoder generates a feature grid in a single forward pass. Then, hundreds or thousands of samples within each block can be efficiently evaluated using a lightweight feature decoder. With this hybrid implicit-explicit network architecture, we demonstrate the first experiments that fit gigapixel images to nearly 40 dB peak signal-to-noise ratio. Notably this represents an increase in scale of over 1000X compared to the resolution of previously demonstrated image-fitting experiments. Moreover, our approach is able to represent 3D shapes significantly faster and better than previous techniques; it reduces training times from days to hours or minutes and memory requirements by over an order of magnitude.

Funder

Okawa Research grant

NSF

Sloan Fellowship

Swiss National Foundation

PECASE

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design

Link

https://dl.acm.org/doi/pdf/10.1145/3450626.3459785

Reference54 articles.

1. MatryODShka: Real-time 6DoF Video View Synthesis Using Multi-sphere Images

2. SAL: Sign Agnostic Learning of Shapes From Raw Data

3. Adaptive mesh refinement for hyperbolic partial differential equations

4. Immersive light field video with a layered mesh representation

5. Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction

Cited by 99 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. MuSic-UDF: Learning Multi-Scale dynamic grid representation for high-fidelity surface reconstruction from point clouds;Computers & Graphics;2024-09

2. Disorder-Invariant Implicit Neural Representation;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-08

3. N-BVH: Neural ray queries with bounding volume hierarchies;Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers '24;2024-07-13

4. Neural Geometry Fields For Meshes;Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers '24;2024-07-13

5. Sharing massive biomedical data at magnitudes lower bandwidth using implicit neural function;Proceedings of the National Academy of Sciences;2024-07-03