Factored Neural Representation for Scene Understanding

Wong, Yu-Shiang; Mitra, Niloy J.

dc.contributor.author	Wong, Yu-Shiang	en_US
dc.contributor.author	Mitra, Niloy J.	en_US
dc.contributor.editor	Memari, Pooran	en_US
dc.contributor.editor	Solomon, Justin	en_US
dc.date.accessioned	2023-06-30T06:19:13Z
dc.date.available	2023-06-30T06:19:13Z
dc.date.issued	2023
dc.description.abstract	A long-standing goal in scene understanding is to obtain interpretable and editable representations that can be directly constructed from a raw monocular RGB-D video, without requiring specialized hardware setups or priors. The problem is significantly more challenging in the presence of multiple moving and/or deforming objects. Traditional methods have approached this setting with a mix of simplifications, scene priors, pretrained templates, or known deformation models. The advent of neural representations, especially neural implicit representations and radiance fields, opens the possibility of end-to-end optimization to collectively capture geometry, appearance, and object motion. However, current approaches produce a global scene encoding, assume multiview capture with limited or no motion in the scene, and do not facilitate easy manipulation beyond novel view synthesis. In this work, we introduce a factored neural scene representation that can be learned directly from a monocular RGB-D video to produce object-level neural representations with an explicit encoding of object movement (e.g., rigid trajectory) and/or deformations (e.g., nonrigid movement). We evaluate our method against a set of neural approaches on both synthetic and real data to demonstrate that the representation is efficient, interpretable, and editable (e.g., changing an object's trajectory). Code and data are available at: http://geometry.cs.ucl.ac.uk/projects/2023/factorednerf/.	en_US
dc.description.number	5
dc.description.sectionheaders	Point Clouds and Scenes
dc.description.seriesinformation	Computer Graphics Forum
dc.description.volume	42
dc.identifier.doi	10.1111/cgf.14911
dc.identifier.issn	1467-8659
dc.identifier.pages	14 pages
dc.identifier.uri	https://doi.org/10.1111/cgf.14911
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf14911
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.subject	CCS Concepts: Computing methodologies -> Reconstruction; Volumetric models; Tracking
dc.subject	Computing methodologies
dc.subject	Reconstruction
dc.subject	Volumetric models
dc.subject	Tracking
dc.title	Factored Neural Representation for Scene Understanding	en_US

Files

Original bundle

Name: v42i5_15_14911.pdf
Size: 42.23 MB
Format: Adobe Portable Document Format

Collections

42-Issue 5
SGP23: Eurographics Symposium on Geometry Processing (CGF 42-5)