MoNeRF: Deformable Neural Rendering for Talking Heads via Latent Motion Navigation
dc.contributor.author | Li, X. | en_US |
dc.contributor.author | Ding, Y. | en_US |
dc.contributor.author | Li, R. | en_US |
dc.contributor.author | Tang, Z. | en_US |
dc.contributor.author | Li, K. | en_US |
dc.date.accessioned | 2025-03-07T16:48:51Z | |
dc.date.available | 2025-03-07T16:48:51Z | |
dc.date.issued | 2024 | |
dc.description.abstract | Novel view synthesis for talking heads presents significant challenges due to the complex and diverse motion transformations involved. Conventional methods often resort to reliance on structure priors, like facial templates, to warp observed images into a canonical space conducive to rendering. However, the incorporation of such priors introduces a trade‐off‐while aiding in synthesis, they concurrently amplify model complexity, limiting generalizability to other deformable scenes. Departing from this paradigm, we introduce a pioneering solution: the motion‐conditioned neural radiance field, MoNeRF, designed to model talking heads through latent motion navigation. At the core of MoNeRF lies a novel approach utilizing a compact set of latent codes to represent orthogonal motion directions. This innovative strategy empowers MoNeRF to efficiently capture and depict intricate scene motion by linearly combining these latent codes. In an extended capability, MoNeRF facilitates motion control through latent code adjustments, supports view transfer based on reference videos, and seamlessly extends its applicability to model human bodies without necessitating structural modifications. Rigorous quantitative and qualitative experiments unequivocally demonstrate MoNeRF's superior performance compared to state‐of‐the‐art methods in talking head synthesis. We will release the source code upon publication. | en_US |
dc.description.number | 1 | |
dc.description.sectionheaders | Original Article | |
dc.description.seriesinformation | Computer Graphics Forum | |
dc.description.volume | 44 | |
dc.identifier.doi | 10.1111/cgf.15274 | |
dc.identifier.issn | 1467-8659 | |
dc.identifier.pages | 13 | |
dc.identifier.uri | https://doi.org/10.1111/cgf.15274 | |
dc.identifier.uri | https://diglib.eg.org/handle/10.1111/cgf15274 | |
dc.publisher | Eurographics ‐ The European Association for Computer Graphics and John Wiley & Sons Ltd. | en_US |
dc.subject | image and video processing | |
dc.subject | rendering | |
dc.subject | image‐based rendering | |
dc.subject | • Computing methodologies → Computer graphics; Image manipulation; Image‐based rendering | |
dc.title | MoNeRF: Deformable Neural Rendering for Talking Heads via Latent Motion Navigation | en_US |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- 14_cgf15274.pdf
- Size:
- 12.19 MB
- Format:
- Adobe Portable Document Format