Local Positional Encoding for Multi-Layer Perceptrons

dc.contributor.authorFujieda, Shinen_US
dc.contributor.authorYoshimura, Atsushien_US
dc.contributor.authorHarada, Takahiroen_US
dc.contributor.editorChaine, Raphaëlleen_US
dc.contributor.editorDeng, Zhigangen_US
dc.contributor.editorKim, Min H.en_US
dc.date.accessioned2023-10-09T07:42:50Z
dc.date.available2023-10-09T07:42:50Z
dc.date.issued2023
dc.description.abstractA multi-layer perceptron (MLP) is a type of neural networks which has a long history of research and has been studied actively recently in computer vision and graphics fields. One of the well-known problems of an MLP is the capability of expressing highfrequency signals from low-dimensional inputs. There are several studies for input encodings to improve the reconstruction quality of an MLP by applying pre-processing against the input data. This paper proposes a novel input encoding method, local positional encoding, which is an extension of positional and grid encodings. Our proposed method combines these two encoding techniques so that a small MLP learns high-frequency signals by using positional encoding with fewer frequencies under the lower resolution of the grid to consider the local position and scale in each grid cell. We demonstrate the effectiveness of our proposed method by applying it to common 2D and 3D regression tasks where it shows higher-quality results compared to positional and grid encodings, and comparable results to hierarchical variants of grid encoding such as multi-resolution grid encoding with equivalent memory footprint.en_US
dc.description.sectionheadersLearning-based Reflectance
dc.description.seriesinformationPacific Graphics Short Papers and Posters
dc.identifier.doi10.2312/pg.20231273
dc.identifier.isbn978-3-03868-234-9
dc.identifier.pages73-80
dc.identifier.pages8 pages
dc.identifier.urihttps://doi.org/10.2312/pg.20231273
dc.identifier.urihttps://diglib.eg.org:443/handle/10.2312/pg20231273
dc.publisherThe Eurographics Associationen_US
dc.rightsAttribution 4.0 International License
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectCCS Concepts: Computing methodologies -> Artificial intelligence; Machine learning algorithms; Image representations
dc.subjectComputing methodologies
dc.subjectArtificial intelligence
dc.subjectMachine learning algorithms
dc.subjectImage representations
dc.titleLocal Positional Encoding for Multi-Layer Perceptronsen_US
Files
Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
073-080.pdf
Size:
39.78 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
paper1117_mm.pdf
Size:
27.58 MB
Format:
Adobe Portable Document Format