Raster-to-Graph: Floorplan Recognition via Autoregressive Graph Prediction with an Attention Transformer

dc.contributor.authorHu, Sizheen_US
dc.contributor.authorWu, Wenmingen_US
dc.contributor.authorSu, Ruolinen_US
dc.contributor.authorHou, Wannien_US
dc.contributor.authorZheng, Lipingen_US
dc.contributor.authorXu, Benzhuen_US
dc.contributor.editorBermano, Amit H.en_US
dc.contributor.editorKalogerakis, Evangelosen_US
dc.date.accessioned2024-04-30T09:06:54Z
dc.date.available2024-04-30T09:06:54Z
dc.date.issued2024
dc.description.abstractRecognizing the detailed information embedded in rasterized floorplans is at the research forefront in the community of computer graphics and vision. With the advent of deep neural networks, automatic floorplan recognition has made tremendous breakthroughs. However, co-recognizing both the structures and semantics of floorplans through one neural network remains a significant challenge. In this paper, we introduce a novel framework Raster-to-Graph, which automatically achieves structural and semantic recognition of floorplans.We represent vectorized floorplans as structural graphs embedded with floorplan semantics, thus transforming the floorplan recognition task into a structural graph prediction problem. We design an autoregressive prediction framework using the neural network architecture of the visual attention Transformer, iteratively predicting the wall junctions and wall segments of floorplans in the order of graph traversal. Additionally, we propose a large-scale floorplan dataset containing over 10,000 real-world residential floorplans. Our autoregressive framework can automatically recognize the structures and semantics of floorplans. Extensive experiments demonstrate the effectiveness of our framework, showing significant improvements on all metrics. Qualitative and quantitative evaluations indicate that our framework outperforms existing state-of-the-art methods. Code and dataset for this paper are available at: https://github.com/HSZVIS/Raster-to-Graph.en_US
dc.description.number2
dc.description.sectionheadersShape and Scene Understanding
dc.description.seriesinformationComputer Graphics Forum
dc.description.volume43
dc.identifier.doi10.1111/cgf.15007
dc.identifier.issn1467-8659
dc.identifier.pages14 pages
dc.identifier.urihttps://doi.org/10.1111/cgf.15007
dc.identifier.urihttps://diglib.eg.org/handle/10.1111/cgf15007
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.subjectCCS Concepts: Computing methodologies -> Shape modeling; Computer vision
dc.subjectComputing methodologies
dc.subjectShape modeling
dc.subjectComputer vision
dc.titleRaster-to-Graph: Floorplan Recognition via Autoregressive Graph Prediction with an Attention Transformeren_US
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
v43i2_03_15007.pdf
Size:
35.55 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
paper1114.mp4
Size:
20.93 MB
Format:
Video MP4
Collections