Fitness of General-Purpose Monocular Depth Estimation Architectures for Transparent Structures

Wirth, Tristan; Jamili, Aria; Buelow, Max von; Knauthe, Volker; Guthe, Stefan

Fitness of General-Purpose Monocular Depth Estimation Architectures for Transparent Structures

dc.contributor.author	Wirth, Tristan	en_US
dc.contributor.author	Jamili, Aria	en_US
dc.contributor.author	Buelow, Max von	en_US
dc.contributor.author	Knauthe, Volker	en_US
dc.contributor.author	Guthe, Stefan	en_US
dc.contributor.editor	Pelechano, Nuria	en_US
dc.contributor.editor	Vanderhaeghe, David	en_US
dc.date.accessioned	2022-04-22T08:16:09Z
dc.date.available	2022-04-22T08:16:09Z
dc.date.issued	2022
dc.description.abstract	Due to material properties, monocular depth estimation of transparent structures is inherently challenging. Recent advances leverage additional knowledge that is not available in all contexts, i.e., known shape or depth information from a sensor. General-purpose machine learning models, that do not utilize such additional knowledge, have not yet been explicitly evaluated regarding their performance on transparent structures. In this work, we show that these models show poor performance on the depth estimation of transparent structures. However, fine-tuning on suitable data sets, such as ClearGrasp, increases their estimation performance on the task at hand. Our evaluations show that high performance on general-purpose benchmarks translates well into performance on transparent objects after fine-tuning. Furthermore, our analysis suggests, that state-of-theart high-performing models are not able to capture a high grade of detail from both the image foreground and background at the same time. This finding shows the demand for a combination of existing models to further enhance depth estimation quality.	en_US
dc.description.sectionheaders	Image and Video
dc.description.seriesinformation	Eurographics 2022 - Short Papers
dc.identifier.doi	10.2312/egs.20221020
dc.identifier.isbn	978-3-03868-169-4
dc.identifier.issn	1017-4656
dc.identifier.pages	9-12
dc.identifier.pages	4 pages
dc.identifier.uri	https://doi.org/10.2312/egs.20221020
dc.identifier.uri	https://diglib.eg.org:443/handle/10.2312/egs20221020
dc.publisher	The Eurographics Association	en_US
dc.rights	Attribution 4.0 International License
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	CCS Concepts: Computing methodologies --> Computer vision; Shape inference
dc.subject	Computing methodologies
dc.subject	Computer vision
dc.subject	Shape inference
dc.title	Fitness of General-Purpose Monocular Depth Estimation Architectures for Transparent Structures	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 009-012.pdf
Size:: 3.47 MB
Format:: Adobe Portable Document Format

Download

Collections

EG 2022 - Short Papers