Semi-Supervised 3D Shape Recognition via Multimodal Deep Co-training

dc.contributor.authorSong, Mofeien_US
dc.contributor.authorLiu, Yuen_US
dc.contributor.authorLiu, Xiao Fanen_US
dc.contributor.editorEisemann, Elmar and Jacobson, Alec and Zhang, Fang-Lueen_US
dc.date.accessioned2020-10-29T18:50:55Z
dc.date.available2020-10-29T18:50:55Z
dc.date.issued2020
dc.description.abstract3D shape recognition has been actively investigated in the field of computer graphics. With the rapid development of deep learning, various deep models have been introduced and achieved remarkable results. Most 3D shape recognition methods are supervised and learn only from the large amount of labeled shapes. However, it is expensive and time consuming to obtain such a large training set. In contrast to these methods, this paper studies a semi-supervised learning framework to train a deep model for 3D shape recognition by using both labeled and unlabeled shapes. Inspired by the co-training algorithm, our method iterates between model training and pseudo-label generation phases. In the model training phase, we train two deep networks based on the point cloud and multi-view representation simultaneously. In the pseudo-label generation phase, we generate the pseudo-labels of the unlabeled shapes using the joint prediction of two networks, which augments the labeled set for the next iteration. To extract more reliable consensus information from multiple representations, we propose an uncertainty-aware consistency loss function to combine the two networks into a multimodal network. This not only encourages the two networks to give similar predictions on the unlabeled set, but also eliminates the negative influence of the large performance gap between the two networks. Experiments on the benchmark ModelNet40 demonstrate that, with only 10% labeled training data, our approach achieves competitive performance to the results reported by supervised methods.en_US
dc.description.number7
dc.description.sectionheadersRecognition
dc.description.seriesinformationComputer Graphics Forum
dc.description.volume39
dc.identifier.doi10.1111/cgf.14144
dc.identifier.issn1467-8659
dc.identifier.pages279-289
dc.identifier.urihttps://doi.org/10.1111/cgf.14144
dc.identifier.urihttps://diglib.eg.org:443/handle/10.1111/cgf14144
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.subjectComputing methodologies
dc.subjectShape analysis
dc.titleSemi-Supervised 3D Shape Recognition via Multimodal Deep Co-trainingen_US
Files
Collections