PG2023 Short Papers and Posters

Permanent URI for this collection

https://diglib7.eg.org/handle/10.2312/3543885

Pacific Graphics 2023 - Short Papers and Posters
Daejeon, South Korea || October 10 – 13, 2023

(for Full Papers (CGF) see PG 2023 - CGF 42-7)

Table of Contents

Neural Rendering

SS-SfP: Neural Inverse Rendering for Self Supervised Shape from (Mixed) Polarization

[full paper

] [meta data

]

Ashish Tiwari and Shanmuganathan Raman

Sketch-based Modeling

SketchBodyNet: A Sketch-Driven Multi-faceted Decoder Network for 3D Human Reconstruction

[full paper

] [meta data

]

Fei Wang, Kongzhang Tang, Hefeng Wu, Baoquan Zhao, Hao Cai, and Teng Zhou

Imaging

WaveNet: Wave-Aware Image Enhancement

[full paper

] [meta data

]

Jiachen Dang, Zehao Li, Yong Zhong, and Lishun Wang

Motion Capture and Generation

Feature-Sized Sampling for Vector Line Art

[full paper

] [meta data

]

Stefan Ohrhallinger, Amal Dev Parakkat, and Pooran Memari

DASKEL: An Interactive Choreographical System with Labanotation-Skeleton Translation

[full paper

] [meta data

]

Siyuan Luo, Borou Yu, and Zeyu Wang

Image Editing and Color

Multi-Stage Degradation and Content Embedding Fusion for Blind Super-Resolution

[full paper

] [meta data

]

Haiyang Zhang, Mengyu Jiang, and Liang Liu

Images, Vectorization, and Layouts

Automatic Vector Caricature via Face Parametrization

[full paper

] [meta data

]

Koki Madono, Yannick Hold-Geoffroy, Yijun Li, Daichi Ito, Jose Echevarria, and Cameron Smith

Details and Styles on 3D Models

A Style Transfer Network of Local Geometry for 3D Mesh Stylization

[full paper

] [meta data

]

Hongyuan Kang, Xiao Dong, Xufei Guo, Juan Cao, and Zhonggui Chen

Learning-based Reflectance

Local Positional Encoding for Multi-Layer Perceptrons

[full paper

] [meta data

]

Shin Fujieda, Atsushi Yoshimura, and Takahiro Harada

Radiance and Appearance

Generalizable Dynamic Radiance Fields For Talking Head Synthesis With Few-shot

[full paper

] [meta data

]

Rujing Dang, Shaohui Wang, and Haoqian Wang

Color Harmonization on Images

Text2Mat: Generating Materials from Text

[full paper

] [meta data

]

Zhen He, Jie Guo, Yan Zhang, Qinghao Tu, Mufan Chen, Yanwen Guo, Pengyu Wang, and Wei Dai

Posters

Sketch-to-Architecture: Generative AI-aided Architectural Design

[full paper

] [meta data

]

Pengzhi Li, Baijuan Li, and Zhiheng Li

Avatar Emotion Recognition using Non-verbal Communication

[full paper

] [meta data

]

Jalal Safari Bazargani, Abolghasem Sadeghi-Niaraki, and Soo-Mi Choi

TreeGCN-ED: A Tree-Structured Graph-Based Autoencoder Framework For Point Cloud Processing

[full paper

] [meta data

]

Prajwal Singh, Ashish Tiwari, Kaustubh Sadekar, and Shanmuganathan Raman

Hand Shadow Art: A Differentiable Rendering Perspective

[full paper

] [meta data

]

Aalok Gangopadhyay, Prajwal Singh, Ashish Tiwari, and Shanmuganathan Raman

Reconstructing Baseball Pitching Motions from Video

[full paper

] [meta data

]

Jiwon Kim, Dongkwon Kim, and Ri Yu

A Simple Stochastic Regularization Technique for Avoiding Overfitting in Low Resource Image Classification

[full paper

] [meta data

]

Ya Tu Ji, Bai Lun Wang, Qing Dao Er Ji Ren, Bao Shi, Nier E. Wu, Min Lu, Na Liu, Xu Fei Zhuang, Xuan Xuan Xu, Li Wang, Ling Jie Dai, Miao Miao Yao, and Xiao Mei Li

Multi-scale Monocular Panorama Depth Estimation

[full paper

] [meta data

]

Payal Mohadikar, Chuanmao Fan, Chenxi Zhao, and Ye Duan

Combining Transformer and CNN for Super-Resolution of Animal Fiber Microscopy Images

[full paper

] [meta data

]

Jiagen Li, Yatu Ji, Min Lu, Li Wang, Lingjie Dai, Xuanxuan Xu, Nier Wu, and Na Liu

Detection of Impurities in Wool Based on Improved YOlOV8

[full paper

] [meta data

]

Yang Liu, Yatu Ji, Qing Dao Er Ji Ren, Bao Shi, Xufei Zhuang, Miaomiao Yao, and Xiaomei Li

Progressive Graph Matching Network for Correspondences

[full paper

] [meta data

]

Huihang Feng, Lupeng Liu, and Jun Xiao

Emotion-based Interaction Technique Using User's Voice and Facial Expressions in Virtual and Augmented Reality

[full paper

] [meta data

]

Beom-Seok Ko, Ho-San Kang, Kyuhong Lee, Manuel Braunschweiler, Fabio Zünd, Robert W. Sumner, and Soo-Mi Choi

Color3d: Photorealistic Texture Mapping for 3D Mesh

[full paper

] [meta data

]

Chenxi Zhao, Chuanmao Fan, Payal Mohadikar, and Ye Duan

Visualization System for Analyzing Congestion Pricing Policies

[full paper

] [meta data

]

SeokHwan Choi, Seongbum Seo, Sangbong Yoo, and Yun Jang

Revisiting Visualization Evaluation Using EEG and Visualization Literacy Assessment Test

[full paper

] [meta data

]

Soobin Yim, Chanyoung Jung, Chanyoung Yoon, Sangbong Yoo, Seongwon Choi, and Yun Jang

BibTeX (PG2023 Short Papers and Posters)

@inproceedings{10.2312:pg.20232022,

booktitle = {Pacific Graphics - Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Pacific Graphics 2023 - Short Papers and Posters: Frontmatter}},

author = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20232022}

}

@inproceedings{10.2312:pg.20231265,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{SS-SfP: Neural Inverse Rendering for Self Supervised Shape from (Mixed) Polarization}},

author = {Tiwari, Ashish and 
Raman, Shanmuganathan
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231265}

}

@inproceedings{10.2312:pg.20231266,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{SketchBodyNet: A Sketch-Driven Multi-faceted Decoder Network for 3D Human Reconstruction}},

author = {Wang, Fei and 
Tang, Kongzhang and 
Wu, Hefeng and 
Zhao, Baoquan and 
Cai, Hao and 
Zhou, Teng
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231266}

}

@inproceedings{10.2312:pg.20231267,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{WaveNet: Wave-Aware Image Enhancement}},

author = {Dang, Jiachen and 
Li, Zehao and 
Zhong, Yong and 
Wang, Lishun
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231267}

}

@inproceedings{10.2312:pg.20231268,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Feature-Sized Sampling for Vector Line Art}},

author = {Ohrhallinger, Stefan and 
Parakkat, Amal Dev and 
Memari, Pooran
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231268}

}

@inproceedings{10.2312:pg.20231269,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{DASKEL: An Interactive Choreographical System with Labanotation-Skeleton Translation}},

author = {Luo, Siyuan and 
Yu, Borou and 
Wang, Zeyu
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231269}

}

@inproceedings{10.2312:pg.20231270,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Multi-Stage Degradation and Content Embedding Fusion for Blind Super-Resolution}},

author = {Zhang, Haiyang and 
Jiang, Mengyu and 
Liu, Liang
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231270}

}

@inproceedings{10.2312:pg.20231272,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{A Style Transfer Network of Local Geometry for 3D Mesh Stylization}},

author = {Kang, Hongyuan and 
Dong, Xiao and 
Guo, Xufei and 
Cao, Juan and 
Chen, Zhonggui
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231272}

}

@inproceedings{10.2312:pg.20231271,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Automatic Vector Caricature via Face Parametrization}},

author = {Madono, Koki and 
Hold-Geoffroy, Yannick and 
Li, Yijun and 
Ito, Daichi and 
Echevarria, Jose and 
Smith, Cameron
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231271}

}

@inproceedings{10.2312:pg.20231273,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Local Positional Encoding for Multi-Layer Perceptrons}},

author = {Fujieda, Shin and 
Yoshimura, Atsushi and 
Harada, Takahiro
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231273}

}

@inproceedings{10.2312:pg.20231275,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Text2Mat: Generating Materials from Text}},

author = {He, Zhen and 
Guo, Jie and 
Zhang, Yan and 
Tu, Qinghao and 
Chen, Mufan and 
Guo, Yanwen and 
Wang, Pengyu and 
Dai, Wei
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231275}

}

@inproceedings{10.2312:pg.20231274,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Generalizable Dynamic Radiance Fields For Talking Head Synthesis With Few-shot}},

author = {Dang, Rujing and 
Wang, Shaohui and 
Wang, Haoqian
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231274}

}

@inproceedings{10.2312:pg.20231276,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Sketch-to-Architecture: Generative AI-aided Architectural Design}},

author = {Li, Pengzhi and 
Li, Baijuan and 
Li, Zhiheng
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231276}

}

@inproceedings{10.2312:pg.20231277,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Avatar Emotion Recognition using Non-verbal Communication}},

author = {Bazargani, Jalal Safari and 
Sadeghi-Niaraki, Abolghasem and 
Choi, Soo-Mi
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231277}

}

@inproceedings{10.2312:pg.20231279,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Hand Shadow Art: A Differentiable Rendering Perspective}},

author = {Gangopadhyay, Aalok and 
Singh, Prajwal and 
Tiwari, Ashish and 
Raman, Shanmuganathan
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231279}

}

@inproceedings{10.2312:pg.20231278,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{TreeGCN-ED: A Tree-Structured Graph-Based Autoencoder Framework For Point Cloud Processing}},

author = {Singh, Prajwal and 
Tiwari, Ashish and 
Sadekar, Kaustubh and 
Raman, Shanmuganathan
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231278}

}

@inproceedings{10.2312:pg.20231280,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Reconstructing Baseball Pitching Motions from Video}},

author = {Kim, Jiwon and 
Kim, Dongkwon and 
Yu, Ri
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231280}

}

@inproceedings{10.2312:pg.20231281,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{A Simple Stochastic Regularization Technique for Avoiding Overfitting in Low Resource Image Classification}},

author = {Ji, Ya Tu and 
Wang, Bai Lun and 
Dai, Ling Jie and 
Yao, Miao Miao and 
Li, Xiao Mei and 
Ren, Qing Dao Er Ji and 
Shi, Bao and 
Wu, Nier E. and 
Lu, Min and 
Liu, Na and 
Zhuang, Xu Fei and 
Xu, Xuan Xuan and 
Wang, Li
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231281}

}

@inproceedings{10.2312:pg.20231282,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Multi-scale Monocular Panorama Depth Estimation}},

author = {Mohadikar, Payal and 
Fan, Chuanmao and 
Zhao, Chenxi and 
Duan, Ye
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231282}

}

@inproceedings{10.2312:pg.20231283,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Combining Transformer and CNN for Super-Resolution of Animal Fiber Microscopy Images}},

author = {Li, Jiagen and 
Ji, Yatu and 
Lu, Min and 
Wang, Li and 
Dai, Lingjie and 
Xu, Xuanxuan and 
Wu, Nier and 
Liu, Na
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231283}

}

@inproceedings{10.2312:pg.20231284,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Detection of Impurities in Wool Based on Improved YOlOV8}},

author = {Liu, Yang and 
Ji, Yatu and 
Ren, Qing Dao Er Ji and 
Shi, Bao and 
Zhuang, Xufei and 
Yao, Miaomiao and 
Li, Xiaomei
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231284}

}

@inproceedings{10.2312:pg.20231287,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Color3d: Photorealistic Texture Mapping for 3D Mesh}},

author = {Zhao, Chenxi and 
Fan, Chuanmao and 
Mohadikar, Payal and 
Duan, Ye
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231287}

}

@inproceedings{10.2312:pg.20231285,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Progressive Graph Matching Network for Correspondences}},

author = {Feng, Huihang and 
Liu, Lupeng and 
Xiao, Jun
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231285}

}

@inproceedings{10.2312:pg.20231286,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Emotion-based Interaction Technique Using User's Voice and Facial Expressions in Virtual and Augmented Reality}},

author = {Ko, Beom-Seok and 
Kang, Ho-San and 
Lee, Kyuhong and 
Braunschweiler, Manuel and 
Zünd, Fabio and 
Sumner, Robert W. and 
Choi, Soo-Mi
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231286}

}

@inproceedings{10.2312:pg.20231289,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Revisiting Visualization Evaluation Using EEG and Visualization Literacy Assessment Test}},

author = {Yim, Soobin and 
Jung, Chanyoung and 
Yoon, Chanyoung and 
Yoo, Sangbong and 
Choi, Seongwon and 
Jang, Yun
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231289}

}

@inproceedings{10.2312:pg.20231288,

booktitle = {Pacific Graphics Short Papers and Posters},

editor = {Chaine, Raphaëlle and 
Deng, Zhigang and 
Kim, Min H.
},
title = {{Visualization System for Analyzing Congestion Pricing Policies}},

author = {Choi, SeokHwan and 
Seo, Seongbum and 
Yoo, Sangbong and 
Jang, Yun
},
year = {2023},

publisher = {The Eurographics Association},

ISBN = {978-3-03868-234-9},

DOI = {10.2312/pg.20231288}

}

Browse

Now showing 1 - 26 of 26

Pacific Graphics 2023 - Short Papers and Posters: Frontmatter
(The Eurographics Association, 2023) Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
SS-SfP: Neural Inverse Rendering for Self Supervised Shape from (Mixed) Polarization
(The Eurographics Association, 2023) Tiwari, Ashish; Raman, Shanmuganathan; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
We present a novel inverse rendering-based framework to estimate the 3D shape (per-pixel surface normals and depth) of objects and scenes from single-view polarization images, the problem popularly known as Shape from Polarization (SfP). The existing physics-based and learning-based methods for SfP perform under certain restrictions, i.e., (a) purely diffuse or purely specular reflections, which are seldom in the real surfaces, (b) availability of the ground truth surface normals for direct supervision that are hard to acquire and are limited by the scanner's resolution, and (c) known refractive index. To overcome these restrictions, we start by learning to separate the partially-polarized diffuse and specular reflection components, which we call reflectance cues, based on a modified polarization reflection model and then estimate shape under mixed polarization through an inverse-rendering based self-supervised deep learning framework called SS-SfP, guided by the polarization data and estimated reflectance cues. Furthermore, we also obtain the refractive index as a non-linear least squares solution. Through extensive quantitative and qualitative evaluation, we establish the efficacy of the proposed framework over simple single-object scenes from DeepSfP dataset and complex in-the-wild scenes from SPW dataset in an entirely self-supervised setting. To the best of our knowledge, this is the first learning-based approach to address SfP under mixed polarization in a completely selfsupervised framework. Code will be made publicly available.
SketchBodyNet: A Sketch-Driven Multi-faceted Decoder Network for 3D Human Reconstruction
(The Eurographics Association, 2023) Wang, Fei; Tang, Kongzhang; Wu, Hefeng; Zhao, Baoquan; Cai, Hao; Zhou, Teng; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Reconstructing 3D human shapes from 2D images has received increasing attention recently due to its fundamental support for many high-level 3D applications. Compared with natural images, freehand sketches are much more flexible to depict various shapes, providing a high potential and valuable way for 3D human reconstruction. However, such a task is highly challenging. The sparse abstract characteristics of sketches add severe difficulties, such as arbitrariness, inaccuracy, and lacking image details, to the already badly ill-posed problem of 2D-to-3D reconstruction. Although current methods have achieved great success in reconstructing 3D human bodies from a single-view image, they do not work well on freehand sketches. In this paper, we propose a novel sketch-driven multi-faceted decoder network termed SketchBodyNet to address this task. Specifically, the network consists of a backbone and three separate attention decoder branches, where a multi-head self-attention module is exploited in each decoder to obtain enhanced features, followed by a multi-layer perceptron. The multi-faceted decoders aim to predict the camera, shape, and pose parameters, respectively, which are then associated with the SMPL model to reconstruct the corresponding 3D human mesh. In learning, existing 3D meshes are projected via the camera parameters into 2D synthetic sketches with joints, which are combined with the freehand sketches to optimize the model. To verify our method, we collect a large-scale dataset of about 26k freehand sketches and their corresponding 3D meshes containing various poses of human bodies from 14 different angles. Extensive experimental results demonstrate our SketchBodyNet achieves superior performance in reconstructing 3D human meshes from freehand sketches.
WaveNet: Wave-Aware Image Enhancement
(The Eurographics Association, 2023) Dang, Jiachen; Li, Zehao; Zhong, Yong; Wang, Lishun; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
As a low-level vision task, image enhancement is widely used in various computer vision applications. Recently, multiple methods combined with CNNs, MLP, Transformer, and the Fourier transform have achieved promising results on image enhancement tasks. However, these methods cannot achieve a balance between accuracy and computational cost. In this paper, we formulate the enhancement into a signal modulation problem and propose the WaveNet architecture, which performs well in various parameters and improves the feature expression using wave-like feature representation. Specifically, to better capture wave-like feature representations, we propose to represent a pixel as a sampled value of a signal function with three wave functions (Cosine Wave (CW), Sine Wave (SW), and Gating Wave (GW)) inspired by the Fourier transform. The amplitude and phase are required to generate the wave-like features. The amplitude term includes the original contents of features, and the phase term modulates the relationship between various inputs and fixed weights. To dynamically obtain the phase and the amplitude, we build the Wave Transform Block (WTB) that adaptively generates the waves and modulates the wave superposition mode. Based on the WTB, we establish an effective architecture WaveNet for image enhancement. Extensive experiments on six real-world datasets show that our model achieves better quantitative and qualitative results than state-of-the-art methods. The source code and pretrained model are available at https://github.com/DeniJsonC/WaveNet.
Feature-Sized Sampling for Vector Line Art
(The Eurographics Association, 2023) Ohrhallinger, Stefan; Parakkat, Amal Dev; Memari, Pooran; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
By introducing a first-of-its-kind quantifiable sampling algorithm based on feature size, we present a fresh perspective on the practical aspects of planar curve sampling. Following the footsteps of e-sampling, which was originally proposed in the context of curve reconstruction to offer provable topological guarantees [ABE98] under quantifiable bounds, we propose an arbitrarily precise e-sampling algorithm for sampling smooth planar curves (with a prior bound on the minimum feature size of the curve). This paper not only introduces the first such algorithm which provides user-control and quantifiable precision but also highlights the importance of such a sampling process under two key contexts: 1) To conduct a first study comparing theoretical sampling conditions with practical sampling requirements for reconstruction guarantees that can further be used for analysing the upper bounds of e for various reconstruction algorithms with or without proofs, 2) As a feature-aware sampling of vector line art that can be used for applications such as coloring and meshing.
DASKEL: An Interactive Choreographical System with Labanotation-Skeleton Translation
(The Eurographics Association, 2023) Luo, Siyuan; Yu, Borou; Wang, Zeyu; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
We propose DASKEL, a real-time interactive choreography system with bidirectional human skeleton-Labanotation conversion. DASKEL fuses dance notation (DA) with human skeleton data (SKEL). Our approach connects dance data represented in Labanotation with motion capture skeleton data in BVH format, facilitating seamless bidirectional conversion between the two formats. Moreover, DASKEL introduces a numerical representation for symbols used in Labanotation and supports their intuitive visualization which augments the practicality and applicability. Previous methods for the conversion between Labanotation and human skeleton only support the upper body, and our approach generalizes to the bidirectional conversion for the whole body. To generate more accurate and human-like dance postures, we integrate kinematic methods with physics-based simulation, resulting in more natural character animations generated from dance notations.
Multi-Stage Degradation and Content Embedding Fusion for Blind Super-Resolution
(The Eurographics Association, 2023) Zhang, Haiyang; Jiang, Mengyu; Liu, Liang; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
To achieve promising results on blind image super-resolution (SR), some Unsupervised Degradation Prediction (UDP) methods narrow the domain gap between the degradation embedding space and the SR feature space by fusing the degradation embedding with the additional content embedding before multi-stage SR. However, fusing these two embeddings before multi-stage SR is inflexible, due to the variation of the domain gap at each SR stage. To address this issue, we propose the Multi-Stage Degradation and Content Embedding Fusion (MDCF), which adaptively fuses the degradation embedding with the content embedding at each SR stage rather than before multi-stage SR. Based on the MDCF, we introduce a novel UDP method, called MDCFnet, which contains an additional Dual-Path Local and Global encoder (DPLG) to extract the degradation embedding and the content embedding separately. Specially, DPLG diversifies receptive fields to enrich the degradation embedding and combines local and global features to optimize the content embedding. Extensive experiments on real images and several benchmarks demonstrate that the proposed MDCFnet can outperform the existing UDP methods and achieve competitive performance on PSNR and SSIM even compared with the state-of-the-art SKP methods.
A Style Transfer Network of Local Geometry for 3D Mesh Stylization
(The Eurographics Association, 2023) Kang, Hongyuan; Dong, Xiao; Guo, Xufei; Cao, Juan; Chen, Zhonggui; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Style transfer of images develops rapidly, however, only a few studies focus on geometric style transfer on 3D models. In this paper, we propose a style learning network to synthesize local geometric textures with similar styles on source mesh, driven by specific mesh or image features. Our network modifies a source mesh by predicting the displacement of vertices along the normal direction to generate geometric details. To constrain the style of the source mesh to be consistent with a specific style mesh, we define a style loss on 2D projected images of two meshes based on a differentiable renderer. We extract a set of global and local features from multiple views of 3D models via a pre-trained VGG network, driving the deformation of the source mesh based on the style loss. Our network is flexible in style learning as it can extract features from meshes and images to guide geometric deformation. Experiments verify the robustness of the proposed network and show the outperforming results of transferring multiple styles to the source mesh. We also conduct experiments to analyze the effectiveness of network design.
Automatic Vector Caricature via Face Parametrization
(The Eurographics Association, 2023) Madono, Koki; Hold-Geoffroy, Yannick; Li, Yijun; Ito, Daichi; Echevarria, Jose; Smith, Cameron; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Automatic caricature generation is a challenging task that aims to emphasize the subject's facial characteristics while preserving its identity. Due to the complexity of the task, caricatures could exclusively be performed by a trained artist. Recent developments in deep learning have achieved promising results in capturing artistic styles. Despite the success, current methods still struggle to accurately capture the whimsical aspect of caricatures while preserving identity. In this work, we propose Parametric Caricature, the first parametric-based caricature generation that yields vectorized and animatable caricatures. We devise several hundred parameters to encode facial traits, which our method directly predicts instead of estimating the raster caricature like previous methods. To guide the attention of the method, we segment the different parts of the face and retrieve the most similar parts from an artist-made database of caricatures. Our method proposes visually appealing caricatures more adapted to use as avatars than existing methods, as demonstrated by our user study.
Local Positional Encoding for Multi-Layer Perceptrons
(The Eurographics Association, 2023) Fujieda, Shin; Yoshimura, Atsushi; Harada, Takahiro; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
A multi-layer perceptron (MLP) is a type of neural networks which has a long history of research and has been studied actively recently in computer vision and graphics fields. One of the well-known problems of an MLP is the capability of expressing highfrequency signals from low-dimensional inputs. There are several studies for input encodings to improve the reconstruction quality of an MLP by applying pre-processing against the input data. This paper proposes a novel input encoding method, local positional encoding, which is an extension of positional and grid encodings. Our proposed method combines these two encoding techniques so that a small MLP learns high-frequency signals by using positional encoding with fewer frequencies under the lower resolution of the grid to consider the local position and scale in each grid cell. We demonstrate the effectiveness of our proposed method by applying it to common 2D and 3D regression tasks where it shows higher-quality results compared to positional and grid encodings, and comparable results to hierarchical variants of grid encoding such as multi-resolution grid encoding with equivalent memory footprint.
Text2Mat: Generating Materials from Text
(The Eurographics Association, 2023) He, Zhen; Guo, Jie; Zhang, Yan; Tu, Qinghao; Chen, Mufan; Guo, Yanwen; Wang, Pengyu; Dai, Wei; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Specific materials are often associated with a certain type of objects in the real world. They simulate the way the surface of the object interacting with light and are named after that type of object. We observe that the text labels of materials contain advanced semantic information, which can be used as a guidance to assist the generation of specific materials. Based on that, we propose Text2Mat, a text-guided material generation framework. To meet the demand of material generation based on text descriptions, we construct a large set of PBR materials with specific text labels. Each material contains detailed text descriptions that match the visual appearance of the material. Furthermore, for the sake of controlling the texture and spatial layout of generated materials through text, we introduce texture attribute labels and extra attributes describing regular materials. Using this dataset, we train a specific neural network adapted from Stable Diffusion to achieve text-based material generation. Extensive experiments and rendering effects demonstrate that Text2Mat can generate materials with spatial layout and texture styles highly corresponding to text descriptions.
Generalizable Dynamic Radiance Fields For Talking Head Synthesis With Few-shot
(The Eurographics Association, 2023) Dang, Rujing; Wang, Shaohui; Wang, Haoqian; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Audio-driven talking head generation has wide applications in virtual games, hosts, online meetings, etc. Recently, great achievements have been made in synthesizing talking heads based on neural radiance fields. However, the existing few-shot talking head synthesis methods still suffer from inaccurate deformation and lack of visual consistency. Therefore, we propose a Generalizable Dynamic Radiance Field (GDRF), which can rapidly generalize to unseen identities with few-shot. We introduce a warping module with 3D constraints to act in feature volume space, which is identity adaptive and exhibits excellent shape-shifting abilities. Our method can generate more accurately deformed and view consistent target images compared to previous methods. Furthermore, we map the audio signal to 3DMM parameters by applying an LSTM network, which helps get long-term context and generate more continuous and natural video. Extensive experiments demonstrate the superiority of our proposed method.
Sketch-to-Architecture: Generative AI-aided Architectural Design
(The Eurographics Association, 2023) Li, Pengzhi; Li, Baijuan; Li, Zhiheng; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Recently, the development of large-scale models has paved the way for various interdisciplinary research, including architecture. By using generative AI, we present a novel workflow that utilizes AI models to generate conceptual floorplans and 3D models from simple sketches, enabling rapid ideation and controlled generation of architectural renderings based on textual descriptions. Our work demonstrates the potential of generative AI in the architectural design process, pointing towards a new direction of computer-aided architectural design.
Avatar Emotion Recognition using Non-verbal Communication
(The Eurographics Association, 2023) Bazargani, Jalal Safari; Sadeghi-Niaraki, Abolghasem; Choi, Soo-Mi; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Among the sources of information about emotions, body movements, recognized as ''kinesics'' in non-verbal communication, have received limited attention. This research gap suggests the need to investigate suitable body movement-based approaches for making communication in virtual environments more realistic. Therefore, this study proposes an automated emotion recognition approach suitable for use in virtual environments. This study consists of two pipelines for emotion recognition. For the first pipeline, i.e., upper-body keypoint-based recognition, the HEROES video dataset was employed to train a bidirectional long short-term memory model using upper-body keypoints capable of predicting four discrete emotions: boredom, disgust, happiness, and interest, achieving an accuracy of 84%. For the second pipeline, i.e., wrist-movement-based recognition, a random forest model was trained based on 17 features computed from acceleration data of wrist movements along each axis. The model achieved an accuracy of 63% in distinguishing three discrete emotions: sadness, neutrality, and happiness. The findings suggest that the proposed approach is a noticeable step toward automated emotion recognition, without using any additional sensors other than the head mounted display (HMD).
Hand Shadow Art: A Differentiable Rendering Perspective
(The Eurographics Association, 2023) Gangopadhyay, Aalok; Singh, Prajwal; Tiwari, Ashish; Raman, Shanmuganathan; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Shadow art is an exciting form of sculptural art that produces captivating artistic effects through the 2D shadows cast by 3D shapes. Hand shadows, also known as shadow puppetry or shadowgraphy, involve creating various shapes and figures using your hands and fingers to cast meaningful shadows on a wall. In this work, we propose a differentiable rendering-based approach to deform hand models such that they cast a shadow consistent with a desired target image and the associated lighting configuration. We showcase the results of shadows cast by a pair of two hands and the interpolation of hand poses between two desired shadow images. We believe that this work will be a useful tool for the graphics community.
TreeGCN-ED: A Tree-Structured Graph-Based Autoencoder Framework For Point Cloud Processing
(The Eurographics Association, 2023) Singh, Prajwal; Tiwari, Ashish; Sadekar, Kaustubh; Raman, Shanmuganathan; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Point cloud is a widely used technique for representing and storing 3D geometric data. Several methods have been proposed for processing point clouds for tasks such as 3D shape classification and clustering. This work presents a tree-structured autoencoder framework to generate robust embeddings of point clouds through hierarchical information aggregation using graph convolution. We visualize the t-SNE map to highlight the ability of learned embeddings to distinguish between different object classes. We further demonstrate the robustness of these embeddings in applications such as point cloud interpolation, completion, and single image-based point cloud reconstruction. The anonymized code is available here for research purposes.
Reconstructing Baseball Pitching Motions from Video
(The Eurographics Association, 2023) Kim, Jiwon; Kim, Dongkwon; Yu, Ri; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Baseball is one of the most loved sports in the world. In baseball game, the pitcher's control ability is a key factor for determining the outcome of the game. There are a lot of video data shooting baseball games, and learning baseball pitching motions from video can be possible thanks to the pose estimation techniques. However, reconstructing pitching motions using pose estimators is challenging. When we watch a baseball game, motion blur occurs inevitably because the pitcher throws a ball into the strike zone as fast as possible. To tackle this problem, We propose a framework using physics simulation and deep reinforcement learning to reconstruct baseball pitching motions based on unsatisfactory poses estimated from video. We set the target point and design rewards to encourage the character to throw the ball to the target point. Consequently, we can reconstruct plausible pitching motion.
A Simple Stochastic Regularization Technique for Avoiding Overfitting in Low Resource Image Classification
(The Eurographics Association, 2023) Ji, Ya Tu; Wang, Bai Lun; Ren, Qing Dao Er Ji; Shi, Bao; Wu, Nier E.; Lu, Min; Liu, Na; Zhuang, Xu Fei; Xu, Xuan Xuan; Wang, Li; Dai, Ling Jie; Yao, Miao Miao; Li, Xiao Mei; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Drop type technique, as a method that can effectively regulate the co-adaptations and prediction ability of neural network units, is widely used in model parameter optimization to reduce overfitting problems. However, low resource image classification faces serious overfitting problems, and the data sparsity problem weakens or even disappears the effectiveness of most regularization methods. This paper is inspired by the value iteration strategy and attempts a Drop type method based on Metcalfe's law, named Metcalfe-Drop. The experimental results indicate that using Metcalfe-Drop technique as a basis to determine parameter sharing is more effective than randomly controlling neurons according to a certain probability. Our code is available at https://gitee.com/giteetu/metcalfe-drop.git.
Multi-scale Monocular Panorama Depth Estimation
(The Eurographics Association, 2023) Mohadikar, Payal; Fan, Chuanmao; Zhao, Chenxi; Duan, Ye; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Panorama images are widely used for scene depth estimation as they provide comprehensive scene representation. The existing deep-learning monocular panorama depth estimation networks produce inconsistent, discontinuous, and poor-quality depth maps. To overcome this, we propose a novel multi-scale monocular panorama depth estimation framework. We use a coarseto- fine depth estimation approach, where multi-scale tangent perspective images, projected from 360 images, are given to coarse and fine encoder-decoder networks to produce multi-scale perspective depth maps, that are merged to get low and high-resolution 360 depth maps. The coarse branch extracts holistic features that guide fine branch extracted features using a Multi-Scale Feature Fusion (MSFF) module at the network bottleneck. The performed experiments on the Stanford2D3D benchmark dataset show that our model outperforms the existing methods, producing consistent, smooth, structure-detailed, and accurate depth maps.
Combining Transformer and CNN for Super-Resolution of Animal Fiber Microscopy Images
(The Eurographics Association, 2023) Li, Jiagen; Ji, Yatu; Lu, Min; Wang, Li; Dai, Lingjie; Xu, Xuanxuan; Wu, Nier; Liu, Na; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
The images of cashmere and wool fibers used for scientific research in the textile field are mostly acquired manually under an optical microscope. However, due to the interference of microscope quality, shooting environment, focal length selection, acquisition techniques and other factors, the quality of the obtained photographs tends to have a low resolution, and it is difficult to display the fine fiber texture structure and scale details. To address the above problems, a lightweight super-resolution reconstruction algorithm with multi-scale hierarchical screening is proposed. Specifically, firstly, a hybrid module incorporating SwinTransformer and enhanced channel attention is proposed to extract the global features and obtain the important localization among them, in addition, a multi-scale hierarchical screening filtering module is proposed based on the residual model, which amplifies the feature information focusing on high-frequency regions by splitting the channels to allow the model to adaptively weight the features according to the feature weights and amplifies the feature information focusing on high-frequency regions. Finally, the global average pooling attention module integrates and weights the high-frequency features again to enhance details such as edges and textures. A large number of experiments show that compared with other state-of-the-art algorithms, the proposed method significantly improves the image quality on the fiber dataset, and at the same time proves the effectiveness of the proposed method at all scales in five public datasets, occupies less memory parameters than SwinIR, and not only improves the PSNR and SSIM, but also reduces the parameters compared with the light-weight ESRT.
Detection of Impurities in Wool Based on Improved YOlOV8
(The Eurographics Association, 2023) Liu, Yang; Ji, Yatu; Ren, Qing Dao Er Ji; Shi, Bao; Zhuang, Xufei; Yao, Miaomiao; Li, Xiaomei; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
In the current production process of wool products, the cleaning of wool raw materials has been realized in an automated way. However, detecting whether the washed and dried wool still contains excessive impurities still requires manual testing. This method greatly reduces production efficiency. To solve the problem of detecting wool impurities, we propose an improved model based on YOLOv8. Our work applied some techniques to solve the low resource model training problem, and incorporated a block for small object detection into the new neural network structure. The newly proposed model achieved an accuracy of 84.3% on the self built dataset and also achieved good results on the VisDrone2019 dataset.
Color3d: Photorealistic Texture Mapping for 3D Mesh
(The Eurographics Association, 2023) Zhao, Chenxi; Fan, Chuanmao; Mohadikar, Payal; Duan, Ye; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
3D reconstruction plays a significant role in various fields, including medical imaging, architecture, and forensic science, in both research and industry. The quality of color is one of the criteria that determine reconstruction performance. However, the predicted color from deep learning often suffers from low quality and a lack of details. While traditional texture mapping methods can provide superior color, they are restricted by mesh quality. In this study, we propose Color3D, a comprehensive procedure that applies photorealistic colors to the reconstructed mesh, accommodating both static objects and animations. The necessary inputs include multiview RGB images, depth images, camera poses, and camera intrinsic. Compared to traditional methods, our approach replaces face colors directly from the texture map with vertex colors from multiview images. The colors of the faces are obtained by interpolating the vertex colors of each triangle. Our method can generate high-quality color for different objects, and the performance remains strong even when the input mesh is not perfect.
Progressive Graph Matching Network for Correspondences
(The Eurographics Association, 2023) Feng, Huihang; Liu, Lupeng; Xiao, Jun; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
This paper presents a progressive graph matching network shorted as PGMNet. The method is more explainable and can match features from easy to hard. PGMNet contains two major blocks: sinkformers module and guided attention module. First, we use sinkformers to get the similar matrix which can be seen as an assignment matrix between two sets of feature keypoints. Matches with highest scores in both rows and columns are selected as pre-matched correspondences. These pre-matched matches can be leveraged to guide the update and matching of ambiguous features. The matching quality can be progressively improved as the the transformer blocks go deeper as visualized in Figure 1. Experiments show that our method achieves better results with typical attention-based methods.
Emotion-based Interaction Technique Using User's Voice and Facial Expressions in Virtual and Augmented Reality
(The Eurographics Association, 2023) Ko, Beom-Seok; Kang, Ho-San; Lee, Kyuhong; Braunschweiler, Manuel; Zünd, Fabio; Sumner, Robert W.; Choi, Soo-Mi; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
This paper presents a novel interaction approach based on a user's emotions within augmented reality (AR) and virtual reality (VR) environments to achieve immersive interaction with virtual intelligent characters. To identify the user's emotions through voice, the Google Speech-to-Text API is used to transcribe speech and then the RoBERTa language processing model is utilized to classify emotions. In AR environment, the intelligent character can change the styles and properties of objects based on the recognized user's emotions during a dialog. On the other side, in VR environment, the movement of the user's eyes and lower face is tracked by VIVE Pro Eye and Facial Tracker, and EmotionNet is used for emotion recognition. Then, the virtual environment can be changed based on the recognized user's emotions. Our findings present an interesting idea for integrating emotionally intelligent characters in AR/VR using generative AI and facial expression recognition.
Revisiting Visualization Evaluation Using EEG and Visualization Literacy Assessment Test
(The Eurographics Association, 2023) Yim, Soobin; Jung, Chanyoung; Yoon, Chanyoung; Yoo, Sangbong; Choi, Seongwon; Jang, Yun; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Using EEG signals, also known as Electroencephalogram, can provide a quantitative measure of human cognitive load, making it an effective tool for evaluating visualization. However, the suitability of EEG for visualization evaluation has not been verified in previous studies. This paper investigates the feasibility of utilizing EEG data in visualization evaluation by comparing previous experiments. We trained and estimated individual CNN models for each subject using the EEG data. Our study demonstrates that EEG-based visualization evaluation provides a more feasible estimate of the difficulties experienced by subjects during the visualization task compared to previous studies that used accuracy and response time.
Visualization System for Analyzing Congestion Pricing Policies
(The Eurographics Association, 2023) Choi, SeokHwan; Seo, Seongbum; Yoo, Sangbong; Jang, Yun; Chaine, Raphaëlle; Deng, Zhigang; Kim, Min H.
Traffic congestion, which increases every year, has a negative impact on environmental pollution and productivity. Congestion pricing policy has been shown to be effective in Singapore, London, and Stockholm as one of the ways to solve traffic congestion. Pricing policy has different effects depending on a target area, pricing scheme, and toll. In general, congestion pricing policy researchers conduct statistical analysis of simulation model predictions within a fixed region and time range. However, existing research techniques make analyzing all traffic data characteristics with spatiotemporal dependency difficult. In this paper, we propose a visualization system for analyzing the influence of congestion pricing policy using SUMO and TCI. Our system provides a district-level analysis process to explore the influence of pricing policy over time and area.

BibTeX (PG2023 Short Papers and Posters)

Browse

Recent Submissions

Results Per Page

Sort Options