Paper

Authors: Igor Vozniak ; Pavel Astreika ; Philipp Müller ; Nils Lipp ; Christian Müller and Philipp Slusallek

Affiliation: German Research Center for Artificial Intelligence, Stuhlsatzenhausweg 3 (Campus D3 2), Saarbrücken, Germany

Keyword(s): Voxel Grid, 3D Convolutions, Voxel Grid Representation, High-Definition Voxel Grid, Reconstruction.

Abstract: Voxel grids are an effective means to represent 3D data, as they accurately preserve spatial relations. However, the inherent sparseness of voxel grid representations leads to significant memory consumption in deep learning architectures, in particular for high-resolution (HD) inputs. As a result, current state-of-the-art approaches to the reconstruction of 3D data tend to avoid voxel grid inputs. In this work, we propose HD-VoxelFlex, a novel 3D CNN architecture that can be flexibly applied to HD voxel grids with only a moderate increase in training parameters and memory consumption. HD-VoxelFlex introduces three architectural novelties. First, to improve the model's generalizability, we introduce a random shuffling layer. Second, to reduce information loss, we introduce a novel reducing skip connection layer. Third, to improve modelling of local structure, which is crucial for HD inputs, we incorporate a kNN distance mask as input. We combine these novelties with a “bag of tricks” identified in a comprehensive literature review. Based on these novelties, we propose six novel building blocks for our encoder-decoder HD-VoxelFlex architecture. In evaluations on the ModelNet10/40 and PCN datasets, HD-VoxelFlex outperforms the state of the art in all point cloud reconstruction metrics. We show that HD-VoxelFlex is able to process high-definition (128³, 192³) voxel grid inputs at much lower memory consumption than previous approaches. Furthermore, we show that HD-VoxelFlex, without additional fine-tuning, demonstrates competitive performance in the classification task, proving its generalization ability. As such, our results underline the neglected potential of voxel grid input for deep learning architectures.
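The sparseness and memory argument in the abstract can be made concrete with a small, illustrative calculation. The sketch below is not taken from the paper: it assumes a dense single-channel float32 grid, uses points sampled on a unit sphere as a hypothetical stand-in for a scanned object, and all function names are invented for this example. It only shows how the occupied fraction of the grid shrinks while the dense memory footprint grows cubically with resolution.

```python
import math
import random


def dense_grid_bytes(resolution, bytes_per_voxel=4):
    """Bytes for one dense grid of resolution^3 voxels (float32 assumed)."""
    return resolution ** 3 * bytes_per_voxel


def voxelize(points, resolution):
    """Map points in [-1, 1]^3 to the set of occupied voxel indices."""
    occupied = set()
    for point in points:
        # Scale each coordinate to [0, resolution] and clamp the upper edge.
        idx = tuple(
            min(resolution - 1, int((c + 1.0) / 2.0 * resolution))
            for c in point
        )
        occupied.add(idx)
    return occupied


def sample_sphere(n, seed=0):
    """Hypothetical test object: n points drawn uniformly on a unit sphere."""
    rng = random.Random(seed)
    points = []
    for _ in range(n):
        z = rng.uniform(-1.0, 1.0)
        phi = rng.uniform(0.0, 2.0 * math.pi)
        r = math.sqrt(1.0 - z * z)
        points.append((r * math.cos(phi), r * math.sin(phi), z))
    return points


def occupancy_report(points, resolutions=(32, 64, 128, 192)):
    """Return (resolution, occupied voxels, occupied fraction, dense bytes)."""
    report = []
    for res in resolutions:
        n_occupied = len(voxelize(points, res))
        report.append((res, n_occupied, n_occupied / res ** 3, dense_grid_bytes(res)))
    return report


if __name__ == "__main__":
    for res, n_occ, frac, mem in occupancy_report(sample_sphere(2048)):
        print(f"{res}^3: {n_occ} occupied voxels, {frac:.4%} of the grid, "
              f"{mem / 2**20:.1f} MiB dense")
```

Under these assumptions, a single dense float32 grid takes 8 MiB at 128³ and 27 MiB at 192³, before any batching or intermediate feature maps, while a 2048-point cloud can occupy at most 2048 of those voxels.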

CC BY-NC-ND 4.0

Paper citation in several formats:
Vozniak, I.; Astreika, P.; Müller, P.; Lipp, N.; Müller, C. and Slusallek, P. (2024). HD-VoxelFlex: Flexible High-Definition Voxel Grid Representation. In Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP; ISBN 978-989-758-679-8; ISSN 2184-4321, SciTePress, pages 204-219. DOI: 10.5220/0012374800003660

@conference{visapp24,
author={Igor Vozniak and Pavel Astreika and Philipp Müller and Nils Lipp and Christian Müller and Philipp Slusallek},
title={HD-VoxelFlex: Flexible High-Definition Voxel Grid Representation},
booktitle={Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP},
year={2024},
pages={204-219},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012374800003660},
isbn={978-989-758-679-8},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP
TI - HD-VoxelFlex: Flexible High-Definition Voxel Grid Representation
SN - 978-989-758-679-8
IS - 2184-4321
AU - Vozniak, I.
AU - Astreika, P.
AU - Müller, P.
AU - Lipp, N.
AU - Müller, C.
AU - Slusallek, P.
PY - 2024
SP - 204
EP - 219
DO - 10.5220/0012374800003660
PB - SciTePress