Efficiency Optimization Strategies for Point Transformer Networks
Jannis Unkrig, Markus Friedrich
2024
Abstract
The Point Transformer, and especially its successor Point Transformer V2, are among the state-of-the-art architectures for point cloud processing in terms of accuracy. However, like many other point cloud processing architectures, they suffer from the inherently irregular structure of point clouds, which makes efficient processing computationally expensive. Common workarounds include reducing the point cloud density, or cropping out partitions, processing them sequentially, and then stitching them back together. However, those approaches inherently limit the architecture by either providing less detail or less context. This work provides strategies that directly address efficiency bottlenecks in the Point Transformer architecture, and therefore allows processing larger point clouds in a single feed-forward operation. Specifically, we propose using uniform point cloud sizes in all stages of the architecture, a k-D tree-based k-nearest neighbor search algorithm that is not only efficient on large point clouds, but also generates intermediate results that can be reused for downsampling, and a technique for normalizing local densities which improves overall accuracy. Furthermore, our architecture is simpler to implement and does not require custom CUDA kernels to run efficiently.
DownloadPaper Citation
in Harvard Style
Unkrig J. and Friedrich M. (2024). Efficiency Optimization Strategies for Point Transformer Networks. In Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP; ISBN 978-989-758-679-8, SciTePress, pages 65-76. DOI: 10.5220/0012325000003660
in Bibtex Style
@conference{visapp24,
author={Jannis Unkrig and Markus Friedrich},
title={Efficiency Optimization Strategies for Point Transformer Networks},
booktitle={Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP},
year={2024},
pages={65-76},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012325000003660},
isbn={978-989-758-679-8},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP
TI - Efficiency Optimization Strategies for Point Transformer Networks
SN - 978-989-758-679-8
AU - Unkrig J.
AU - Friedrich M.
PY - 2024
SP - 65
EP - 76
DO - 10.5220/0012325000003660
PB - SciTePress