AKDT: Adaptive Kernel Dilation Transformer for Effective Image Denoising
Alexandru Brateanu, Raul Balmez, Adrian Avram, Ciprian Orhei
2025
Abstract
Image denoising is a fundamental yet challenging task, especially when dealing with high-resolution images and complex noise patterns. Most existing methods rely on standard Transformer architectures, which often suffer from high computational complexity and limited adaptability to varying noise levels. In this paper, we introduce the Adaptive Kernel Dilation Transformer (AKDT), a novel Transformer-based model that fully harnesses the power of learnable dilation rates within convolutions. AKDT consists of several layers and custom-designed blocks, including our novel Learnable Dilation Rate (LDR) module, which is utilized to construct a Noise Estimator module (NE). At the core of AKDT, the NE is seamlessly integrated within standard Transformer components to form the Noise-Guided Feed-Forward Network (NG-FFN) and Noise-Guided Multi-Headed Self-Attention (NG-MSA). These noise-modulated Transformer components enable the model to achieve unparalleled denoising performance while significantly reducing computational costs. Extensive experiments across multiple image denoising benchmarks demonstrate that AKDT sets a new state-of-the-art, effectively handling both real and synthetic noise. The source code and pre-trained models are publicly available at https://github.com/albrateanu/AKDT.
DownloadPaper Citation
in Harvard Style
Brateanu A., Balmez R., Avram A. and Orhei C. (2025). AKDT: Adaptive Kernel Dilation Transformer for Effective Image Denoising. In Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP; ISBN 978-989-758-728-3, SciTePress, pages 418-425. DOI: 10.5220/0013157700003912
in Bibtex Style
@conference{visapp25,
author={Alexandru Brateanu and Raul Balmez and Adrian Avram and Ciprian Orhei},
title={AKDT: Adaptive Kernel Dilation Transformer for Effective Image Denoising},
booktitle={Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP},
year={2025},
pages={418-425},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013157700003912},
isbn={978-989-758-728-3},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP
TI - AKDT: Adaptive Kernel Dilation Transformer for Effective Image Denoising
SN - 978-989-758-728-3
AU - Brateanu A.
AU - Balmez R.
AU - Avram A.
AU - Orhei C.
PY - 2025
SP - 418
EP - 425
DO - 10.5220/0013157700003912
PB - SciTePress