AKDT: Adaptive Kernel Dilation Transformer for Effective Image Denoising

Alexandru Brateanu; Raul Balmez; Adrian Avram; Ciprian Orhei

doi:10.5220/0013157700003912

AKDT: Adaptive Kernel Dilation Transformer for Effective Image Denoising

Alexandru Brateanu, Raul Balmez, Adrian Avram, Ciprian Orhei

2025

Abstract

Image denoising is a fundamental yet challenging task, especially when dealing with high-resolution images and complex noise patterns. Most existing methods rely on standard Transformer architectures, which often suffer from high computational complexity and limited adaptability to varying noise levels. In this paper, we introduce the Adaptive Kernel Dilation Transformer (AKDT), a novel Transformer-based model that fully harnesses the power of learnable dilation rates within convolutions. AKDT consists of several layers and custom-designed blocks, including our novel Learnable Dilation Rate (LDR) module, which is utilized to construct a Noise Estimator module (NE). At the core of AKDT, the NE is seamlessly integrated within standard Transformer components to form the Noise-Guided Feed-Forward Network (NG-FFN) and Noise-Guided Multi-Headed Self-Attention (NG-MSA). These noise-modulated Transformer components enable the model to achieve unparalleled denoising performance while significantly reducing computational costs. Extensive experiments across multiple image denoising benchmarks demonstrate that AKDT sets a new state-of-the-art, effectively handling both real and synthetic noise. The source code and pre-trained models are publicly available at https://github.com/albrateanu/AKDT.

Download

Paper Citation

in Harvard Style

Brateanu A., Balmez R., Avram A. and Orhei C. (2025). AKDT: Adaptive Kernel Dilation Transformer for Effective Image Denoising. In Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP; ISBN 978-989-758-728-3, SciTePress, pages 418-425. DOI: 10.5220/0013157700003912

in Bibtex Style

@conference{visapp25,
author={Alexandru Brateanu and Raul Balmez and Adrian Avram and Ciprian Orhei},
title={AKDT: Adaptive Kernel Dilation Transformer for Effective Image Denoising},
booktitle={Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP},
year={2025},
pages={418-425},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013157700003912},
isbn={978-989-758-728-3},
}

in EndNote Style

TY - CONF

JO - Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP
TI - AKDT: Adaptive Kernel Dilation Transformer for Effective Image Denoising
SN - 978-989-758-728-3
AU - Brateanu A.
AU - Balmez R.
AU - Avram A.
AU - Orhei C.
PY - 2025
SP - 418
EP - 425
DO - 10.5220/0013157700003912
PB - SciTePress