Enhancing Image Quality to Improve Medical Image Classiﬁcation:

Application to Nuclear Medicine Planar Images

Ouassim Boukhennoufa

1 a

, Laurent Comas

2,3 b

, Jean-Marc Nicod

1 c

Noureddine Zerhouni

1 d

and Hatem Boulahdour

2,3 e

SUPMICROTECH, CNRS, Institut FEMTO-ST, F-25000, Besanc¸on, France

CHU Besanc¸on, M

edecine Nucl

eaire, F-25000, Besanc¸on, France

Universit

e de Franche-Comt

e, SINERGIES, F-25000, Besanc¸on, France

ouassim.boukhennoufa@univ-fcomte.fr, lcomas@chu-besancon.fr, {jean-marc.nicod, Noureddine.Zerhouni}@ens2m.fr,

Keywords:

Image Quality Enhancement, Artiﬁcial Intelligence, Nuclear Medicine, Medical Imaging, Parathyroid Glands.

Abstract:

Nuclear Medicine images are obtained by injecting small amounts of radio-tracers into the body to track

speciﬁc organs. Particular cameras detect radiations emitted from the radio-tracers resulting in images that

visualize the function of the organs rather than their structure. The association of the cameras and radio-

tracers causes low resolution and low signal-to-noise ratio, therefore, the images are often of poor quality.

Image Quality Enhancement (IQE) is one possible solution to this problem as it improves the clarity of the

images by removing noise and correcting distortions. In this paper, we propose a methodology based on

artiﬁcial intelligence (AI) with the integration of an IQE step for the detection of normal/abnormal parathyroid

glands. Two different IQE techniques are employed, one based on a statistical ﬁlter and the other on AI.

The enhanced images are processed with a Convolutional Neural Network (CNN), and Lasso regression for

features extraction and selection. Finally, several AI models are used for binary image classiﬁcation. The

obtained results achieved an accuracy of 83% in distinguishing normal/abnormal parathyroid glands. IQE step

signiﬁcantly improves the accuracy of the AI model by 16.9% over the initial accuracy of 71%, demonstrating

the importance of IQE in assessing image classiﬁcation performance.

1 INTRODUCTION

HyperParaThyroidism (HPT), is a prevalent en-

docrine disorder typiﬁed by elevated or anoma-

lous parathyroid hormone levels and hypercal-

cemia (Walker and Silverberg, 2018). HPT

can impact one or more Parathyroid Glands

(PGs) (Bilezikian et al., 2016). PGs are typi-

cally 3-5 mm in size, and conventional imaging

methods are unable to detect them (Sung, 2015). To

give an accurate diagnosis, physicians refer to clinical

data, Nuclear Medicine (NM) images, but also other

image modalities such as ultrasonography (Khan

et al., 2017). There are limitations in some image

modalities such as ultrasound to locate very small

https://orcid.org/0000-0002-9193-9552

https://orcid.org/0009-0008-9338-7529

https://orcid.org/0000-0001-9521-6215

https://orcid.org/0000-0002-8847-3202

https://orcid.org/0000-0001-5834-5720

PGs (Lee et al., 2021).

Nuclear medicine (NM) is one remedy to this is-

sue and improve the accuracy of physicians’ diag-

noses. NM is a ﬁeld of medicine that uses radio-

pharmaceuticals to diagnose and evaluate the func-

tioning of the body (Nieciecki et al., 2015). In NM,

several image acquisition techniques exist for PGs

detection in NM, double isotope and double phase

are the most common. The ﬁrst involves the injec-

tion of two isotopes, followed by image acquisition,

while the second implicates the injection of one iso-

tope followed by two time-domain acquisitions (Pe-

tranovi

c Ov

cari

cek et al., 2021). In the case of double

isotope, physicians proceed with image subtraction

after normalization to obtain an image with only PGs

for diagnosis (Tlili et al., 2023) (Petranovi

c Ov

cari

cek

et al., 2021).

The problem with NM images is the limitation

of the information as they are very weak in signal,

therefore, the images are highly noisy (Kim et al.,

2020). Researchers developed different medical as-

Boukhennoufa, O., Comas, L., Nicod, J.-M., Zerhouni, N. and Boulahdour, H.

Enhancing Image Quality to Improve Medical Image Classiﬁcation: Application to Nuclear Medicine Planar Images.

DOI: 10.5220/0013113800003911

Paper published under CC license (CC BY-NC-ND 4.0)

In Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2025) - Volume 1, pages 303-310

ISBN: 978-989-758-731-3; ISSN: 2184-4305

303

sistance tools based on Deep Learning (DL) to assist

physicians in their diagnosis even if the related works

remain limited. The study in (Yoshida et al., 2022)

proposed a transfer learning methodology by apply-

ing a pre-trained RetinaNet (Lin et al., 2017) model

to dual-phase

99m

Tc-sestamibi images. The work in-

cluded 281 patients with conﬁrmed HPT, distributed

as 92 for training, 45 for validation, and 44 for testing.

The model achieved a sensitivity of 82% and a mean

false positive indication of 0.44.

The authors in (Boukhennoufa et al., 2024) devel-

oped a medical assistance tool to automatically sub-

tract the dual isotope

99m

Tc-sestamibi and

123

I im-

ages. The methodology consisted of combining the

images with statistical features such as kurtosis and

entropy extracted from each image. The images were

normalized and processed with a CNN model for fea-

ture extraction, whereas the statistical features were

processed with a random forest model for the same

objective as with the images. The combination of the

extracted features was processed with a support vector

machine to predict a subtraction factor that was used

to compute a subtracted image. The results yielded to

a mean correlation of 0.95 with the reference images

(performed by physicians).

At the preoperative level with

99m

Tc-sestamibi

single-photon emission computed tomography, the

study (Sandqvist et al., 2022) proposed to predict

the presence of overlooked parathyroid hormone us-

ing six predictors such as calcium level and parathy-

roid hormone. The data were extracted from 349 pa-

tients with conﬁrmed primary HPT or multi-glandular

disease, and a decision tree with Bayesian hyperpa-

rameter optimization methodology was employed for

the classiﬁcation purpose. A 5-Fold cross-validation

technique was used, where it achieved a true-positive

prediction rate of 72% for multi-glandular cases and a

misclassiﬁcation rate of 6% for primary HPT patients.

In another study using clinical data (Samaras

et al., 2024), the objective was to distinguish patients

with primary HPT and Multi-Glandular Disease with

an explainable machine learning methodology. The

data were extracted from 134 patients and were highly

imbalanced: 26 patients with MGD, and the rest with

primary HPT, this issue was solved randomly by over-

sampling the MGD class. The used data were gender,

age, size of the abnormal gland, number of affected

glands, and multiplication of the parathyroid hormone

with the calcium level in blood. A benchmarking of

several ML algorithms such as LightGBM and sup-

port vector machine was proposed with a SHAP ex-

plainability methodology. The reported results indi-

cated that the oversampling methodology contributed

to assessing the speciﬁcity from 66.67% to 81.48%.

The different works showed promising results in

HPT diagnosis using different types of data. Never-

theless, the problem of noisy images is still not yet

addressed. One remedy is to apply Image Quality En-

hancement (IQE): an important step to improve the

precision of DL models. IQE consists of removing

noise and improving the brightness of images to high-

light important features (Bhardwaj et al., 2018). To

the best of our knowledge, there is no other study in-

cluding an IQE step for PGs detection.

In this paper, we propose a new process for nor-

mal/abnormal PGs detection by applying two IQE

techniques on only dual-isotope static images. The

ﬁrst employs a statistical ﬁlter approach, while the

second is based on AI. The ﬁltered images are passed

into a DL model and a Lasso regression for feature ex-

traction and selection. Finally, AI models are bench-

marked for image classiﬁcation.

The remainder of the paper is organized as fol-

lows: Section 2 details the used data and the proposed

methodology. Section 3 presents and discusses the

results of the proposed approach. The work is con-

cluded in Section 4.

2 MATERIALS AND METHODS

In this section, the patient’s demographics, the used

data, and the proposed methodology are explained.

2.1 Patients and Data Characteristics

Between June 2012 and December 2023, 923 pa-

tients underwent dual isotope (

99m

Tc-sestamibi/

123

planar scintigraphy. The acquisitions were realized

on a Discovery NMCT 670 or an Inﬁna GE Health-

care. First, 18.5 MBq +-10% of

123

I was adminis-

tered intravenously. 3h later, 740 MBq +/- 10% of

99m

Tc-sestamibi was injected. 5 min later, the acquisi-

tion started with planar imaging of the thyroid region

(PINHOLE) for 10 min and mediastinum Low Energy

High Resolution (LEHR) for 5 min (dual isotope set-

ting with photopeak’s centered over 140,5 keV +/-

7,5% and 159 keV -5% + 10% window for

99m

Tc-

sestamibi and

123

I, respectively). A subtraction of the

two PINHOLE images is performed using XELERIS

Software by the physicians. They begin by extract-

ing the thyroid from the

123

I image, which is merged

with the

99m

Tc-sestamibi image. Next, the images

are normalized according to the thyroid intensity in

both images. This ensures that both images are uni-

formly scaled. The subtraction is performed and ad-

justed then between the two normalized images (Tlili

et al., 2023). This results in a Subtracted Image (SI),

BIOIMAGING 2025 - 12th International Conference on Bioimaging

304

Raw Images

Input

Images

Image

normalization

Image pre-

processing

Image Quality

Enhancement

Image

processing

Convolutional

Neural Network

Feature

Extraction

Lasso regression

Feature

selection

Normal/abnormal

glands detection

modeling

• Kurtosis

• Mean

• Std

• Entropy

Statistical

Features

Figure 1: Overview of the proposed methodology.

used by the physicians to identify abnormal PGs.

Due to the absence of images in the Picture

Archive and Communication Systems, scintigraphy

with no thyroid ﬁxation in some cases, and cases

judged doubtful case by physicians, 215 patients were

removed. The characteristics are summarized in Ta-

ble 1. In this study, 5 images per patient are used:

LEHR of

99m

Tc-sestamibi and

123

I, PINHOLE of

99m

Tc-sestamibi and

123

I, and the Subtracted Image.

The image size is 128 × 128 pixels with a grayscale

channel.

Table 1: Patient’s characteristics summary.

Patients Age Gender Weight Height

number range F:M range range

708 [18 − 99] 601:222 [40 − 167] [143 − 189]

2.2 Proposed Methodology

Figure 1 highlights the overall process of the proposed

methodology, starting with raw images as inputs to

the AI modeling where the objective is to detect ab-

normal PGs.

From raw images, statistical features are extracted

and are used later in the process: kurtosis, mean pixel

value, standard deviation, and entropy. The objective

of these features is to help the AI model in PGs di-

agnosis. These characteristics can help the model un-

derstand extracting patterns between images of nor-

mal/abnormal PGs cases. For instance, in the im-

ages of abnormal glands, the mean pixel value may

be larger than in normal glands. Also, since the quan-

tity of used data is not very high, adding these features

means adding more data, hence, enlarging the popu-

lation. The next step of the methodology consists of

image normalization, as explained in Subsection 2.1,

the normalization performed by physicians is one im-

portant step in order to subtract the images. Also, in

the AI pipeline, image normalization is a very impor-

tant step to increase performance and speed up the

learning convergence. For these reasons, the images

in this study are normalized using the MinMax nor-

malization method (Zhang et al., 2024). Speciﬁcally,

each image type is normalized globally, meaning that

the images are scaled in function of the minimum and

maximum pixel values of the whole dataset. For ex-

ample, in LEHR

99m

Tc-sestamibi of each patient, the

images are normalized with the minimum and max-

imum pixel values of all the LEHR

99m

Tc-sestamibi

images. This ensures that the images are normalized

uniformly and the characteristics of all the data are

taken into account. Equation 1 represents the nor-

malization, where NI, RI, t, n represents Normalized

Image, Raw Image, one of the image types, and the

number of patients, respectively.

MinMax

− min(RI

, ...RI

)

max(RI

, ...RI

) − min(RI

, ...RI

)

(1)

The second step of the process is about enhancing

the quality of images, once the images are normal-

ized, they’re processed with two different techniques

and compared differently according to the diagnosis

results. The ﬁrst is based on a statistical ﬁlter called

Non-Local Means Denoising (NLMD) (Buades et al.,

2011). The latter is an image processing technique for

IQE, it reduces the noise by averaging similar pixels

throughout the whole image. In opposition to other

statistical ﬁlters that consider only neighborhood pix-

els, NLMD locates patches of pixels with similar pat-

terns across all the images. These patterns are used to

remove noise more effectively, but also preserve the

details of the images. NLMD is a form of segmen-

tation as it merges the pixels with similar characteris-

tics into patches. NLMD computes the denoised pixel

value by using a weighted average of all the image

pixels as shown in Equation 2. Where i and j are po-

sitions of the pixel values.

P, P are the denoised pixel

value at position i, original pixel value at position j,

Enhancing Image Quality to Improve Medical Image Classiﬁcation: Application to Nuclear Medicine Planar Images

305

Figure 2: SCUNet architecture from (Zhang et al., 2023). Licensed under a Creative Commons Attribution 4.0 International

License (CC BY 4.0). No modiﬁcations were made.

respectively. w is the similarity weight between pixels

at position i and j. Finally, C represents a normaliza-

tion factor ensuring that weight sum to 1 shown in

Equation3.

P(i) =

C(i)

∑

w(i, j)P( j) (2)

The weights are computed with an Euclidean distance

as represented in Equation 3, where I(i) and I( j) are

patches centered around i and j. A ﬁltering parameter

h is used to control the degree of smoothing, deﬁned

as 0.8 in this study.

w(i, j) = e

−

∥

I(i)−I( j)

∥

, C(i) =

∑

w(i, j) (3)

NLMD ensures that pixels similar to one in the posi-

tion i contribute more to its denoising value, reduc-

ing noise while preserving important details. For all

these reasons, NLMD is chosen in this work. The sec-

ond IQE technique is based on AI using a Swin-Conv-

UNet (SCUNet) denoising network. It is a combina-

tion of Swin transformers, Convolution Neural Net-

work (CNN), and U-NET model. A Swin-Conv (SC)

block is used as the main backbone of the U-NET

model (Zhang et al., 2023). SCUNet performs in a

segmentation way where it groups patches of similar

patterns. To the best of our knowledge, it is the state-

of-the-art image denoising model. For these reasons,

it is used in this study to provide a fair comparison

with NLMD as they both perform with the patch prin-

ciple. Figure 2 details the architecture of the SCUNet

model. The images are ﬁrst passed to a convolution

ﬁlter of size 3, followed by the SC block, the U-NET

model with residual connections, and ﬁnally, another

SC block and a convolution ﬁlter of size 3 to recon-

struct the denoised image.

The IQE methodologies are summarized in Fig-

ure 3 where IQE1 and IQE2 represent the NLMD

and SCUNet techniques, respectively. The two IQEs

are used separately for the ﬁve different images

(PINHOLES

99m

Tc-sestamibi,

123

I, and subtracted),

(LEHR

99m

Tc-sestamibi and

123

I).

Input Images

Lehr

99m

Tc-sestamibi

Lehr

123

Pinhole

99m

Tc-sestamibi

Pinhole

123

Pinhole

Subtracted

Filtered Images

Lehr

99m

Tc-sestamibi

Lehr

123

Pinhole

99m

Tc-sestamibi

Pinhole

123

Pinhole

Subtracted

Statistical

ﬁlter

IQE 1

Artiﬁcial

intelligence

IQE 2

Figure 3: Image Quality Enhancement process.

In the third step of the process and once the im-

ages are processed for IQE, a CNN is used for fea-

ture extraction, to decrease the size of the images and

preserve only the important features for further pro-

cessing. After a series of trials, the architecture in

Table 2 was found to be the best performing one for

the diagnosis purpose. The input layer is of shape

128 × 128 × 1 = 16384 features.

Table 2: CNN architecture for feature extraction.

Layer Hyperparameters

Conv2D - 32 ﬁlters of size 3,

relu function

MaxPooling2D - ﬁlter size of 2

Conv2D - 64 ﬁlters of size 3,

relu function

Batch Normalization /

MaxPooling2D - ﬁlter size of 2

Conv2D - 128 ﬁlters of size 3,

relu function

MaxPooling2D - ﬁlter size of 2

GlobalAveragePooling2D /

Dense 128 nodes, relu function

The fourth step of the methodology consists of ap-

BIOIMAGING 2025 - 12th International Conference on Bioimaging

306

plying the Least Absolute Shrinkage and Selection

Operator (Lasso) regression (Tibshirani, 1996) for

feature selection before feeding them to AI models for

classiﬁcation. Lasso is a form of linear regression that

aims to reduce and remove non-relevant/redundant

features to avoid over-ﬁtting. This is achieved by

shrinking some features to zero, leaving only the most

important ones. Lasso prevents over-ﬁtting problems

and can do both feature selection and regularization,

which leads to more generalized models. A pair

of features with corresponding labels (0 for normal

glands and 1 for abnormal). The objective is to min-

imize the function represented in Equation (4). y is

the label, X are the features, β a coefﬁcient for each

feature, n the number of patients, f the total number

of features.

minimize





∑

k=1

−

∑

z=1

+ λ

∑

z=1





(4)

Lasso is a modiﬁed version of linear regression by

the addition of the penalty term λ

∑

z=1

that helps

the model in the generalization process and avoids

over-ﬁtting.

The last step of the process consists of concate-

nating the statistical features extracted at the begin-

ning from the raw images and the features resulting

from the Lasso regression into a single matrix that is

used for binary classiﬁcation. The latter is achieved

by benchmarking several Machine Learning (ML) al-

gorithms. The training is consolidated with a 10-Fold

cross-validation technique. The data are divided into

10 folds, for 10 iterations, 9 folds are used for the

model training and 1 fold for validation. This ensures

that all the data are used at least one time for training

and once for testing, it also reduces the variance in the

performance providing trustful results, more reliable

model evaluation, and a robust model.

With this proposed methodology, it is ensured that

only important features from the enhanced images are

used with an addition of the statistical information to

enrich the training of the ML models with the objec-

tive of improving the diagnosis. Also, the quantity of

data is not very high, for these reasons, the choice of

ML models for image classiﬁcation is evident.

3 RESULTS AND DISCUSSIONS

In this section, the results of the proposed methodolo-

gies are detailed with discussions.

3.1 Image Quality Enhancement

In order to evaluate the IQE techniques, the two most

common metrics for such tasks are employed: Peak

Signal-to-Noise Ratio (PSNR), and Structural Simi-

larity Index Measure(SSIM). Both metrics are used

to evaluate the image-denoising, PSNR measures the

quality of a denoised/reconstructed image compared

to the original. A value higher than 30dB indicates

a high image quality enhancement, whereas a lower

value expresses a lower IQE. SSIM measures the sim-

ilarity between two images by comparing the lumi-

nance, contrast, and structure. Its values range be-

tween [0 − 1], from poor quality to very high quality.

Table 3 shows the obtained results applying the two

metrics on the used data, the values are the mean

across the whole data with the 5 different image types.

Table 3: Performance evaluation of IQE techniques. P:

PINHOLE, L: LEHR.

Image type

IQE

technique

PSNR SSIM

99m

Tc-sestamibi

IQE1 38.54 0.98

IQE2 32.65 0.91

123

IQE1 37.46 0.97

IQE2 31.52 0.79

Subtracted P.

IQE1 37.85 0.97

IQE2 32.42 0.85

99m

Tc-sestamibi

IQE1 39.44 0.98

IQE2 33.47 0.92

123

IQE1 39.52 0.98

IQE2 30.65 0.75

The results report a dominance using IQE1 (sta-

tistical ﬁlter-based method), in comparison to IQE2

(AI-based method). The ﬁlter-based methodologies

are simpler and designed especially for special tasks,

image denoising in this context, making them highly

effective for IQE purposes. Another reason is that ﬁl-

ters apply a ﬁxed set of operations and treat the im-

ages individually according to their local characteris-

tics which leads to reliable results. On the other hand,

AI-based methodologies for IQE are more complex

and require a large amount of data, which leads to the

non-generalization of new data. Finally, the dataset is

very limited in terms of size which led to the SCUNet

model not performing well compared to NLMD. AI-

based techniques may capture information that isn’t

caught by traditional ﬁlters, but for this speciﬁc task

in this work, NLMD is more performant as the goal

is to enhance the image quality, while features (infor-

mation) are extracted later in the process.

An example application of the two IQEs (NLMD

and SCUNet) on two random subtracted PINHOLES

of two patients is shown in Figure 4. NLMD effec-

tively reduces the noise and enhances the key regions

Enhancing Image Quality to Improve Medical Image Classiﬁcation: Application to Nuclear Medicine Planar Images

307

(indicated in circles) with arrows in the original im-

ages while maintaining the important structures such

as the edges of the images, also in terms of metrics

it achieved PSNR of 37.25, and 38.19, respectively in

the two cases. On the other hand, the application of

the SCUNet (IQE2) ﬁlter in the same two cases re-

ports a decrease in the PSNR and SSIM with 4.1 and

0.07, respectively. It results in partial noise removal

without completely reducing it. Important structures,

such as edges, were slightly modiﬁed, resulting in a

loss of information that could affect the performance

of ML models in terms of accurate image classiﬁca-

tion.

In summary, with IQE1 the noise is removed from im-

ages without modifying the structures which is the ob-

jective of the use case in this study. NLMD (IQE1) is

selected over SCUNet (IQE2) for the next steps and

the rest of the process.

3.2 Image Classiﬁcation

The images are processed with IQE1, and fed to the

CNN for feature extraction and Lasso regression for

feature selection. A benchmarking of ML models

for binary classiﬁcation (normal/abnormal PGs de-

tection) is applied. The images are associated with

the extracted statistical features explained in Subsec-

tion 2.2. The corresponding labels are divided into

472 positive (abnormal), and 236 negative (normal).

The imbalanced data problem is solved by reducing

the number of positive cases to 250. As a result, 250

positive and 236 negative cases are used to benchmark

the ML models. The remaining positive cases aren’t

discarded as they’re used in the test phase.

Table 4 reports the results with different ML models

with various metrics: Accuracy, Area Under Curve,

and Recall. The best-performing model is presented

in each image type with the mean values reported by

10-Fold cross-validation. The results clearly show

that PINHOLE images outperform LEHR images.

This was expected since physicians primarily rely on

the PINHOLE images for the diagnosis.

Table 4: Classiﬁcation results with IQE1. P: PINHOLE, L:

LEHR.

Image type Model Acc AUC Recall

99m

Tc-sestamibi LGBM 77% 74% 73%

123

I RF 75% 71% 71%

Subtracted P. LGBM 81% 80% 80%

99m

Tc-sestamibi SVM 67% 66% 67%

123

I SVM 65% 65% 63%

Combined P. LGBM 83% 83% 81%

The combination of the 3 PINHOLE images im-

proves the performance by 2.47%. This suggests that

using multiple images provides integral information,

which increases the model’s ability to make more ac-

curate classiﬁcations.

To study the ability of the model to distinguish

between the normal and abnormal classes, an AUC

metric was used that gave different results in Ta-

ble 4. A Receiver Operating Characteristic (ROC)

curve with the combination of the 3 PINHOLE im-

ages using LGBM is presented in Figure 5. The corre-

sponding AUC is 0.83 (83%) which expresses a good-

performing model indicating the high ability to dif-

ferentiate between the two classes, meaning that the

model does not tend to a particular class.

Figure 5: ROC curve with the test data.

The results are good, but there is still room for

improvement. A perfect AUC should be 1.0 (100%),

further future works needs to be oriented in a way to

improve this value by increasing the size of the dataset

if available, or AI model improvement.

3.3 Ablation Study

To study the impact of IQE1 on image classiﬁcation

with the different ML models, an ablation study is

performed. To achieve this, the proposed approach

was repeated without the image processing (IQE1)

step. The aim is to evaluate the contribution of

IQE1 to the global methodology, and whether the per-

formance increases or not. The obtained results as

highlighted in Table 5 show a signiﬁcant decrease in

the performance, for example, using combined PIN-

HOLES, the accuracy dropped by 14.4% compared to

the methodology with IQE1 application. This signif-

icant decrease in accuracy emphasizes the important

role of IQE1 in assessing image classiﬁcation perfor-

mance. IQE1 improves image quality by removing

noise and highlighting darker regions, this improves

the relevance and details of images. By doing so, the

BIOIMAGING 2025 - 12th International Conference on Bioimaging

308

IQE1 (NLMD) IQE2 (SCUNet)Original

PSNR: 33.15

SSIM: 0,9

PSNR: 34.69

SSIM: 0,9

PSNR: 37,25

SSIM: 0,97

PSNR: 38.19

SSIM: 0,98

IQE1 (NLMD) IQE2 (SCUNet)Original

Figure 4: Example application of NLMD and SCUNet on subtracted PINHOLE.

ML models focus more on the key features in the im-

ages, resulting in improved performance.

Table 5: Classiﬁcation results with IQE1. P: PINHOLE, L:

LEHR.

Image type Model Acc AUC Recall

99m

Tc-sestamibi LGBM 67% 63% 62%

123

I RF 69% 69% 68%

Subtracted P. LGBM 70% 69% 69%

99m

Tc-sestamibi SVM 58% 57% 58%

123

I SVM 61% 62% 63%

Combined P. LGBM 71% 70 70%

This study clearly demonstrates that IQE1 is cru-

cial to ensuring that AI models achieve optimal per-

formance by providing cleaner, more targeted input

images.

3.4 Performance Limitations

As mentioned in Section 1, physicians usually rely on

multiple data sources before giving the ﬁnal diagno-

sis. The proposed methodology demonstrates good

performance even when relying on a single type of

data (static images). This highlights the effectiveness

of the proposed approach in providing reliable in-

sights with limited data, especially in scenarios where

access to multiple modalities is limited.

While the proposed methodology achieves an ac-

curacy of 83%, the remaining 17% represents cases

where the model couldn’t give the real class label (as

given and assigned by senior physicians). Figure 6

shows an example of misclassiﬁcation, the predicted

labels are in red, whereas the real labels are below

each case. For the right case, the affected PG can’t

be observed in the image as it is in the posterior of

the thyroid, only its corresponding tomographic im-

ages could reveal the PG. On the other hand, in the

right case, the image reveals a gland in the blue cir-

cle, however, the diagnosis report revealed that it was

a thyroid gland and not a PG. The computed tomog-

raphy images were able to show this difference.

Suivi de thèse | Besançon | 11/03/2024 8

DIAGNOSTIQUE PAR CLASSIFICATION

Normal

Abnormal

Normal

Predicted

Real

Figure 6: Misclassiﬁcation example.

For future works, including tomographic images

should be considered for an accurate diagnosis, as

it proved its efﬁciency (

Oks

uz et al., 2011) (Petra-

novi

c Ov

cari

cek et al., 2021). Also, medical object

detection and segmentation could be more accurate

as they only focus on speciﬁc regions of images.

4 CONCLUSIONS

This paper presented a methodology for abnor-

mal PGs detection using static dual-isotope

99m

Tc-

sestamibi and

123

I PINHOLE and LEHR images.

First, statistical features were extracted to be com-

bined with the images for classiﬁcation. The im-

ages were normalized using the MinMax technique.

Then, an image processing approach for IQE was ap-

plied using the statistical ﬁlter-based technique IQE1

(NLMD) and AI-based approach IQE2 (SCUNet).

The obtained results showed that IQE1 outperformed

IQE2 for this task, therefore IQE1 was selected for

the rest of the process. Next, the ﬁltered images were

fed to CNN for feature extraction and then to Lasso

for feature selection. Finally, the statistical features

were combined with the selected features for nor-

mal/abnormal PGs diagnosis. After extensive simu-

lations, the results showed that the proposed method-

ology achieved an accuracy of 83% by combining the

PINHOLE images. The results also showed that IQE1

improved the diagnosis results by 16.9%, boosting

image classiﬁcation. The obtained performance in-

Enhancing Image Quality to Improve Medical Image Classiﬁcation: Application to Nuclear Medicine Planar Images

309

dicates the potential of the proposed methodology to

be a reliable medical assistance tool by providing a

primary diagnosis using only one type of data.

COMPLIANCE WITH ETHICAL

STANDARDS

This study was registered by the Clinical Research

and Innovation Delegation of the University Hospital

Center of Besanc¸on under the number 2023/796.

ACKNOWLEDGMENT

This work has been achieved in the frame of the

EIPHI Graduate school (contract “ANR-17-EURE-

0002”).

REFERENCES

Bhardwaj, N., Kaur, G., and Singh, P. K. (2018). A system-

atic review on image enhancement techniques. Sen-

sors and Image Processing: Proceedings of CSI 2015,

pages 227–235.

Bilezikian, J., Cusano, N., Khan, A., Liu, J., Marcocci,

C., and Bandeira, F. (2016). Primary hyperparathy-

roidism. nature reviews disease primers. 2016; 2:

16033.

Boukhennoufa, O., Comas, L., Nicod, J.-M., Ungureanu,

C., Zerhouni, N., and Boulahdour, H. (2024). Au-

tomatic detection of parathyroid glands in nuclear

medicine. In 2024 IEEE International Symposium on

Biomedical Imaging (ISBI), pages 1–4. IEEE.

Buades, A., Coll, B., and Morel, J.-M. (2011). Non-local

means denoising. Image Processing On Line, 1:208–

212.

Khan, A., Hanley, D., Rizzoli, R., Bollerslev, J., Young,

J., Rejnmark, L., Thakker, R., D’amour, P., Paul, T.,

Van Uum, S., et al. (2017). Primary hyperparathy-

roidism: review and recommendations on evaluation,

diagnosis, and management. a canadian and interna-

tional consensus. Osteoporosis International, 28:1–

19.

Kim, K., Lee, M.-H., and Lee, Y. (2020). Investigation of a

blind-deconvolution framework after noise reduction

using a gamma camera in nuclear medicine imaging.

Nuclear Engineering and Technology, 52(11):2594–

2600.

Lee, S.-W., Shim, S. R., Jeong, S. Y., and Kim, S.-J. (2021).

Direct comparison of preoperative imaging modali-

ties for localization of primary hyperparathyroidism:

a systematic review and network meta-analysis. JAMA

Otolaryngology–Head & Neck Surgery, 147(8):692–

706.

Lin, T.-Y., Goyal, P., Girshick, R., He, K., and Doll

ar, P.

(2017). Focal loss for dense object detection. In

Proceedings of the IEEE international conference on

computer vision, pages 2980–2988.

Nieciecki, M., Cacko, M., and Kr

olicki, L. (2015). The

role of ultrasound and nuclear medicine methods in

the preoperative diagnostics of primary hyperparathy-

roidism. Journal of Ultrasonography, 15(63):398–

409.

Oks

uz, M.

O., Dittmann, H., Wicke, C., M

ussig, K., Bares,

R., Pfannenberg, C., and Eschmann, S. M. (2011). Ac-

curacy of parathyroid imaging: a comparison of pla-

nar scintigraphy, spect, spect-ct, and c-11 methion-

ine pet for the detection of parathyroid adenomas and

glandular hyperplasia. DIR, 17(4):297.

Petranovi

c Ov

cari

cek, P., Giovanella, L., Carri

o Gasset, I.,

Hindi

e, E., Huellner, M. W., Luster, M., Piccardo, A.,

Weber, T., Talbot, J.-N., and Verburg, F. A. (2021).

The eanm practice guidelines for parathyroid imaging.

EJNMMI, 48:2801–2822.

Samaras, A.-D., Tsimara, M., Voidila, S., Papandrianos, N.,

Zampakis, P., Moustakidis, S., Papageorgiou, E., and

Kalogeropoulou, C. (2024). Explainable classiﬁcation

of patients with primary hyperparathyroidism using

highly imbalanced clinical data derived from imaging

and biochemical procedures. AS, 14(5):2171.

Sandqvist, P., Sundin, A., Nilsson, I.-L., Gryb

ack, P., and

Sanchez-Crespo, A. (2022). Primary hyperparathy-

roidism, a machine learning approach to identify

multiglandular disease in patients with a single ade-

noma found at preoperative sestamibi-spect/ct. Euro-

pean Journal of Endocrinology, 187(2):257–263.

Sung, J. Y. (2015). Parathyroid ultrasonography: the

evolving role of the radiologist. Ultrasonography,

34(4):268.

Tibshirani, R. (1996). Regression shrinkage and selection

via the lasso. Journal of the Royal Statistical Society

Series B: Statistical Methodology, 58(1):267–288.

Tlili, G., Mesguich, C., Gaye, D., Tabarin, A., Haissa-

guerre, M., and Hindi

e, E. (2023). Dual-tracer 99mtc-

sestamibi/123i imaging in primary hyperparathy-

roidism. QJNMMI.

Walker, M. D. and Silverberg, S. J. (2018). Primary hy-

perparathyroidism. Nature Reviews Endocrinology,

14(2):115–125.

Yoshida, A., Ueda, D., Higashiyama, S., Katayama, Y.,

Matsumoto, T., Yamanaga, T., Miki, Y., and Kawabe,

J. (2022). Deep learning-based detection of parathy-

roid adenoma by 99mtc-mibi scintigraphy in patients

with primary hyperparathyroidism. Annals of Nuclear

Medicine, 36(5):468–478.

Zhang, K., Li, Y., Liang, J., Cao, J., Zhang, Y., Tang, H.,

Fan, D.-P., Timofte, R., and Gool, L. V. (2023). Practi-

cal blind image denoising via swin-conv-unet and data

synthesis. Machine Intelligence Research, 20(6):822–

836.

Zhang, Z., Zhang, Q., Gao, Z., Zhang, R., Shutova, E.,

Zhou, S., and Zhang, S. (2024). Gradient-based pa-

rameter selection for efﬁcient ﬁne-tuning. In Proceed-

ings of the IEEE/CVF Conference on Computer Vision

and Pattern Recognition, pages 28566–28577.

BIOIMAGING 2025 - 12th International Conference on Bioimaging

310