
Table 6: Accuracy scores for main represented labels.
Label Accuracy
Ear Wax 0.99
Normal 0.84
Partial Obstruction 0.85
Pseudotympanum 0.98
Tympanosclerosis 0.87
Mean 0.91
hensive multi-label analysis.
We plan to enhance our model by expanding our
dataset to include a wider range of conditions, espe-
cially rare ones, improving robustness and diagnostic
accuracy. Additionally, we will adopt a multimodal
approach, integrating tonal and vocal audiometry with
endoscopy imaging to enhance diagnostic precision.
We also aim to leverage advanced vision language
models like LLaMA to boost our classification per-
formance. These developments are expected to signif-
icantly advance patient outcomes in clinical settings.
5 COMPLIANCE WITH ETHICAL
STANDARDS
This study was conducted in accordance with the prin-
ciples outlined in the Declaration of Helsinki. In-
formed consent was obtained from all individual par-
ticipants involved in the study. Additionally, all pa-
tient data were collected in the current clinical prat-
ice without modifying the patient’s treatment path-
way. To date, no study suggests that a photo of an
eardrum or a earcanal can be used to identify a pa-
tient. Taken together, this data does not fall within
the CNIL’s definition of paersonal sensitive data. The
data were anonymized to ensure privacy and confi-
dentiality. Prior to their treatment, patients have given
their consent for their data to be processed electron-
ically and used anonymously for clinical and scien-
tific studies, in accordance with general data protec-
tion regulations.
REFERENCES
Cha, D., Pae, C., Seong, S., Choi, J., and Park, H. (2019).
Automated diagnosis of ear disease using ensem-
ble deep learning with a big otoendoscopy image
database.
Chen, Y.-C., Chu, Y.-C., Huang, C.-Y., Lee, Y.-T., Lee, W.-
Y., Hsu, C.-Y., Yang, A. C., Liao, W.-H., and Cheng,
Y.-F. (2022). Smartphone-based artificial intelligence
using a transfer learning algorithm for the detection
and diagnosis of middle ear diseases: A retrospective
deep learning study.
Davis, J. and Goadrich, M. (2006). The relationship be-
tween precision-recall and roc curves. pages 233–240.
Gulshan, V., Peng, L., and et al., M. C. (2016). Develop-
ment and validation of a deep learning algorithm for
detection of diabetic retinopathy in retinal fundus pho-
tographs. JAMA, 316(22):2402–2410.
Habib, A., Xu, Y., Bock, K., et al. (2023a). Evaluating
the generalizability of deep learning image classifi-
cation algorithms to detect middle ear disease using
otoscopy. Scientific Reports, 13:5368.
Habib, A.-R., Xu, Y., Bock, K., Mohanty, S., Sederholm,
T., Weeks, W. B., Dodhia, R., Ferres, J. L., Perry, C.,
Sacks, R., and Singh, N. (2023b). Evaluating the gen-
eralizability of deep learning image classification al-
gorithms to detect middle ear disease using otoscopy.
Scientific Reports, 13(1):5368.
He, K., Zhang, X., Ren, S., and Sun, J. (2016). Deep resid-
ual learning for image recognition. pages 770–778.
Huang, G., Liu, Z., van der Maaten, L., and Weinberger,
K. Q. (2017). Densely connected convolutional net-
works. Journal of Neural Networks.
Kingma, D. P. and Ba, J. (2015). Adam: A method for
stochastic optimization.
Kohavi, R. (1995). A study of cross-validation and boot-
strap for accuracy estimation and model selection.
14(2):1137–1145.
LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep learn-
ing. Nature, 521(7553):436–444.
Livingstone, D., Talai, A. S., Chau, J., and Forkert, N. D.
(2019). Building an otoscopic screening prototype
tool using deep learning. Journal of Otolaryngology -
Head & Neck Surgery, 48(66). Published: 26 Novem-
ber 2019. Accesses: 3521, Citations: 27, Altmetric:
1.
Olson, D. and Delen, D. (2008). Advanced data mining
techniques.
Shorten, C. and Khoshgoftaar, T. M. (2019). A survey on
image data augmentation for deep learning. Journal
of Big Data, 6(1):Article 60.
Simonyan, K. and Zisserman, A. (2014). Very deep con-
volutional networks for large-scale image recognition.
arXiv preprint arXiv:1409.1556.
Viscaino, M., Maass, J., Delano, P., Torrente, M., Stott, C.,
et al. (2020). Computer-aided diagnosis of external
and middle ear conditions: A machine learning ap-
proach. PLOS ONE, 15(3):e0229226.
Wu, Z., Lin, Z., Li, L., Pan, H., Chen, G., Fu, Y., and Qiu,
Q. (2020). Deep learning for classification of pediatric
otitis media. Pediatrics. First published: 28 December
2020. Editor’s Note: This Manuscript was accepted
for publication on November 23, 2020. The authors
have no funding, financial relationships, or conflicts
of interest to disclose.
Zeng, X., Jiang, Z., and et al., W. L. (2021). Efficient and
accurate identification of ear diseases using an ensem-
ble deep learning model.
Zhou, Z., Huang, H., and Fang, B. (2021). Application of
weighted cross-entropy loss function in intrusion de-
tection. Journal of Computer and Communications,
9(11):1.
ICPRAM 2025 - 14th International Conference on Pattern Recognition Applications and Methods
800