Flower Classification and Key Parameter Analysis Based on ViT

Ruochen Deng

2024

Abstract

Flower classification holds significant implications for various fields, including plant resource survey and plant taxonomy education. This paper proposes employing the Vision Transformer (ViT) model for flower classification tasks. The study aims to investigate the impact of varying depth and head parameters in ViT model on their performance. Through an analysis of accuracy performance and attention properties, the research explores optimal strategies for setting depth and head parameters. Additionally, it delves into the phenomenon of attention collapse within the multi-head attention mechanism, utilizing mean attention distance plots for in-depth analysis. Results reveal a positive correlation between model depth, number of heads, and classification accuracy. Moreover, insights gleaned from attention collapse observations provide valuable guidance for optimizing depth and head parameter settings. This study offers valuable insights into the performance of ViT models in flower classification tasks, while also contributing to the understanding of depth and head parameters in self-attention mechanisms for future research endeavors.

Download


Paper Citation


in Harvard Style

Deng R. (2024). Flower Classification and Key Parameter Analysis Based on ViT. In Proceedings of the 1st International Conference on Engineering Management, Information Technology and Intelligence - Volume 1: EMITI; ISBN 978-989-758-713-9, SciTePress, pages 194-199. DOI: 10.5220/0012922900004508


in Bibtex Style

@conference{emiti24,
author={Ruochen Deng},
title={Flower Classification and Key Parameter Analysis Based on ViT},
booktitle={Proceedings of the 1st International Conference on Engineering Management, Information Technology and Intelligence - Volume 1: EMITI},
year={2024},
pages={194-199},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012922900004508},
isbn={978-989-758-713-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 1st International Conference on Engineering Management, Information Technology and Intelligence - Volume 1: EMITI
TI - Flower Classification and Key Parameter Analysis Based on ViT
SN - 978-989-758-713-9
AU - Deng R.
PY - 2024
SP - 194
EP - 199
DO - 10.5220/0012922900004508
PB - SciTePress