
tion. Combining the local and foreground selection
attentions enhances accuracy. The proposed LFSM
module complements feature reconstruction methods
and improve performance evident on plant species
datasets and the ablation studies also validate the ef-
fectiveness of LFS attention.
ACKNOWLEDGMENTS
This research utilized Queen Mary’s Apocrita
HPC facility, supported by QMUL Research-IT.
http://doi.org/10.5281/zenodo.438045.
The research leading to this paper was partially
supported by the EU Horizon Europe research and
innovation Programme under Grant Agreement No.
101086461.
REFERENCES
Doersch, C., Gupta, A., and Zisserman, A. (2020).
Crosstransformers: spatially-aware few-shot transfer.
Advances in Neural Information Processing Systems,
33:21981–21993.
Garcin, C., Joly, A., Bonnet, P., Lombardo, J.-C., Af-
fouard, A., Chouet, M., Servajean, M., Lorieul, T.,
and Salmon, J. (2021). Pl@ ntnet-300k: a plant im-
age dataset with high label ambiguity and a long-tailed
distribution. In NeurIPS 2021-35th Conference on
Neural Information Processing Systems.
Huang, H., Zhang, J., Yu, L., Zhang, J., Wu, Q., and Xu,
C. (2021a). Toan: Target-oriented alignment network
for fine-grained image categorization with few labeled
samples. IEEE Transactions on Circuits and Systems
for Video Technology, 32(2):853–866.
Huang, H., Zhang, J., Zhang, J., Xu, J., and Wu, Q.
(2021b). Low-rank pairwise alignment bilinear net-
work for few-shot fine-grained image classification.
IEEE Transactions on Multimedia, 23:1666–1680.
Li, W., Wang, L., Xu, J., Huo, J., Gao, Y., and Luo, J.
(2019). Revisiting local descriptor based image-to-
class measure for few-shot learning. In Proceedings
of the IEEE/CVF conference on computer vision and
pattern recognition, pages 7260–7268.
Li, X., Song, Q., Wu, J., Zhu, R., Ma, Z., and Xue,
J.-H. (2023). Locally-enriched cross-reconstruction
for few-shot fine-grained image classification. IEEE
Transactions on Circuits and Systems for Video Tech-
nology.
Li, X., Wu, J., Sun, Z., Ma, Z., Cao, J., and Xue, J.-H.
(2020). Bsnet: Bi-similarity network for few-shot
fine-grained image classification. IEEE Transactions
on Image Processing, 30:1318–1331.
Nguyen, H. Q., Nguyen, C. Q., Le, D. D., and Pham, H. H.
(2023). Enhancing few-shot image classification with
cosine transformer. IEEE Access.
Nilsback, M.-E. and Zisserman, A. (2008). Automated
flower classification over a large number of classes.
In 2008 Sixth Indian conference on computer vision,
graphics & image processing, pages 722–729. IEEE.
Snell, J., Swersky, K., and Zemel, R. (2017). Proto-
typical networks for few-shot learning. In Guyon,
I., Luxburg, U. V., Bengio, S., Wallach, H., Fer-
gus, R., Vishwanathan, S., and Garnett, R., editors,
Advances in Neural Information Processing Systems,
volume 30. Curran Associates, Inc.
Sun, X., Xv, H., Dong, J., Zhou, H., Chen, C., and Li, Q.
(2021). Few-shot learning for domain-specific fine-
grained image classification. IEEE Transactions on
Industrial Electronics, 68(4):3588–3598.
Sung, F., Yang, Y., Zhang, L., Xiang, T., Torr, P. H., and
Hospedales, T. M. (2018). Learning to compare: Re-
lation network for few-shot learning. In Proceedings
of the IEEE conference on computer vision and pat-
tern recognition, pages 1199–1208.
Van Horn, G., Mac Aodha, O., Song, Y., Cui, Y., Sun,
C., Shepard, A., Adam, H., Perona, P., and Belongie,
S. (2018). The inaturalist species classification and
detection dataset. In Proceedings of the IEEE con-
ference on computer vision and pattern recognition,
pages 8769–8778.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones,
L., Gomez, A. N., Kaiser, Ł., and Polosukhin, I.
(2017). Attention is all you need. Advances in neural
information processing systems, 30.
Vinyals, O., Blundell, C., Lillicrap, T., Wierstra, D., et al.
(2016). Matching networks for one shot learning. Ad-
vances in neural information processing systems, 29.
Wertheimer, D., Tang, L., and Hariharan, B. (2021). Few-
shot classification with feature map reconstruction
networks. In Proceedings of the IEEE/CVF con-
ference on computer vision and pattern recognition,
pages 8012–8021.
Wu, J., Chang, D., Sain, A., Li, X., Ma, Z., Cao, J., Guo,
J., and Song, Y.-Z. (2023). Bi-directional feature re-
construction network for fine-grained few-shot image
classification. In Proceedings of the AAAI Conference
on Artificial Intelligence, volume 37, pages 2821–
2829.
Ye, H.-J., Hu, H., Zhan, D.-C., and Sha, F. (2020). Few-
shot learning via embedding adaptation with set-to-
set functions. In Proceedings of the IEEE/CVF con-
ference on computer vision and pattern recognition,
pages 8808–8817.
Zha, Z., Tang, H., Sun, Y., and Tang, J. (2023). Boosting
few-shot fine-grained recognition with background
suppression and foreground alignment. IEEE Trans-
actions on Circuits and Systems for Video Technology.
Zhang, C., Cai, Y., Lin, G., and Shen, C. (2020). Deepemd:
Few-shot image classification with differentiable earth
mover’s distance and structured classifiers. In Pro-
ceedings of the IEEE/CVF conference on computer vi-
sion and pattern recognition, pages 12203–12213.
Zhang, W., Liu, X., Xue, Z., Gao, Y., and Sun, C. (2021).
Ndpnet: A novel non-linear data projection network
for few-shot fine-grained image classification. arXiv
preprint arXiv:2106.06988.
Zhu, Y., Liu, C., and Jiang, S. (2020). Multi-attention meta
learning for few-shot fine-grained image recognition.
In IJCAI, pages 1090–1096.
Local Foreground Selection Aware Attentive Feature Reconstruction for Few-Shot Fine-Grained Plant Species Classification
539