Conference on Computer Vision and Pattern Recognition (CVPR), pages 770–778, Las Vegas, NV, USA. IEEE.
Kang, B., Liu, Z., Wang, X., Yu, F., Feng, J., and Darrell, T. (2019). Few-Shot Object Detection via Feature Reweighting. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 8420–8429.
Kulyukin, V., Gharpure, C., and Nicholson, J. (2005). RoboCart: Toward robot-assisted navigation of grocery stores by the visually impaired. In 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 2845–2850.
Lin, T.-Y., Dollár, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017). Feature Pyramid Networks for Object Detection. In The IEEE Conference on Computer Vision and Pattern Recognition, pages 2117–2125, Honolulu.
Lin, T.-Y., Maire, M., Belongie, S., Bourdev, L., Girshick, R., Hays, J., Perona, P., Ramanan, D., Zitnick, C. L., and Dollár, P. (2015). Microsoft COCO: Common Objects in Context. arXiv:1405.0312 [cs].
López-de-Ipiña, D., Lorido, T., and López, U. (2011). Indoor Navigation and Product Recognition for Blind People Assisted Shopping. In Bravo, J., Hervás, R., and Villarreal, V., editors, Ambient Assisted Living, volume 6693, pages 33–40. Springer Berlin Heidelberg, Berlin, Heidelberg.
Merler, M., Galleguillos, C., and Belongie, S. (2007). Recognizing Groceries in situ Using in vitro Training Data. In 2007 IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis. IEEE.
Munjal, B., Amin, S., Tombari, F., and Galasso, F. (2019). Query-Guided End-To-End Person Search. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 811–820.
Osokin, A., Sumin, D., and Lomakin, V. (2020). OS2D:
One-Stage One-Shot Object Detection by Matching
Anchor Features.
Qiao, S., Shen, W., Qiu, W., Liu, C., and Yuille, A. (2017). ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 1809–1818, Venice. IEEE.
Ranst, W. V., Smedt, F. D., Berte, J., and Goedemé, T. (2018). Fast Simultaneous People Detection and Re-identification in a Single Shot Network. In 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), pages 1–6.
Redmon, J. and Farhadi, A. (2016). YOLO9000: Better,
Faster, Stronger. arXiv:1612.08242 [cs].
Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In Advances in Neural Information Processing Systems 28 (NIPS 2015), pages 91–99.
Srivastava, M. M. (2020). Bag of Tricks for Retail Product Image Classification. In Campilho, A., Karray, F., and Wang, Z., editors, Image Analysis and Recognition, volume 12131, pages 71–82. Springer International Publishing, Cham.
Tonioni, A. and Di Stefano, L. (2017). Product recognition in store shelves as a sub-graph isomorphism problem. In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), volume 10484 LNCS, pages 682–693. Springer Verlag.
Tonioni, A., Serra, E., and Stefano, L. D. (2018). A
deep learning pipeline for product recognition on store
shelves. In 2018 IEEE International Conference on
Image Processing, Applications and Systems (IPAS),
pages 25–31.
Wang, W., Cui, Y., Li, G., Jiang, C., and Deng, S. (2020). A self-attention-based destruction and construction learning fine-grained image classification method for retail product recognition. Neural Computing and Applications, 32(18):14613–14622.
Wang, Z., Zheng, L., Li, Y., and Wang, S. (2019). Linkage Based Face Clustering via Graph Convolution Network.
Xiao, J., Xie, Y., Tillo, T., Huang, K., Wei, Y., and Feng,
J. (2019). IAN: The Individual Aggregation Network
for Person Search. Pattern Recognition, 87:332–340.
Xiao, T., Li, S., Wang, B., Lin, L., and Wang, X. (2017). Joint Detection and Identification Feature Learning for Person Search. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3376–3385, Honolulu, HI. IEEE.
Zhou, X., Girdhar, R., Joulin, A., Krähenbühl, P., and Misra, I. (2022). Detecting Twenty-thousand Classes using Image-level Supervision. arXiv:2201.02605 [cs].
VISAPP 2023 - 18th International Conference on Computer Vision Theory and Applications