Krizhevsky, A., Hinton, G., et al. (2009). Learning multiple
layers of features from tiny images.
Kurakin, A., Goodfellow, I., and Bengio, S. (2016). Ad-
versarial machine learning at scale. arXiv preprint
arXiv:1611.01236.
Liu, J., Sun, Y., Han, C., Dou, Z., and Li, W. (2020). Deep
representation learning on long-tailed data: A learn-
able embedding augmentation perspective. In IEEE
Conference on Computer Vision and Pattern Recogni-
tion (CVPR), pages 2970–2979.
Liu, X.-Y., Wu, J., and Zhou, Z.-H. (2008). Exploratory
undersampling for class-imbalance learning. IEEE
Transactions on Systems, Man, and Cybernetics, Part
B (Cybernetics), 39(2):539–550.
Liu, Z., Miao, Z., Zhan, X., Wang, J., Gong, B., and Yu,
S. X. (2019). Large-scale long-tailed recognition in an
open world. In IEEE Conference on Computer Vision
and Pattern Recognition (CVPR), pages 2537–2546.
Long, J., Shelhamer, E., and Darrell, T. (2015). Fully con-
volutional networks for semantic segmentation. In
IEEE Conference on Computer Vision and Pattern
Recognition (CVPR), pages 3431–3440.
Maciejewski, T. and Stefanowski, J. (2011). Local neigh-
bourhood extension of smote for mining imbalanced
data. In 2011 IEEE symposium on computational in-
telligence and data mining (CIDM), pages 104–111.
Madry, A., Makelov, A., Schmidt, L., Tsipras, D., and
Vladu, A. (2018a). Towards deep learning models re-
sistant to adversarial attacks. In International Confer-
ence on Learning Representations (ICLR).
Madry, A., Makelov, A., Schmidt, L., Tsipras, D., and
Vladu, A. (2018b). Towards deep learning models re-
sistant to adversarial attacks. In International Confer-
ence on Learning Representations (ICLR).
Menon, A. K., Jayasumana, S., Rawat, A. S., Jain, H., Veit,
A., and Kumar, S. (2020). Long-tail learning via logit
adjustment. arXiv preprint arXiv:2007.07314.
Oquab, M., Bottou, L., Laptev, I., and Sivic, J. (2014).
Learning and transferring mid-level image represen-
tations using convolutional neural networks. In IEEE
Conference on Computer Vision and Pattern Recogni-
tion (CVPR), pages 1717–1724.
Park, S., Hong, Y., Heo, B., Yun, S., and Choi, J. Y. (2022).
The majority can help the minority: Context-rich mi-
nority oversampling for long-tailed classification. In
IEEE Conference on Computer Vision and Pattern
Recognition (CVPR), pages 6887–6896.
Park, S., Lim, J., Jeon, Y., and Choi, J. Y. (2021). Influence-
balanced loss for imbalanced visual classification.
In IEEE Conference on International Conference on
Computer Vision (ICCV), pages 735–744.
Ren, J., Yu, C., Ma, X., Zhao, H., Yi, S., et al. (2020). Bal-
anced meta-softmax for long-tailed visual recognition.
Advances in Neural Information Processing Systems
(NeurIPS), 33:4175–4186.
Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster
r-cnn: Towards real-time object detection with region
proposal networks. Advances in Neural Information
Processing Systems (NeurIPS), 28.
Rony, J., Hafemann, L. G., Oliveira, L. S., Ayed, I. B.,
Sabourin, R., and Granger, E. (2019). Decoupling
direction and norm for efficient gradient-based l2 ad-
versarial attacks and defenses. In IEEE Conference
on Computer Vision and Pattern Recognition (CVPR),
pages 4322–4330.
Sinha, A., Namkoong, H., and Duchi, J. (2018). Certifying
some distributional robustness with principled adver-
sarial training. In International Conference on Learn-
ing Representations (ICLR).
Szegedy, C., Toshev, A., and Erhan, D. (2013). Deep neu-
ral networks for object detection. Advances in Neural
Information Processing Systems (NeurIPS), 26.
Tan, M. and Le, Q. (2019). Efficientnet: Rethinking model
scaling for convolutional neural networks. In Inter-
national Conference on Machine Learning (ICML),
pages 6105–6114.
Tomek, I. (1976). Two modifications of cnn. IEEE Trans-
actions on Systems, Man, and Cybernetics, SMC-
6(11):769–772.
Wang, X., Lian, L., Miao, Z., Liu, Z., and Yu, S. X.
(2020). Long-tailed recognition by routing di-
verse distribution-aware experts. arXiv preprint
arXiv:2010.01809.
Wang, Y.-X., Ramanan, D., and Hebert, M. (2017). Learn-
ing to model the tail. Advances in Neural Information
Processing Systems (NeurIPS), 30.
Yun, S., Han, D., Oh, S. J., Chun, S., Choe, J., and Yoo,
Y. (2019). Cutmix: Regularization strategy to train
strong classifiers with localizable features. In IEEE
Conference on International Conference on Computer
Vision (ICCV), pages 6023–6032.
Zhang, H., Cisse, M., Dauphin, Y. N., and Lopez-Paz, D.
(2018). mixup: Beyond empirical risk minimization.
In International Conference on Learning Representa-
tions (ICLR).
Zhang, H., Yu, Y., Jiao, J., Xing, E., El Ghaoui, L., and Jor-
dan, M. (2019). Theoretically principled trade-off be-
tween robustness and accuracy. In International Con-
ference on Machine Learning (ICML), pages 7472–
7482.
Zhang, Y., Kang, B., Hooi, B., Yan, S., and Feng, J. (2023).
Deep long-tailed learning: A survey. IEEE Trans-
actions on Pattern Analysis and Machine Intelligence
(PAMI).
Zhong, Z., Cui, J., Liu, S., and Jia, J. (2021). Improving
calibration for long-tailed recognition. In IEEE Con-
ference on Computer Vision and Pattern Recognition
(CVPR), pages 16489–16498.
VISAPP 2024 - 19th International Conference on Computer Vision Theory and Applications
220