
tions, particularly considering that MotionMixer em-
ploys displacement-based representations with values
that are typically very low. It can be observed that the
adversarial attacks algorithms learn to exert the most
variation in ∆s to the last frame, in order to fool the
models. Additionally, in the case of MotionMixer, the
last displacement is replaced with the positions of the
last frame for this evaluation, that’s why the ∆s drop
to zero for the last displacement frame.
6 CONCLUSIONS AND FUTURE
DIRECTION
We observed that models are easily fooled by adver-
sarial attacks as same as in the initial stages of CNNs
on image classification. Furthermore, we showed
that 3D spatial transformations also behave as no-
gradient-based attack methods and have strong ef-
fects on the model performance. As a future direc-
tion, we plan to use these methods as data augmenta-
tion for more realistic scenarios such as small short-
period rotations or spatio-temporal windowed noise.
We also plan to explore black-box methods since we
observed white-box attacks worked successfully and
also a white-box method using the gradients to guide
the 3D spatial transformations.
ACKNOWLEDGEMENTS
The research leading to these results is funded by the
German Federal Ministry for Economic Affairs and
Climate Action within the project “ATTENTION –
Artificial Intelligence for realtime injury prediction”.
The authors would like to thank the consortium for
the successful cooperation.
REFERENCES
Abdollahpourrostam, A., Abroshan, M., and Moosavi-
Dezfooli, S.-M. (2023). Revisiting DeepFool: gen-
eralization and improvement.
Bouazizi, A., Holzbock, A., Kressel, U., Dietmayer, K., and
Belagiannis, V. (2022). MotionMixer: MLP-based 3D
Human Body Pose Forecasting.
Carlini, N. and Wagner, D. (2016). Towards Evaluating the
Robustness of Neural Networks.
Chen, J., Lin, X., Xiong, H., Wu, Y., Zheng, H., and Xuan,
Q. (2021). Smoothing Adversarial Training for GNN.
IEEE Transactions on Computational Social Systems,
8(3):618–629.
Dang, L., Nie, Y., Long, C., Zhang, Q., and Li, G. (2021).
MSR-GCN: Multi-Scale Residual Graph Convolution
Networks for Human Motion Prediction.
Diao, Y., Shao, T., Yang, Y.-L., Zhou, K., and Wang, H.
(2021). BASAR:Black-box Attack on Skeletal Action
Recognition.
Dong, Y., Liao, F., Pang, T., Su, H., Zhu, J., Hu, X., and Li,
J. (2017). Boosting Adversarial Attacks with Momen-
tum.
Dong, Y., Pang, T., Su, H., and Zhu, J. (2019). Evad-
ing Defenses to Transferable Adversarial Examples by
Translation-Invariant Attacks.
Entezari, N., Al-Sayouri, S. A., Darvishzadeh, A., and Pa-
palexakis, E. E. (2020a). All You Need Is Low (Rank).
In Proceedings of the 13th International Conference
on Web Search and Data Mining, pages 169–177, New
York, NY, USA. ACM.
Entezari, N., Al-Sayouri, S. A., Darvishzadeh, A., and Pa-
palexakis, E. E. (2020b). All You Need Is Low (Rank).
In Proceedings of the 13th International Conference
on Web Search and Data Mining, pages 169–177, New
York, NY, USA. ACM.
Fang, M., Yang, G., Gong, N. Z., and Liu, J. (2018). Poison-
ing Attacks to Graph-Based Recommender Systems.
Fu, J., Yang, F., Liu, X., and Yin, J. (2022). Learning
Constrained Dynamic Correlations in Spatiotemporal
Graphs for Motion Prediction.
Goodfellow, I. J., Shlens, J., and Szegedy, C. (2014). Ex-
plaining and Harnessing Adversarial Examples.
Guo, W., Du, Y., Shen, X., Lepetit, V., Alameda-Pineda,
X., and Moreno-Noguer, F. (2022). Back to MLP: A
Simple Baseline for Human Motion Prediction.
Gupta, K., Pesquet-Popescu, B., Kaakai, F., Pesquet, J.-
C., and Malliaros, F. D. (2021). An Adversarial At-
tacker for Neural Networks in Regression Problems.
In IJCAI Workshop on Artificial Intelligence Safety
(AI Safety), Montreal/Virtual, Canada.
Hamdi, A., Rojas, S., Thabet, A., and Ghanem, B. (2019).
AdvPC: Transferable Adversarial Perturbations on 3D
Point Clouds.
Huang, Q., Dong, X., Chen, D., Zhou, H., Zhang, W., and
Yu, N. (2022). Shape-invariant 3D Adversarial Point
Clouds.
Ionescu, C., Papava, D., Olaru, V., and Sminchisescu, C.
(2014). Human3.6M: Large Scale Datasets and Pre-
dictive Methods for 3D Human Sensing in Natural En-
vironments. IEEE Transactions on Pattern Analysis
and Machine Intelligence, 36(7):1325–1339.
Liu, J., Akhtar, N., and Mian, A. (2019). Adversarial Attack
on Skeleton-based Human Action Recognition.
Liu, X., Yin, J., Liu, J., Ding, P., Liu, J., and Liu, H.
(2021). TrajectoryCNN: A New Spatio-Temporal
Feature Learning Network for Human Motion Predic-
tion. IEEE Transactions on Circuits and Systems for
Video Technology, 31(6):2133–2146.
Lyu, K., Chen, H., Liu, Z., Zhang, B., and Wang, R. (2022).
3D Human Motion Prediction: A Survey.
Ma, T., Nie, Y., Long, C., Zhang, Q., and Li, G. (2022). Pro-
gressively Generating Better Initial Guesses Towards
Fooling Neural Networks for Motion Forecasting via Adversarial Attacks
241