icance of our approach, including a per class analysis.
Our approach did not surpass some state-of-the-art
methods, mainly because of the limited information
available in the datasets used. However, our results
suggest that our data augmentation can improve HAR
accuracy. In future work, to achieve more competitive
results, we
intend to explore the complementarity of our multi-
stream architecture with other features, such as IDT
(Wang et al., 2013) and I3D (Carreira and Zisserman,
2017). In addition, the SEVR principles could also be
applied to 3D CNNs for video classification problems.

The authors thank CAPES, FAPEMIG (grant CEX-
APQ-01744-15), FAPESP (grants #2017/09160-1 and
#2017/12646-3), and CNPq (grant #305169/2015-7) for
the financial support, and NVIDIA Corporation for
the donation of two Titan Xp GPUs (GPU Grant Program).
