6 CONCLUSION
In this paper, we proposed pitching classification
method using V-Net with reconstruction, and habit
detection is performed based on Grad-CAM. It can
provide higher accuracy than the conventional video
classification method by using reconstruction and SE-
block. By using our proposed method, we can
understand important movements. Thus, our method
will be useful to the analysis of human movements.
However, some results of habit detection include
an ambiguous heat map or a blurred heat map. Since
the improved version (
A. Chattopadhyay and A. Sarkar,
2018)
of Grad-CAM has been proposed, we would
like to try it in the future to make visualized images
clearer. Furthermore, we would like to confirm the
effectiveness of the proposed method by applying our
method to other video classification datasets.
REFERENCES
R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D.
Parikh, D. Batra, โGrad-CAM: Visual Explanations
From Deep Networks via Gradient-Based
Localizationโ, In Proc. International Conference on
Computer Vision, pp. 618-626, 2017.
F. Milletari, N. Navab, S.A. Ahmadi, โV-Net: Fully
Convolutional Neural Networks for Volumetric
Medical Image Segmentationโ, In Proc. International
Conference on 3D Vision, pp. 565-571, 2016.
I. Laptev, T. Lindeberg, โSpace-Time Interest Pointsโ, In
Proc. International Conference on Computer Vision,
pp.432-439, 2003.
H. Wang, A. Klaser, C. Schmid, C.L. Liu, โAction
recognition by dense trajectoriesโ, In Proc. IEEE
Conference on Computer Vision and Patern
Recognition, pp.3169-3176, 2011.
D. Tran1, L. Bourdev, R. Fergus, L. Torresani, M. Paluri1,
โLearning Spatiotemporal Features with 3D
Convolutional Networksโ, In Proc. International
Conference on Computer Vision, pp.4489-4497, 2015.
S. Ji, W. Xu, M. Yang, K. Yu, โ3D Convolutional Neural
Networks for Human Action Recognitionโ, IEEE
Transactions on Pattern Analysis and Machine
Intelligence, Volume, Vol.35, pp.221-231, 2013.
K. He, X. Zhang, S. Ren, J. Sun. Deep Residual Learning
for Image Recognition. In Proc. Computer Vision and
Pattern Recognition, pp.770-778, 2016.
O. Ronneberger, P. Fischer, T. Brox, โU-Net:
Convolutional Networks for Biomedical Image
Segmentationโ, In Proc. Medical Image Computing and
Computer-Assisted Intervention, pp.234-241, 2015.
M. D. Zeiler R. Fergus, โVisualizing and understanding
convolutional networksโ, In Proc. European
Conference on Computer Vision, pp.818-833, 2014.
S. Ioffe, โBatch renormalization: Towards reducing
minibatch dependence in batch-normalized modelsโ, In
Proc. Neural Information Processing Systems, pp.1942-
1950, 2017.
J. Hu, L. Shen, G. Sun, โSqueeze-and-Excitation
Networksโ, In Proc. IEEE Conference on Computer
Vision and Pattern Recognition, pp.7132-7141, 2018.
M. Lin, Q. Chen, S. Yan, โNetwork in network,โ In Proc.
International Conference on Learning Representations,
2014.
A. Chattopadhyay, A. Sarkar, P. Howlader,
V.N.Balasubramanian, โGrad-CAM++: Generalized
Gradient-Based Visual Explanations for Deep
Convolutional Networksโ, In Proc. Winter Conference
on Applications of Computer Vision, 2018.