Sign Language Recognition Based on Subspace Representations in the Spatio-Temporal Frequency Domain

Ryota Sato; Suzana Beleza; Erica Shimomoto; Matheus Silva de Lima; Nobuko Kato; Kazuhiro Fukui

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Sign Language Recognition Based on Subspace Representations in the Spatio-Temporal Frequency Domain

Topics: Classification and Clustering; Feature Selection and Extraction; Linear Models and Dimensionality Reduction; Machine Learning Methods; Shape Representation

In Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods ICPRAM - Volume 1, 152-159, 2024 , Rome, Italy

Authors: Ryota Sato ¹ ; Suzana Beleza ¹ ; Erica Shimomoto ² ; Matheus Silva de Lima ¹ ; Nobuko Kato ³ and Kazuhiro Fukui ¹

Affiliations: ¹ University of Tsukuba, Department of Computer Science, Tsukuba, Ibaraki, Japan ; ² National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan ; ³ Tsukuba University of Technology, Faculty of Industrial Technology, Tsukuba, Ibaraki, Japan

Keyword(s): Sign Language Recognition, 3D Fast Fourier Transform, Product Grassmann Manifold, Subspace-Based Methods.

Abstract: This paper proposes a subspace-based method for sign language recognition in videos. Typical subspace-based methods represent a video as a low-dimensional subspace generated by applying principal component analysis (PCA) to a set of images from the video. Such representation is compact and practical for motion recognition under few learning data. However, given the complex motion and structure in sign languages, subspace-based methods need to improve performance as they do not consider temporal information like the order of frames. To address this issue, we propose processing time-domain information on the frequency-domain by applying the three-dimensional fast Fourier transform (3D-FFT) to sign videos, where a sign video is represented as a 3D amplitude spectrum tensor, which is invariant to deviations in the spatial and temporal directions of target objects. Further, a 3D amplitude spectral tensor is regarded as one point on the Product Grassmann Manifold (PGM). By unfolding the te nsor in all three dimensions, PGM can account for the temporal information. Finally, we calculate video similarity by using the distances between two corresponding points on the PGM. The effectiveness of the proposed method is demonstrated on private and public sign language recognition datasets, showing a significant performance improvement over conventional subspace-based methods. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.166

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Sato, R., Beleza, S., Shimomoto, E., Silva de Lima, M., Kato, N. and Fukui, K. (2024). Sign Language Recognition Based on Subspace Representations in the Spatio-Temporal Frequency Domain. In Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-684-2; ISSN 2184-4313, SciTePress, pages 152-159. DOI: 10.5220/0012577000003654

@conference{icpram24,
author={Ryota Sato and Suzana Beleza and Erica Shimomoto and Matheus {Silva de Lima} and Nobuko Kato and Kazuhiro Fukui},
title={Sign Language Recognition Based on Subspace Representations in the Spatio-Temporal Frequency Domain},
booktitle={Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2024},
pages={152-159},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012577000003654},
isbn={978-989-758-684-2},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - Sign Language Recognition Based on Subspace Representations in the Spatio-Temporal Frequency Domain
SN - 978-989-758-684-2
IS - 2184-4313
AU - Sato, R.
AU - Beleza, S.
AU - Shimomoto, E.
AU - Silva de Lima, M.
AU - Kato, N.
AU - Fukui, K.
PY - 2024
SP - 152
EP - 159
DO - 10.5220/0012577000003654
PB - SciTePress