Body Part Information Additional in Multi-decoder Transformer-Based Network for Human Object Interaction Detection

Zihao Guo; Fei Li; Rujie Liu; Ryo Ishida; Genta Suzuki

Topics: Deep Learning for Visual Understanding; Event and Human Activity Recognition; Machine Learning Technologies for Vision

In Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 5: VISAPP, pages 221-229, 2023, Lisbon, Portugal

Authors: Zihao Guo ¹; Fei Li ¹; Rujie Liu ¹; Ryo Ishida ² and Genta Suzuki ²

Affiliations: ¹ Fujitsu Research & Development Center Co., Ltd., Beijing, China; ² Fujitsu Research, Fujitsu Limited, Kawasaki, Japan

Keyword(s): Human Object Interaction Detection, Transformer, Multi-decoder, Body Part Information, Channel Attention.

Abstract: Human-Object Interaction (HOI) detection is one of the essential branches of video understanding. Many scenes are complex, however, such as a human interacting with multiple objects. When the whole human body is treated as the subject of the interaction in such scenes, the model may associate the interaction with the wrong object. In this paper, we propose a Transformer-based structure with a body part additional module to solve this problem. The Transformer structure provides powerful information mining capability. A multi-decoder structure is adopted to solve different sub-problems, enabling the model to focus on different regions and improving performance. The most important contribution of our work is the proposed body part additional module. It introduces body part information into HOI detection, which refines the subject of the HOI triplet and assists interaction detection. The body part additional module also includes a Channel Attention module that balances the two sources of information, preventing the model from paying too much attention to either the body part or the human-object pair. Our model achieves better performance than the state-of-the-art model.
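The abstract does not specify how the Channel Attention module is built. As a rough illustration only, the sketch below assumes a squeeze-and-excitation style gate applied to the concatenation of human-object pair channels and body part channels. The function `channel_attention`, the weight matrices `w1` and `w2`, and the toy feature values are all hypothetical and are not taken from the paper.

```python
import math


def sigmoid(x):
    """Logistic function, maps any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(features, w1, w2):
    """Assumed squeeze-and-excitation style channel gate (not the paper's module).

    features: list of channels, each a list of spatial values.
    w1: hidden x channels weight matrix; w2: channels x hidden weight matrix.
    Returns (reweighted features, per-channel gates).
    """
    # Squeeze: global average of each channel.
    squeezed = [sum(ch) / len(ch) for ch in features]
    # Excite: linear layer with ReLU, then linear layer with sigmoid.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    gates = [sigmoid(sum(w * h for w, h in zip(row, hidden))) for row in w2]
    # Scale every value in a channel by that channel's gate.
    scaled = [[g * v for v in ch] for g, ch in zip(gates, features)]
    return scaled, gates


# Hypothetical toy input: two channels from a human-object pair branch
# concatenated with two channels from a body part branch.
pair_feats = [[1.0, 2.0], [0.5, 0.5]]
part_feats = [[4.0, 4.0], [0.0, 1.0]]
fused = pair_feats + part_feats

w1 = [[0.1, 0.1, 0.1, 0.1], [0.2, -0.1, 0.0, 0.1]]        # 4 channels -> 2 hidden
w2 = [[0.5, -0.5], [0.3, 0.3], [-0.2, 0.4], [0.1, 0.1]]    # 2 hidden -> 4 channels

scaled, gates = channel_attention(fused, w1, w2)
```

In this sketch each gate lies strictly between 0 and 1, so neither the body part channels nor the pair channels can be amplified without bound; in a trained network the weights would be learned rather than fixed by hand.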

CC BY-NC-ND 4.0

Paper citation in several formats:

Guo, Z., Li, F., Liu, R., Ishida, R. and Suzuki, G. (2023). Body Part Information Additional in Multi-decoder Transformer-Based Network for Human Object Interaction Detection. In Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 5: VISAPP; ISBN 978-989-758-634-7; ISSN 2184-4321, SciTePress, pages 221-229. DOI: 10.5220/0011755300003417

@conference{visapp23,
author={Zihao Guo and Fei Li and Rujie Liu and Ryo Ishida and Genta Suzuki},
title={Body Part Information Additional in Multi-decoder Transformer-Based Network for Human Object Interaction Detection},
booktitle={Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 5: VISAPP},
year={2023},
pages={221-229},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011755300003417},
isbn={978-989-758-634-7},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 5: VISAPP
TI - Body Part Information Additional in Multi-decoder Transformer-Based Network for Human Object Interaction Detection
SN - 978-989-758-634-7
IS - 2184-4321
AU - Guo, Z.
AU - Li, F.
AU - Liu, R.
AU - Ishida, R.
AU - Suzuki, G.
PY - 2023
SP - 221
EP - 229
DO - 10.5220/0011755300003417
PB - SciTePress