Authors:
Kenji Yasuda; Ryohei Orihara; Yuichi Sei; Yasuyuki Tahara and Akihiko Ohsuga
Affiliation:
Graduate School of Informatics and Engineering, University of Electro-Communications, 1-5-1 Chofugaoka, Chofu-shi, Tokyo 182-8585, Japan
Keyword(s):
Deep Learning, Domain Transfer, Generative Adversarial Network, Unsupervised Learning, Voice Conversion.
Related Ontology Subjects/Areas/Topics:
Artificial Intelligence; Biomedical Engineering; Biomedical Signal Processing; Computational Intelligence; Health Engineering and Technology Applications; Human-Computer Interaction; Methodologies and Methods; Neural Networks; Neurocomputing; Neurotechnology, Electronics and Informatics; Pattern Recognition; Physiological Computing Systems; Sensor Networks; Signal Processing; Soft Computing; Theory and Methods
Abstract:
In recent years, deep learning techniques have achieved natural and highly accurate outputs in domain transfer tasks. In particular, the advent of Generative Adversarial Networks (GANs) has enabled the transfer of objects between unspecified domains. Voice conversion, a popular example of domain transfer in speech, can be regarded as domain transfer between speakers. However, most voice conversion studies have focused only on transforming speaker identity, whereas capturing other nuances in the voice is necessary for natural speech synthesis. To address this issue, we transform the emotion expressed in speech using CycleGAN, one of the most promising GAN models. Specifically, we investigate the usefulness of speech with low emotional intensity as training data. Such speech is found to be useful when the training data contain multiple speakers.
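For readers unfamiliar with the CycleGAN objective referenced above, the sketch below illustrates one training step that combines the adversarial loss with the cycle-consistency loss on acoustic feature frames. It is a minimal, hypothetical PyTorch example and not the implementation used in the paper: the feature dimension, network sizes, loss weights, and optimizer settings are illustrative assumptions.

```python
# Minimal, hypothetical sketch of a CycleGAN training step for emotion conversion
# of acoustic feature frames. NOT the paper's implementation: the feature dimension,
# network sizes, loss weights, and optimizers are illustrative assumptions.
import torch
import torch.nn as nn

FEAT_DIM = 36        # assumed acoustic feature dimension (e.g., mel-cepstral coefficients)
LAMBDA_CYC = 10.0    # assumed weight of the cycle-consistency loss


def mlp(in_dim, out_dim):
    # Tiny fully connected network standing in for the real generator/discriminator.
    return nn.Sequential(nn.Linear(in_dim, 128), nn.ReLU(), nn.Linear(128, out_dim))


# Generators map frames between emotion domains A and B; discriminators
# score how realistic a frame looks within each domain.
G_ab, G_ba = mlp(FEAT_DIM, FEAT_DIM), mlp(FEAT_DIM, FEAT_DIM)
D_a, D_b = mlp(FEAT_DIM, 1), mlp(FEAT_DIM, 1)

opt_g = torch.optim.Adam(list(G_ab.parameters()) + list(G_ba.parameters()), lr=2e-4)
opt_d = torch.optim.Adam(list(D_a.parameters()) + list(D_b.parameters()), lr=1e-4)
adv = nn.MSELoss()   # least-squares GAN loss
l1 = nn.L1Loss()     # cycle-consistency loss


def train_step(real_a, real_b):
    # Generator update: fool both discriminators and reconstruct the inputs
    # after a round trip A -> B -> A (and B -> A -> B).
    fake_b, fake_a = G_ab(real_a), G_ba(real_b)
    cyc_a, cyc_b = G_ba(fake_b), G_ab(fake_a)
    s_fa, s_fb = D_a(fake_a), D_b(fake_b)
    loss_g = (adv(s_fb, torch.ones_like(s_fb)) + adv(s_fa, torch.ones_like(s_fa))
              + LAMBDA_CYC * (l1(cyc_a, real_a) + l1(cyc_b, real_b)))
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()

    # Discriminator update: real frames scored as 1, generated frames as 0.
    s_ra, s_rb = D_a(real_a), D_b(real_b)
    s_fa, s_fb = D_a(fake_a.detach()), D_b(fake_b.detach())
    loss_d = (adv(s_ra, torch.ones_like(s_ra)) + adv(s_rb, torch.ones_like(s_rb))
              + adv(s_fa, torch.zeros_like(s_fa)) + adv(s_fb, torch.zeros_like(s_fb)))
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()
    return loss_g.item(), loss_d.item()


# Usage with random stand-in batches of 16 frames per emotion domain:
print(train_step(torch.randn(16, FEAT_DIM), torch.randn(16, FEAT_DIM)))
```

The least-squares adversarial loss and the L1 cycle-consistency term follow the standard CycleGAN formulation; the paper's actual network architectures and feature extraction pipeline are not reproduced here.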