loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Shonosuke Gonda ; Fumihiko Sakaue and Jun Sato

Affiliation: Nagoya Institute of Technology, Nagoya 466-8555, Japan

Keyword(s): Image Synthesis, Image Distribution, GAN, Multi-Discriminator, Clip, Foundation Model, Multimodal.

Abstract: In a Generative Adversarial Network (GAN), in which the generator and discriminator learn adversarially, the performance of the generator can be improved by improving the discriminator’s discriminatory ability. Thus, in this paper, we propose a method to improve the generator’s generative ability by adversarially training a single generator with multiple discriminators, each with different expertise. By each discriminator having different expertise, the overall discriminatory ability of the discriminator is improved, which improves the generator’s performance. However, it is not easy to give multiple discriminators independent expertise. To address this, we propose CLIP-MDGAN, which leverages CLIP, a large-scale learning model that has recently attracted a lot of attention, to classify a dataset into multiple classes with different visual features. Based on CLIP-based classification, each discriminator is assigned a specific subset of images to promote the development of independent expertise. Furthermore, we introduce a method to gradually increase the number of discriminators in adversarial training to reduce instability in training multiple discriminators and reduce training costs. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.222.28.236

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Gonda, S., Sakaue, F. and Sato, J. (2025). CLIP-MDGAN: Multi-Discriminator GAN Using CLIP Task Allocation. In Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP; ISBN 978-989-758-728-3; ISSN 2184-4321, SciTePress, pages 464-470. DOI: 10.5220/0013231900003912

@conference{visapp25,
author={Shonosuke Gonda and Fumihiko Sakaue and Jun Sato},
title={CLIP-MDGAN: Multi-Discriminator GAN Using CLIP Task Allocation},
booktitle={Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP},
year={2025},
pages={464-470},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013231900003912},
isbn={978-989-758-728-3},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP
TI - CLIP-MDGAN: Multi-Discriminator GAN Using CLIP Task Allocation
SN - 978-989-758-728-3
IS - 2184-4321
AU - Gonda, S.
AU - Sakaue, F.
AU - Sato, J.
PY - 2025
SP - 464
EP - 470
DO - 10.5220/0013231900003912
PB - SciTePress