the generator to progressively enhance its ability to
deceive the discriminator by producing images that
are progressively more challenging to differentiate
from real images. This adversarial training process
leads to the improvement of both the generator and
discriminator over time.
Fig. 2 illustrates the image generation process
based on gass. First, the image material is imported
with a batch size of 64, and this batch of data is
standardized, and then the GAN model is established.
Taking a random noise as input, the generator network
transforms it into sample data by generating outputs.
This process is repeated multiple times to produce
diverse samples from the given seed noise input, a
learning curve is drawn and evaluated based on the
training data.
Figure 2: The pipeline of the model (Picture credit:
Original).
2.2.1 Generator and Discriminator
In the GAN model, the Generator network assumes
the crucial role of generating fresh images. It takes
random noise as input and undergoes a transformative
process to produce an image that closely resembles
the desired target domain. The generator learns to
map the input noise to the output image by leveraging
deep neural networks, such as convolutional neural
networks (CNN) or generative models like
Variational Autoencoders (VAEs). The generator
produces images that are close to real images. The
inner workings of the generator effectively capture
the underlying patterns and arrangements of the data
set. The discriminator network, on the other hand,
acts as a critic. Through learning, the discriminator is
able to react quickly to images and effectively
distinguish between images that are real or fictitious
by the generator that fall within the target range. The
discriminator is also implemented using deep neural
networks, typically CNNs. And the discriminator, on
the other hand, is trained to classify images as either
real or fake. Its primary goal is to effectively identify
the generated images and distinguish them from real
images with high accuracy.
2.2.2 GAN
The GAN model’s significance lies in its ability to
generate new and original images that resemble real
artworks. It has opened up new possibilities for
artistic expression, creative design, and data
synthesis. GAN is a generative model for deep
learning characterized by two adversarial networks,
generative and discriminative, to learn the
distribution of data and generate new samples.The
structure of a GAN consists of two main components:
a generator and a discriminator. The generator is
responsible for generating false samples from random
noise and trying to deceive the discriminator. The
discriminator, on the other hand, is a binary classifier
that distinguishes between real samples and false
samples generated by the generator. The generator
receives a random noise vector as input, maps it to the
data sample space through a series of transformations,
and generates spurious samples. The discriminator
receives the true samples and the false samples
generated by the generator and tries to distinguish the
classes. The goal of the discriminator is to maximize
the ability to correctly classify the true and false
samples. The generator's goal is to minimize the
discriminator's ability to discriminate the generated
false samples, even if the discriminator is unable to
distinguish false samples from true samples. By
iteratively training the generator and the
discriminator, the two networks work against each
other and gradually improve their performance. The
generator and discriminator can be modeled using a
deep neural network, such as a multilayer perceptron
(MLP) or CNN. The input to the generator network is
a random noise vector and its output is a generated
sample. The input to the discriminator network is real
samples or generated samples and its output is the
classification result for the input samples. Through
the adversarial training process, GAN is able to
optimize the dynamic balance between the generator
and the discriminator to generate more realistic
samples. It has a wide range of applications in areas
such as generating images, language modeling, and
audio synthesis. By training the GAN model on a
dataset of existing artworks, it can learn the
underlying patterns, styles, and textures present in the
training data and generate new artworks that capture
the essence of the art style. In the implementation
flow of this experiment, the GAN model is trained
using a two-step process. Initially, the generator
network receives random noise as input and utilizes it
to generate an image. Subsequently, the discriminator
can discriminate between the generated images and
the real images in the training database. The
discriminator is then used to evaluate the screened
situation and provide feedback to the generator.