Recursive Aggregation Operator (ReX) in this model
reduces peak RAM and average latency of various
adaptive models on the device by bypassing large
early activations and local representation methods,
greatly improving the efficiency of image
classification (Qian, 2022). At the same time, the
layer channel setting principle derived through the
progressive increase in the number of channels has
effectively resolved the bottleneck problem of the
image classification layer, leading to a significant
improvement in the accuracy of image classification
(Han, 2021).
To enhance image classification accuracy, this
study utilizes the ReXNet model as the classifier for
both AI-generated and real images. The image dataset
is primarily sourced through web scraping, providing
real images for the model. Additionally, AI is
employed to generate synthetic data, mimicking the
characteristics of real images, for model training and
evaluation, available on Kaggle. Data preprocessing
involves standardization and normalization, ensuring
consistent dimensions and magnitudes, which is
crucial for subsequent modeling. The ReXNet model
is employed for feature extraction and classification,
undergoing cyclic training with a specified number of
epochs. Backpropagation is used to update model
parameters based on calculated loss for different data
batches. The study incorporates early stopping
techniques during training to prevent overfitting,
maximizing training efficiency. The experiment
indicates that this research is able to effectively
extract image features. As an image classifier, this
model can accurately and efficiently differentiate
between real images and AI-generated images.
2 METHODOLOGIES
2.1 Dataset Description and
Preprocessing
The dataset for this study is primarily obtained
through random web crawling and AI generation, and
can be accessed on Kaggle (Kaggle, 2024). This
dataset contains 538 AI-generated images and 435
real images randomly collected from the web,
providing a basis for further analysis of their
similarities. Each image has a size of 224x224 pixels,
encompassing a variety of themes, with special
emphasis on people, animals, landscapes, and
psychedelia. The dataset has been divided into
training set and test set. Simultaneously, resizing and
normalizing the images eliminates differences in
dimensions and scales, making the model easier to
converge and laying the foundation for subsequent
modeling. Figure 1 and Figure 2 respectively
illustrate partial AI-generated images and real
images.
Figure 1: AI generated image
(Photo/Picture credit:
Original).
Figure 2: Real image
(Photo/Picture credit: Original).
2.2 Proposed Approach
This study primarily utilizes the ReXNet model as the
classifier for AI-generated images and real images.
Before establishing the model, the data needs to be
normalized for further modeling and analysis. This
model is further optimized based on Convolutional
Neural Network (CNN), and to some extent, it solves
the bottleneck problem in the representation of the
shrinking layer through progressive increase in the
number of channels. When these techniques are
combined, they can better extract the feature
information in the images, effectively improve the
training efficiency of the model, and achieve better
performance in image classification tasks. The
following Figure 3 illustrates the structure of the
system.
Figure 3: Model flow chart
(Photo/Picture credit: Original).
2.2.1 ReXNet
RexNet is a lightweight neural network architecture
designed to strike a balance between model accuracy
and computational efficiency, enabling efficient
image classification tasks in resource-constrained
environments. Its primary goal is to maintain a high
level of accuracy while optimizing computational
resources. In comparison to the traditional network
architecture paradigm, there may be expression
bottleneck issues, which in turn affect model
performance. This model undergoes slight
adjustments on the benchmark network, adopts a
progressively increasing approach to channel count
design, and replaces more expansion layers to address