The advent of deep learning, particularly
convolutional neural networks (CNN), offers a
transformative avenue to automate complex visual
tasks, including image classification. The application
of deep learning techniques to agricultural processes,
such as maturity assessment, has shown promising
results in enhancing accuracy and efficiency
(Mohanty et al., 2016).
One of the major challenges in conducting
predictive work regarding the ripeness of oil palm
fruit bunches lies in acquiring appropriate images for
fruit maturity detection. Typically, fruits are
segregated, and images are captured either when they
are on the ground or while still on the tree (Suharjito
et al., 2023).
There are two critical moments requiring fruit
maturity classification: 1) while the fruit is still on the
tree to determine the optimal harvesting time and 2)
when it's within the production area before oil
extraction. However, it's uncommon to find work or
images of fruits during this latter stage, despite it
being arguably the most crucial. Large companies
usually have fruit suppliers, and accurately
classifying incoming fruit is essential. Additionally,
for a final evaluation of one's own fruits, determining
their maturity is crucial.
In fruit unloading zones, there are often inclined
ramps or reception platforms where fruits are
transported from trucks to the oil extraction area.
During transit on these ramps, fruit bunches are
typically not well-separated and may stack on top of
one another. Our aim is precisely to develop a model
capable of classifying fruit at this stage of the process.
Hence, this work's primary objective is to establish a
database using images captured specifically on these
loading ramps.
In response to the limitations of traditional
methods and building upon promising prior deep
learning research, this study aims to harness deep
learning for oil palm fruit bunch maturity
classification. Primary objectives include developing
a robust deep learning model capable of accurately
distinguishing between different maturity stages,
utilizing images to capture dynamic changes in fruit
bunches over time. To achieve these goals, images at
various maturity states will be annotated, and
YOLOv8 will be employed for maturity detection.
This study seeks to provide technological
advancement, enhancing maturity assessment
accuracy, and contributing to sustainable practices in
oil palm cultivation.
2 RELATED WORKS
The use of video data for crop monitoring has
emerged as a valuable tool in precision agriculture.
Video-based approaches provide a dynamic
understanding of crop growth and maturation
processes over time. Successfully applied in various
crops such as grapes (Kangune et al., 2019; Zhao et
al., 2023) and wheat (Virlet et al., 2016), this
methodology showcases its potential to capture
temporal changes in oil palm fruit bunches.
Recent research has made significant strides in the
maturity classification of oil palm fruit, leveraging
advanced technologies. Many studies rely on non-
invasive methods, predominantly visual-based,
avoiding direct contact with the fruit. Some authors
employ computer vision and machine learning
systems, extracting color features or other image
characteristics using methods like support vector
machine (SVM) (Septiarini et al., 2019) and artificial
neural networks (ANN). For example, Septiarini A. et
al. (2021) use different machine learning algorithms
as Naïve Bayes, SVM and ANN. Others utilize
Raman spectroscopy, as demonstrated by Raj T. et al.
(2021) employing Raman signal features as input for
KNN. Considering the importance of segmentation in
traditional machine learning and/or computer vision
methods, some authors have focused on this aspect
(Septiarini et al., 2020).
The integration of deep learning techniques into
agriculture has gained ground, offering innovative
solutions to various challenges, including crop
monitoring, disease detection, and yield prediction.
Deep learning models, particularly Convolutional
Neural Networks (CNN), have shown remarkable
success in image-based tasks, providing a foundation
for their application in maturity classification. Recent
works, including the use of convolutional neural
networks capable of classifying oil palm fruit through
knowledge transfer, for example, Suharjito et al.,
(2021), compare various CNN models, such as
MobileNetV1, MobileNetV2, NASNet Mobile, and
EfficientNetB0, with transfer learning (Suharjito et
al., 2021). On the other hand, models such as YOLO
show promising results when it comes to classifying
multiple fruits in a single image with internal
segmentation. Authors using the YOLO model have
employed various versions, ranging from YOLOv3
(Mohd Basir Selvam et al., 2021) to YOLOv5
(Mansour et al., 2022). Some authors have even
compared YOLO with other CNN models (Junior &
Suharjito, 2023; Mansour et al., 2022).
However, effective classification models depend
on a robust database, emphasizing the fundamental