This paper investigates the features that make an
individual a potential consumer of OFD services. By
employing supervised machine learning models and
incorporating feedback from OFD users, we aim to
delineate the profile of potential OFD users.
Recognizing the significant impact of dataset quality
on accuracy based on previous research, this study
utilizes a dataset that includes geographical locations
and demographic information related to online food
delivery. Extensive efforts were made to portray a
general image of OFD service users using this data.
Multiple models were implemented in this study, and
a comprehensive comparison of these models was
conducted to leverage their strengths and address
their weaknesses. Additionally, we placed particular
emphasis on geographical factors such as longitude
and latitude, as well as demographic factors like
household size, education level, and monthly income.
To present the findings clearly and intuitively,
machine learning models and data visualization
techniques were emphasized.
The dataset comprises rich independent features
such as latitude and longitude, gender, occupation,
household size, and individual feedback. The aim of
this research is to analyze relevant data through
machine learning models to comprehensively depict
the profile of potential OFD users, thereby
uncovering common characteristics and needs within
these user groups. This approach not only provides
guidance for OFD platforms to customize their
services effectively and improve customer
satisfaction but also offers insights for refining
marketing strategies and user interface designs. By
thoroughly analyzing these factors, we hope to
enhance the understanding of diverse user group
needs and thus support the development of the online
food delivery industry.
The remainder part of this paper is constructed
below. The second section is about the related work
about the OFD service. Section 3 is mainly about the
research method and detail of the dataset. The
machine learning model applied is discussed in
Section 4. Section 5 is about the result and the
discussion of this experiment. Finally, we discuss the
conclusion of the result and future work about OFD
services.
2 RELATED WORKS
The satisfaction of the user of online food delivery
system is determined by different factors. Previous
studies on factors affecting satisfaction with online
food delivery services have spanned a wide range of
research areas. Machine learning witnessed a vast
progress in the last decades, applied in extensive
aspects of researches. Common machine learning
methods include random forest, XGBoost, decision
trees, artificial neural networks, Bayesian networks,
and so on. (Yeo et al., 2017) Some studies trained
their models on datasets and employed various
machine learning models for prediction. Their
research achieved high accuracy. Huycock Tan
(2021) initially implemented a linear regression
model, providing insights into online food delivery
users. Their model focused on online food delivery
services during the COVID-19 pandemic, illustrating
factors that positively impact OFD user satisfaction.
Another study by Janmejay (2024) applied a decision
tree classification model, uncovering preferences of
OFD users, offering better decision guidance for
online food delivery aggregators. SVM and Ridge
regression was applied in the research of Wang, W.M.
(2018).
So far, the world of auto machine learning models
and deep learning models has enlarged greatly. For
example, PLS-ANN (Foo et al., 2018) and Bi-LSTM
(Tam and Tanriöver, 2021). They have been
successfully applicated into a wide range of fields and
gain outstanding results.
3 METHOD
In this research, dataset is preprocessed into the form
which the machine learning models and neural
network are easy to manipulate on. machine learning
techniques are implemented to predict the likelihood
of potential user based on the features given.
Optimization are conduct to improve the performance
of the models, especially for the neural network.
Based on the algorithm of random forest, the
influence of different features on the result are
conducted (Figure 1).
Figure 1: Research workflow (Picture credit: Original).