3.3 Predictive Modeling and
Evaluation
In this study, Logistic Regression is applied to predict
whether customers would fall into Cluster 1 or
Cluster 2, and they are classified according to their
buying behaviour. As shown in Figure 8, the
confusion matrix shows the performance of the
model, with the X-axis representing the predicted
cluster and the Y-axis representing the actual cluster.
Figure 8: Confusion Matrix of Logistic Regression
Predictions (Photo/Picture credit: Original).
The results obtained from the model evaluation
show excellent accuracy of 98% to 100% and similar
high recall. The overall accuracy of the model is
about 99.28%, and the F1 scores that balance
accuracy and recall are above 99%. These metrics
show a highly reliable model, which can accurately
segment customers according to the data.
The author finds its high accuracy that shows the
model is robust and effective and has great potential
to assist strategic marketing initiatives. The small
number of misclassifications (3 out of 416) further
emphasizes the predictive ability of the model and
ensures that marketing resources are effectively
allocated to target customers. This modelling can
provide operational insights and realize personalized
marketing strategies, thus significantly improving
customer engagement and conversion rates.
4 CONCLUSIONS
This study introduces a comprehensive method for
consumer personality analysis, leveraging a Kaggle
dataset to unveil customer purchasing behaviours. A
methodology is proposed to deeply investigate rich
consumer data, utilizing the synergy of KMeans
clustering, the Apriori algorithm, and logistic
regression. KMeans clustering is employed to
partition the dataset into distinct segments, unveiling
inherent groupings within the consumer base. The
Apriori algorithm adeptly mines the intricate
associations between consumer segments and their
purchasing tendencies. Subsequently, logistic
regression is utilized to predict cluster membership,
enabling the measurement of potential purchasing
decisions among consumers. The integration of these
methods yields a robust model capable of analyzing
and interpreting the complex dynamics of customer
interactions.
Extensive experiments are conducted to evaluate
the proposed method. The experimental results
demonstrate the model's effectiveness, with high
clustering accuracy and forecasting modelling
affirming the efficacy of segmented customer profiles.
The utilization of association rules further solidifies
the advantages of analysis and furnishes concrete,
data-driven insights for strategic marketing
endeavours. Looking ahead, future research will
consider individual consumers and their influence on
purchasing patterns as the primary objective. The
focus will be on enhancing the accuracy of market
segmentation and customizing marketing strategies
with greater precision. The vision of this study is to
explore the dynamic relationship between evolving
consumer characteristics and market trends, guiding
enterprises to successfully engage with customers in
a more nuanced manner.
REFERENCES
Abdolvand., Neda., Amir, A., and Mohammad, A., (2015).
Performance management using a value-based
customer-centered model. International Journal of
Production Research. vol. 53(18). pp: 5472-5483.
Akhondzadeh, N., Elham., and Amir, A., (2015). Mining
the dominant patterns of customer shifts between
segments by using top-k and distinguishing sequential
rules. Management decision. vol. 53(9). pp: 1976-2003.
Alves, G., Miguel., and Tobias, M., (2023). A review on
customer segmentation methods for personalized
customer targeting in e-commerce use cases.
Information Systems and e-Business Management. vol.
21(3). pp: 527-570.
Anshari., Muhammad., et al. (2019). Customer relationship
management and big data enabled: Personalization &
customization of services. Applied Computing and
Informatics. vol. 15(2) pp: 94-101.
Batrinca., Bogdan., and Philip, C., Treleaven., (2015).
Social media analytics: a survey of techniques, tools
and platforms. Ai & Society. vol. 30. pp: 89-116.
Bell., David., and Chidozie, M., (2018). Data-driven agent-
based exploration of customer behavior. vol. 94(3) pp:
195-212.