4 CONCLUSION
Personality prediction is a valuable tool in hiring and
developing the recommendation system. It can also
help users obtain a better mutual understanding in
conversations, improve self-awareness, and achieve
personal growth. This paper discusses implementing
different machine learning methods in personality
prediction based on MBTI. Among the three analyzed
methods, Naive Bayes, Support Vector Machine, and
Extreme Gradient Boosting, Extreme Gradient
Boosting obtains the highest accuracy on average.
However, the average accuracy for the three methods
is not different. This research cannot get into other
methods that are uncommon but show outstanding
performance, like Deep Learning and Recurrent
Neural Networks. Through detailed analysis of each
sub-group, this research also finds that the model’s
capability to predict each sub-group generally falls in
this sequence: N/S > I/E > T/F > J/P. As discussed in
3.5, this dataset is highly skewed, unbalanced, and
noisy. However, surprisingly, the N/S category is the
most unbalanced, as shown in Fig. 2. The sequence
above is an exact sequence of how “unbalanced” the
data is, ranging from high to low. The possible
conclusion that can be drawn from this observation is
that the “equal amount of data on each category” does
not matter as much. More data is most helpful, and it
can train the model to grasp the underlying pattern for
each category better. Unfortunately, due to the lack of
resources, this conclusion cannot be testified – there is
no alternative dataset. The nature of each category or
the inherent bias from the dataset may also be factors
for the difference in accuracy rate. A richer and greater
variety of datasets in the future can lead to more robust
conclusions. Another thing that remains unclear is
why models in some research have greater accuracy
when the model is trained on the same method and
dataset. Moreover, these are places where future
research can be conducted.
REFERENCES
G. Ryan, P. Katarina, and D. Suhartono, “MBTI Personality
Prediction Using Machine Learning and SMOTE for
Balancing Data Based on Statement Sentences,”
Information, vol. 14, no. 4, pp. 217, Apr. 2023.
W. Bleidorn and C. J. Hopwood, “Using Machine Learning
to Advance Personality Assessment and Theory,”
Personality and Social Psychology Review, vol. 23, no.
2, pp. 190–203, May 2018.
C. Stachl et al., “Personality Research and Assessment in
the Era of Machine Learning,” European Journal of
Personality, vol. 34, no. 5, pp. 613–631, Sep. 2020.
K. Ilmini and T. G. I. Fernando, “Persons’ Personality
Traits Recognition Using Machine Learning
Algorithms and Image Processing Techniques,”
Advances in Computer Science: An International
Journal, vol. 5, no. 1, pp. 40–44, Jan. 2016.
N. Lemos, K. Shah, R. Rade, and D. Shah, “Personality
Prediction Based on Handwriting Using Machine
Learning,” 2018 International Conference on
Computational Techniques, Electronics and
Mechanical Systems (CTEMS), Dec. 2018.
I. Maliki and M. A. B. Sidik, “Personality Prediction
System Based on Signatures Using Machine Learning,”
IOP Conference Series, vol. 879, no. 1, pp. 012068, Jul.
2020.
B. Cui and C. Qi, “Survey Analysis of Machine Learning
Methods for Natural Language Processing for MBTI
Personality Type Prediction,” 2017.
K. A., U. Kulsum, S. Rahman, Md. F. Hossain, P.
Chakraborty, and T. Choudhury, “A Comparative
Analysis of Machine Learning Approaches in
Personality Prediction Using MBTI,” in Advances in
intelligent systems and computing, 2021, pp. 13–23.
T. M, “Comparative Study of Personality Prediction From
Social Media by Using Machine Learning and Deep
Learning Method,” IJERT, Jun. 2021.
S. Ontoum and J. H. Chan, “Personality Type Based on
Myers-Briggs Type Indicator With Text Posting Style
by Using Traditional and Deep Learning,” arXiv
(Cornell University), Jan. 2022.
M. H. Amirhosseini and H. B. Kazemian, “Machine
Learning Approach to Personality Type Prediction
Based on the Myers–Briggs Type Indicator®,”
Multimodal Technologies and Interaction, vol. 4, no. 1,
pp. 9, Mar. 2020.
Z., S. Ashraf, and N. Sabahat, “Predicting MBTI
Personality type with K-means Clustering and Gradient
Boosting,” 2020 IEEE 23rd International Multitopic
Conference (INMIC), Nov. 2020.
“(MBTI) Myers-Briggs Personality Type Dataset,” Kaggle,
Sep. 22, 2017.
https://www.kaggle.com/datasets/datasnaek/mbti-type
A. Halevy, P. Norvig, and F. Pereira, “The unreasonable
effectiveness of data,” IEEE Intelligent Systems, vol.
24, no. 2, pp. 8–12, Mar. 2009.
Y. Mehta, N. Majumder, A. Gelbukh, and Z. Wang,
“Recent Trends in Deep Learning Based Personality
Detection,” Artificial Intelligence Review, vol. 53, no.
4, pp. 2313–2339, Oct. 2019.