Personality Classification of Myers Briggs Type Indicators (MBTI) Using BERT and Machine Learning

Agus Sihabuddin(1*)
(1) Department of Computer Sciences and Electronics, Faculty of Mathematics and Natural Sciences, Universitas Gadjah Mada
(*) Corresponding Author
Abstract
Personality classification using textual data from social media or online forums is a complex task due to the unstructured text and the multifaceted nature of personality. While the Myers-Briggs Type Indicator (MBTI) provides a comprehensive framework, adapting it to media data and handling diverse linguistic patterns requires effective algorithms. The psychological basis of MBTI is intricate, especially when using complex methods like deep learning, which can be challenging.
This study classifies personality types based on each individual's behavior on an online forum by observing the linguistic patterns of posted textual data using the SVM, Random Forest, BERT, and Word2Vec algorithms. The SVM and Random Forest algorithms are traditional machine learning algorithms known for their capabilities and effectiveness in text classification. Meanwhile, BERT and Word2Vec identify semantic relationships and contextual information from textual data. In addition, the IndoBERT model will be used for the BERT model because this study focuses on the classification of Indonesian language texts.
Testing was carried out using textual data from posts on the PersonalityCafe forum. The test results showed that the combination of the SVM and IndoBERT models outperformed other models with an accuracy rate of 82% and an F1 score of 75%.
Keywords
Full Text:
PDFReferences
Biriyai, A.H. dan Thomas, E.V., 2014, Online Discussion Forum: A Tool for Effective Student-Teacher Interaction, International Journal of Applied Sciences, 1, 3, 111-116.
Lucky, H., Roslynlia, dan Suhartono, D., 2021, Towards Classification of Personality Prediction Model: A Combination of BERT Word Embedding and MLSMOTE, 2021 1st International Conference on Computer Science and Artificial Intelligence (ICCSAI), 1, 346-350.
Utami, N. A., Maharani, W., dan Atastina, I., 2021, Personality Classification of Facebook Users According to Big Five Personality Using SVM (Support Vector Machine) Method, Procedia Computer Science, 1, 179, 177–184.
Kazameini, A., Fatehi, S., Mehta, Y., Eetemadi, S. dan Cambria, E., 2020, Personality Trait Detection Using Bagged SVM over BERT Word Embedding Ensembles. Available: https://arxiv.org/abs/2010.01309. [Accessed: 18-Jan-2024]
Murdrika, N., 2014, MBTI (Myer Briggs Type Indicator). Available: http://dewihardiningtyas.lecture.ub.ac.id/files/2012/04/mbti.pdf. [Accessed: 14-Mar-2024]
Gjurković, M. dan Šnajder, J., 2018, Reddit: A Gold Mine for Personality Prediction, Proceedings of the Second Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media, New Orleans.
Abidin, N.H.Z., Akmal, M., Mohd, N., Nincarean, D., Yusoff, N., Karimah, H., dan H, A., 2020, Improving Intelligent Personality Prediction using Myers-Briggs Type Indicator and Random Forest Classifier, International Journal of Advanced Computer Science and Applications, 11, 11, 192-199.
Zumma, Md. T., Munia, J. A., Halder, D., dan Rahman, Md. S., 2022, Personality Prediction from Twitter Dataset using Machine Learning, 2022 13th International Conference on Computing Communication and Networking Technologies (ICCCNT), 1-5.
Apriansyah, R., 2021, KLASIFIKASI KEPRIBADIAN PENGGUNA TWITTER BERDASARKAN DATA TWEET MENGGUNAKAN DEEP LEARNING, Tesis, Jurusan Ilmu Komputer FMIPA UGM, Yogyakarta.
Alsubhi, S.M., Alhothali, A.M., dan AlMansour, A.A., 2023, AraBig5: The Big Five Personality Traits Prediction Using Machine Learning Algorithm on Arabic Tweets, IEEE Access, 11, 112526–112534.
Amirhosseini, M.H. dan Kazemian, H., 2020, Machine Learning Approach to Personality Type Prediction Based on the Myers–Briggs Type Indicator®, Multimodal Technologies and Interaction, 4, 9.
Marklin, C., 2022, Word2Vec ─ Skip-Gram, https://medium.com/@corymaklin/word2vec-skip-gram-904775613b4c, [Accessed: 5-Feb-2024]
Vu, K., 2021, BERT Transformers: How Do They Work?, https://dzone.com/articles/bert-transformers-how-do-they-work, [Accessed: 3-Feb-2024]
Kristinic, D., Braović, M., Šerić, L., Božić-Štulić, D., 2020, Multi-Label Classifier Performace Evaluation with Confusion Matrix, International Conference on Soft Computing, Artificial Intelligence, and Machine Learning (SAIM 2020), 1-14.

Article Metrics


Refbacks
- There are currently no refbacks.
Copyright (c) 2025 IJCCS (Indonesian Journal of Computing and Cybernetics Systems)

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
View My Stats1