Convolutional Long Short-Term Memory Implementation for Indonesian News Classification
Text classification is now a well-studied field, particularly in Natural Language Processing (NLP). The text classification can be carried out using various methods, one of which is deep learning. Deep learning methods such as RNN, CNN, and LSTM are the most frequent methods used for text classification. This research aims to analyze the implementation of two deep learning methods combination, namely CNN and LSTM (C-LSTM), to classify Indonesian news texts. News texts used as data in this study were collected from Indonesian news portals. The obtained data were then divided into three categories based on their scope: "National," "International," and "Regional." Three research variables were tested in this study: the number of documents, the batch size value, and the learning rate value of the built C-LSTM. The experimental results showed that the F1-score obtained from the classification results using the C-LSTM method was 93.27%. The F1-score value generated by the C-LSTM method was higher than that of CNN (89.85%) and LSTM (90.87%). In summary, the combination method of two deep learning methods, namely CNN and LSTM (C-LSTM), outperforms CNN and LSTM.
K. Kowsari, K.J. Meimandi, M. Heidarysafa, S. Mendu, L. Barnes, dan D. Brown, “Text Classification Algorithms: A Survey,” Inf., Vol. 10, No. 4, hal. 1–68, 2019.
J. Li, Y. Xu, dan H. Shi, “Bidirectional LSTM with Hierarchical Attention for Text Classification,” Proc. 2019 IEEE 4th Adv. Inf. Technol. Electron. Autom. Control Conf. (IAEAC 2019), 2019, hal. 456–459.
M.A. Ramdhani, D.S. Maylawati, dan T. Mantoro, “Indonesian News Classification Using Convolutional Neural Network,” Indones. J. Electr. Eng. Comput. Sci., Vol. 19, No. 2, hal. 1000–1009, 2020.
M. Shi, K. Wang, dan C. Li, “A C-LSTM with Word Embedding Model for News Text Classification,” Proc. - 18th IEEE/ACIS Int. Conf. Comput. Inf. Sci. (ICIS 2019), 2019, hal. 253–257.
C. Li, G. Zhan, dan Z. Li, “News Text Classification Based on Improved Bi-LSTM-CNN,” Proc. - 9th Int. Conf. Inf. Technol. Med. Educ. (ITME 2018), 2018, hal. 890–893.
A. Kulkarni dan A. Shivananda, “Deep Learning for NLP,” dalam Natural Language Processing Recipes, New York, AS: Apress, 2019, Ch. 6, hal. 185–227.
P. Zhou, Z. Qi, S. Zheng, J. Xu, H. Bao, dan B. Xu, “Text Classification Improved by Integrating Bidirectional LSTM with Two-dimensional Max Pooling,” Proc. COLING 2016 - 26th Int. Conf. Comput. Linguist. Tech. Pap., 2016, hal. 3485–3495.
F. Zhu, X. Dong, R. Song, Y. Hong, dan Q. Zhu, “A Multiple Learning Model Based Voting System for News Headline Classification,” dalam Natural Language Processing and Chinese Computing (NLPCC 2017), Lecture Notes in Computer Science, Vol. 10619, X. Huang, J. Jiang, D. Zhao, Y. Feng, dan Yu Hong, Eds., Cham, Swiss: Springer, 2018, hal. 797–806.
I. Goodfellow, Y. Bengio, dan A. Courville, Deep Learning, Cambridge, AS: The MIT Press, 2016.
L. Yang dan A. Shami, “On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice,” Neurocomput., Vol. 415, hal. 295–316, 2020.
Suyanto, “Ukuran Evaluasi Model Klasifikasi,” dalam Machine Learning Tingkat Dasar dan Lanjut, Bandung, Indonesia: INFORMATIKA, 2018, hal. 334.
I. Kandel dan M. Castelli, “The Effect of Batch Size on the Generalizability of the Convolutional Neural Networks on a Histopathology Dataset,” ICT Express, Vol. 6, No. 4, hal. 312–315, 2020.
P.M. Radiuk, “Impact of Training Set Batch Size on the Performance of Convolutional Neural Networks for Diverse Datasets,” Inf. Technol. Manag. Sci., Vol. 20, No. 1, hal. 20–24, 2018.