Court Decision Prediction Model Using Natural Language Processing and Random Forest

https://doi.org/10.22146/ijccs.108377

Nasution Nasution(1*), Suprapto Suprapto(2)

(1) Universitas Gadjah Mada
(2) Department of Computer Science and Electronics
(*) Corresponding Author

Abstract


The increasing number of criminal cases in Indonesia, which reached 288,472 in 2023, or rose by 15% from the previous year, has created a substantial workload for judicial professionals. This situation highlights the urgent need for artificial intelligence–based decision support systems to accelerate and improve the quality of legal decision-making. This study proposes a court decision prediction approach using the Random Forest algorithm combined with Natural Language Processing (NLP) techniques. The dataset consists of 21,630 court decisions from the Supreme Court of Indonesia, originally in PDF format and converted into XML. The research procedure includes text preprocessing, feature construction using Word2Vec and Fast Text, and Random Forest classification. Unlike previous studies employing LSTM, BiLSTM, and CNN methods with accuracy ranging from 49.14% to 77.32%, the proposed approach delivers better performance. Experimental results show that the model achieves a prediction accuracy of up to 63%-81% for Penalty Categories classification and up to 65%-80% for long punishment regression. These findings demonstrate the significant potential of applying NLP and Random Forest to develop predictive systems in Indonesian legal document analysis.

Keywords


Court decision prediction; Natural Language Processing; Random Forest; Machine Learning

Full Text:

PDF


References

F. S. Pratiwi, “Data Jumlah Kejahatan di Indonesia pada 2023,” 2023. [Online]. Available: https://dataindonesia.id/varia/detail/data-jumlah-kejahatan-di-indonesia-pada-2023 E. Q. Nuranti, E. Yulianti, and H. S. Husin, “Predicting the Category and the Length of Punishment in Indonesian Courts Based on Previous Court Decision Documents,” Computers, vol. 11, no. 6, Jun. 2022, doi: 10.3390/computers11060088. R. A. Shaikh, T. P. Sahu, and V. Anand, “Predicting Outcomes of Legal Cases based on Legal Factors using Classifiers,” in Procedia Computer Science, Elsevier B.V., 2020, pp. 2393–2402. doi: 10.1016/j.procs.2020.03.292. M. Y. Noguti, E. Vellasques, and L. S. Oliveira, “Legal Document Classification: An Application to Law Area Prediction of Petitions to Public Prosecution Service,” Oct. 2020, doi: 10.1109/IJCNN48605.2020.9207211. B. Strickson and B. De La Iglesia, “Legal Judgement Prediction for UK Courts,” in ACM International Conference Proceeding Series, Association for Computing Machinery, Mar. 2020, pp. 204–209. doi: 10.1145/3388176.3388183. V. Malik et al., “ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation,” May 2021, [Online]. Available: http://arxiv.org/abs/2105.13562 E. Mumcuoğlu, C. E. Öztürk, H. M. Ozaktas, and A. Koç, “Natural language processing in law: Prediction of outcomes in the higher courts of Turkey,” Inf Process Manag, vol. 58, no. 5, 2021, doi: 10.1016/j.ipm.2021.102684. N. Mustari, S. K. Sen, A. Banik, M. H. K. Mehedi, and A. A. Rasel, “Techniques to Estimate the Status of Legal Proceedings Considering Sequential Text Data,” in 2023 International Conference on Emerging Smart Computing and Informatics, ESCI 2023, Institute of Electrical and Electronics Engineers Inc., 2023. doi: 10.1109/ESCI56872.2023.10099995. R. Anantathanavit, J. Chongthanakorn, W. Kongsantinart, P. Praiwattana, and T. Thaipisutikul, “Utilizing AI and Natural Language Processing to Predict Supreme Court Decisions in Thailand,” in KST 2024 - 16th International Conference on Knowledge and Smart Technology, Institute of Electrical and Electronics Engineers Inc., 2024, pp. 45–50. doi: 10.1109/KST61284.2024.10499665. S. Abbara, M. Hafez, A. Kazzaz, A. Alhothali, and A. Alsolami, “ALJP: An Arabic Legal Judgment Prediction in Personal Status Cases Using Machine Learning Models,” Jul. 2023, [Online]. Available: http://arxiv.org/abs/2309.00238 T. Ansari, H. S. Dhillon, and M. Singh, “Machine Learning Model to Predict Results of Law Cases,” in 2024 4th International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies, ICAECT 2024, Institute of Electrical and Electronics Engineers Inc., 2024. doi: 10.1109/ICAECT60202.2024.10468789.



DOI: https://doi.org/10.22146/ijccs.108377

Article Metrics

Abstract views : 3222 | views : 1266

Refbacks

  • There are currently no refbacks.




Copyright (c) 2025 IJCCS (Indonesian Journal of Computing and Cybernetics Systems)

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



Copyright of :
IJCCS (Indonesian Journal of Computing and Cybernetics Systems)
ISSN 1978-1520 (print); ISSN 2460-7258 (online)
is a scientific journal the results of Computing
and Cybernetics Systems
A publication of IndoCEISS.
Gedung S1 Ruang 416 FMIPA UGM, Sekip Utara, Yogyakarta 55281
Fax: +62274 555133
email:ijccs.mipa@ugm.ac.id | http://jurnal.ugm.ac.id/ijccs



View My Stats1
View My Stats2