Sarcasm Detection For Sentiment Analysis in Indonesian Tweets

https://doi.org/10.22146/ijccs.41136

Yessi Yunitasari(1*), Aina Musdholifah(2), Anny Kartika Sari(3)

(1) Master Program of Computer Science; FMIPA UGM, Yogyakarta
(2) Department of Computer Science and Electronics, FMIPA UGM, Yogyakarta
(3) Department of Computer Science and Electronics, FMIPA UGM, Yogyakarta
(*) Corresponding Author

Abstract


Twitter is one of the social medias that are widely used at the moment. Tweet conversations can be classified according to their sentiments. The existence of sarcasm contained in a tweet sometimes causes incorrect determination of the tweet’s sentiment because sarcasm is difficult to analyze automatically, even by humans. Hence, sarcasm detection needs to be conducted, which is expected to improve the results of sentiment analysis. The effect of sarcasm detection on sentiment analysis can be seen in terms of accuracy, precision and recall. In this paper, detection of sarcasm is applied to Indonesian tweets. The feature extraction of sarcasm detection uses unigram and 4 Boazizi feature sets which consist of sentiment-relate features, punctuation-relate features, lexical and syntactic features, and top word features. Detection of sarcasm uses the Random Forest algorithm. The feature extraction of sentiment analysis uses TF-IDF, while the classification uses Naïve Bayes algorithm. The evaluation shows that sentiment analysis with sarcasm detection improves the  accuracy of sentiment analysis about 5.49%. The accuracy of the model is 80.4%, while the precision is 83.2%, and the recall is 91.3%.

Keywords


Naïve bayes;sarcasm;tweet;sentiment analysis;random forest

Full Text:

PDF


References

[1]      Pandey, “Twitter-sentiment-analysis-using-hybrid-cuckoo-search-method”, Elsevier, 2017[Online].Available:https://www.researchgate.net/publication/314510089_Twitter_sentiment_analysis_using_hybrid_cuckoo_search_method [Accessed: 18-Agustus-2017]

[2]      M. Bouazizi and T. Ohtsuki, “Sarcasm detection in twitter” IEEE. 1–6, 2015 [Online]. Available:https://www.researchgate.net/publication/306524590_A_PatternBased_Approach_for_Sarcasm_Detection_on_Twitter[Accessed: 8-Nov-2017]

[3]      Zhang and Gao., “Performance Analysis and Improvement of Naïve Bayes in Text Classification Application ” IEEE. 1–6, 2013 [Online]. Available: https://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=6775580&punumber%3D6775580%26filter%3DAND%28p_IS_Number%3A6784688%29%26pageNumber%3D5&pageNumber=6[Accessed: 8-Jan-2018]

[4]      Maynard and Greenwood, “Who cares about Sarcastic Tweets? Investigating the Impact of Sarcasm on Sentiment Analysis.,” Lrec, pp. 4238–4243, 2014 [Online]. Available: https://gate.ac.uk/sale/lrec2014/arcomem/sarcasm.pdf[Accessed: 1-Jan-2018]

[5]       E. Lunando and A. Purwarianti , “Indonesian social media sentiment analysis with sarcasm detection,” 2013 Int. Conf. Adv. Comput. Sci. Inf. Syst. ICACSIS 2013, pp. 195–198, 2013 [Online]. Available: https://www.semanticscholar.org/paper/Indonesian-social-media-sentiment-analysis-with-Lunando Purwarianti/c82b544f6f43c69bd0b0bf58f4b963f6c4f31377  [Accessed: 23-Jan-2018]

[6]       M. V. Datla, “Bench marking of classification algorithms: Decision Trees and Random Forests-a case study using R,” Int. Conf. Trends Autom. Commun. Comput. Technol. I-TACT 2015, 2016 [Online]. Available:https://www.scopus.com/record/display.uri?eid=2-s2.0-84979282682&origin=inward&txGid=45c76f2dcf97f506cbcafd87959e7f31[Accessed: 18-Nov-2018]

[7]       Suswanto, “Analisis Perbandingan Metode Machine Learning untuk Prediksi Khasiat Jamu” , pp. 1–7, 2016 [Online]. Available: https://scholar.google.com/citations?user=NzAuocoAAAAJ&hl=id [Accessed: 15-Nov-2018]

[8]        Fariz Prawira, “Pengaruh Pendeteksian Sarkasme Terhadap Ukuran Kualitas Analisis Sentimen Pada Twitter”, Jurusan Matematika FMIPA Universitas Gadjah Mada, Yogyakarta, 2017.

 

[9]       C. Manning, P. Raghavan, and H. Schutze, “An Introduction to Information Retrieval,” Inf. Retr. Boston., no. c, pp. 1–18, 2009 [Online]. Available: https://nlp.stanford.edu/IR-book/pdf/irbookonlinereading.pdf[Accessed: 6-Okt-2017]

[10]     M. Habibi, “Analisis Sentimen Dan Klasifikasi Komentar Mahasiswa Pada Sistem Evaluasi Pembelajaran Menggunakan Kombinasi Knn Berbasis Cosine Similarity Dan Supervised Model”, Tesis, Program Studi S2 Ilmu Komputer, Departemen Ilmu Komputer dan Elektronika, Fakultas Matematika dan Ilmu Pengetahuan Alam, Universitas Gadjah Mada Yogyakarta, 2017.



DOI: https://doi.org/10.22146/ijccs.41136

Article Metrics

Abstract views : 881 | views : 755

Refbacks

  • There are currently no refbacks.




Copyright (c) 2019 IJCCS (Indonesian Journal of Computing and Cybernetics Systems)

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



Copyright of :
IJCCS (Indonesian Journal of Computing and Cybernetics Systems)
ISSN 1978-1520 (print); ISSN 2460-7258 (online)
is a scientific journal the results of Computing
and Cybernetics Systems
A publication of IndoCEISS.
Gedung S1 Ruang 416 FMIPA UGM, Sekip Utara, Yogyakarta 55281
Fax: +62274 555133
email:ijccs.mipa@ugm.ac.id | http://jurnal.ugm.ac.id/ijccs



View My Stats1
View My Stats2