SENTIMENT ANALYSIS OF LEAGUE OF LEGENDS: WILD RIFT REVIEWS ON GOOGLE PLAY USING NAÃVE BAYES CLASSIFIER

Khadijah Khadijah; Nur Sabilly; Fajar Agung Nugroho

doi:10.21107/kursor.v12i01.328

Authors

Khadijah Khadijah Universitas Diponegoro, Indonesia
Nur Sabilly
Fajar Agung Nugroho

DOI:

https://doi.org/10.21107/kursor.v12i01.328

Keywords:

Sentiment Analysis, Naive Bayes Classifier, TF-IDF, BOW, Game reviews, League of Legend.

Abstract

League of Legends: Wild Rift is a mobile game with more than 48 million downloads worldwide. The game publishers could earn profit from selling item in the game (in-app purchases). Performance of player and players' impressions during the first week usually determine whether players would made in-app purchases or not. Therefore, it is important to understand the player opinions so that the game publisher could encourage the players to increase the in-app purchases. Therefore, this research utilized sentiment analysis to study the player opinions about the League of Legends: Wild Rift game based on the reviews given by the players on the Google Play Store. The sentiment analysis was applied by using Naive Bayes Classifier (NBC) algorithm which was well known for achieving good accuracy in the sentiment analysis task. In addition, data preprocessing and feature extraction should be carried out properly to increase the accuracy of the classifier. Therefore, this research investigated the impact of using stemming and transformation of informal words into formal words in the preprocessing stages, then compared two feature extraction algorithms, namely Term Frequency â€“ Inverse Document Frequency (TF-IDF) and Bag of Words (BOW). From the experiment, it was found that the use of stemming could decrease the accuracy of the classifier, but the use of transformation of non-standard words into standard words could improve the performance of the classifier, for both feature extractions, BOW and TF-IDF. In this case, BOW feature extraction was able to achieve better performance, compared to TF-IDF. The best model was achieved when not using stemming, applying the transformation of informal words into formal words, and using BOW bigram feature extraction, with the accuracy of 79,3%, precision of 82.10%, recall of 83.50%, and f1-score of 82,8.10%.

Downloads

Download data is not yet available.

References

[1] data.ai, â€œState of Mobile 2022,â€ San Francisco, California, USA, 2021. Accessed: Sep. 25, 2022. [Online]. Available: https://www.data.ai/en/go/state-of-mobile-2022/
[2] Y. Jiao, C. S. Tang, and J. Wang, â€œAn empirical study of play duration and in-app purchase behavior in mobile games,â€ Prod Oper Manag, vol. 31, no. 9, pp. 3435â€“3456, Sep. 2022, doi: 10.1111/POMS.13772.
[3] T. U. Haque, N. N. Saber, and F. M. Shah, â€œSentiment analysis on large scale Amazon product reviews,â€ 2018 IEEE International Conference on Innovative Research and Development, ICIRD 2018, pp. 1â€“6, Jun. 2018, doi: 10.1109/ICIRD.2018.8376299.
[4] S. W. H. Kwok, S. K. Vadde, and G. Wang, â€œTweet Topics and Sentiments Relating to COVID-19 Vaccination Among Australian Twitter Users: Machine Learning Analysis,â€ J Med Internet Res, vol. 23, no. 5, May 2021, doi: 10.2196/26953.
[5] B. Kurniawan, S. Effendi, and O. S. Sitompul, â€œKlasifikasi Konten Berita Dengan Metode Text Mining,â€ Dunia Teknologi Informasi - Jurnal Online, vol. 1, no. 1, Dec. 2012, Accessed: Sep. 26, 2022. [Online]. Available: https://jurnal.usu.ac.id/index.php/duniait/article/view/409
[6] C. S. K. Aditya, M. Haniâ€™ah, A. A. Fitrawan, A. Z. Arifin, and D. Purwitasari, â€œDeteksi Bot Spammer pada Twitter Berbasis Sentiment Analysis dan Time Interval Entropy,â€ Jurnal Buana Informatika, vol. 7, no. 3, pp. 179â€“186, 2016.
[7] S. Fransiska, Rianto, and A. I. Gufroni, â€œSentiment Analysis Provider by.U on Google Play Store Reviews with TF-IDF and Support Vector Machine (SVM) Method,â€ Scientific Journal of Informatics, vol. 7, no. 2, pp. 2407â€“7658, 2020, [Online]. Available: http://journal.unnes.ac.id/nju/index.php/sji
[8] X. Li and W. Yu, â€œFast Support Vector Machine Classification for Large Data Sets,â€ International Journal of Computer Intelligent System, vol. 7, pp. 197â€“212, 2014.
[9] I. Afdhal et al., â€œPenerapan Algoritma Random Forest Untuk Analisis Sentimen Komentar Di YouTube Tentang Islamofobia,â€ Jurnal Nasional Komputasi dan Teknologi Informasi (JNKTI), vol. 5, no. 1, pp. 122â€“129, Feb. 2022, doi: 10.32672/JNKTI.V5I1.4004.
[10] A. Singh, M. N. Halgamuge, and R. Lakshmiganthan, â€œImpact of Different Data Types on Classifier Performance of Random Forest, NaÃ¯ve Bayes, and K-Nearest Neighbors Algorithms,â€ 2017. [Online]. Available: www.ijacsa.thesai.org
[11] D. Jurafsky and J. H. Asghar, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, 2nd ed., . 2nd ed., London: Pearson, 2009.
[12] A. K. Sharma, S. Kumar Prajapat, and M. Aslam, â€œA Comparative Study between NaÃ¯ve Bayes and Neural Network (MLP) Classifier for Spam Email Detection,â€ International Journal of Computer Applications (IJCA), pp. 12â€“16, 2014.
[13] H. Simorangkir and K. M. Lhaksmana, â€œAnalisis Sentimen Pada Twitter Untuk Games Online Mobile Legends Dan Arena Of Valor Dengan Metode NaÃƒÂ¯ve Bayes Classifier,â€ eProceedings of Engineering, vol. 5, no. 3, Dec. 2018, Accessed: Sep. 25, 2022. [Online]. Available: https://openlibrarypublications.telkomuniversity.ac.id/index.php/engineering/article/view/7448
[14] M. E. Permana, H. Ramadhan, I. Budi, A. B. Santoso, and P. K. Putra, â€œSentiment analysis and topic detection of mobile banking application review,â€ in 2020 5th International Conference on Informatics and Computing, ICIC 2020, Institute of Electrical and Electronics Engineers Inc., Nov. 2020. doi: 10.1109/ICIC50835.2020.9288616.
[15] M. A. M. Salem and A. Y. A. Maghari, â€œSentiment analysis of mobile phone products reviews using classification algorithms,â€ in Proceedings - 2020 International Conference on Promising Electronic Technologies, ICPET 2020, Institute of Electrical and Electronics Engineers Inc., Dec. 2020, pp. 84â€“88. doi: 10.1109/ICPET51420.2020.00024.
[16] A. Y. Maghari and J. H. Zendah, â€œDetecting Significant Events in Arabic Microblogs using Soft Frequent Pattern Mining,â€ Journal of Engineering Research and Technology, vol. 6, no. 1, Mar. 2019, Accessed: Oct. 03, 2022. [Online]. Available: https://journals.iugaza.edu.ps/index.php/JERT/article/view/5365
[17] G. Gautam and D. Yadav, â€œSentiment analysis of twitter data using machine learning approaches and semantic analysis,â€ 2014 7th International Conference on Contemporary Computing, IC3 2014, pp. 437â€“442, 2014, doi: 10.1109/IC3.2014.6897213.
[18] B. A. Alhaj and A. Y. A. Maghari, â€œPredicting User Entries by Using Data Mining Algorithms,â€ Proceedings - 2017 Palestinian International Conference on Information and Communication Technology, PICICT 2017, pp. 110â€“114, Sep. 2017, doi: 10.1109/PICICT.2017.24.
[19] Mhd. T. A. Bangsa, S. Priyanta, and Y. Suyanto, â€œAspect-Based Sentiment Analysis of Online Marketplace Reviews Using Convolutional Neural Network,â€ IJCCS (Indonesian Journal of Computing and Cybernetics Systems), vol. 14, no. 2, p. 123, Apr. 2020, doi: 10.22146/ijccs.51646.
[20] N. A. Salsabila, Y. A. Winatmoko, A. A. Septiandri, and A. Jamal, â€œColloquial Indonesian Lexicon,â€ Proceedings of the 2018 International Conference on Asian Language Processing, IALP 2018, pp. 226â€“229, Jan. 2019, doi: 10.1109/IALP.2018.8629151.
[21] D. Satria and M. Mushthofa, â€œPerbandingan Metode Ekstraksi Ciri Histogram dan PCA untuk Mendeteksi Stoma pada Citra Penampang Daun Freycinetia,â€ Jurnal Ilmu Komputer dan Agri-Informatika, vol. 2, no. 1, pp. 20â€“28, May 2013, doi: 10.29244/JIKA.2.1.20-28.
[22] F. Alzami, E. D. Udayanti, D. P. Prabowo, and R. A. Megantara, â€œDocument Preprocessing with TF-IDF to Improve the Polarity Classification Performance of Unstructured Sentiment Analysis,â€ Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control, pp. 235â€“242, Aug. 2020, doi: 10.22219/kinetik.v5i3.1066.
[23] Jurafsky, D., & Martin, J. H. (2021). Speech and Language Processing An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition First Edition.
[24] Rahmawan, F. (2011). Implementasi Question Answering System pada Dokumen Bahasa Indonesia Menggunakan Metode N-Gram. http://repository.ipb.ac.id/handle/123456789/54453
[25] Bishop, C. M. (2006). Pattern Recognition and Machine Learning (M. Jordan, J. Kleinberg, & B. Scholkopf, Ed.). Springer.

SENTIMENT ANALYSIS OF LEAGUE OF LEGENDS: WILD RIFT REVIEWS ON GOOGLE PLAY USING NAÃVE BAYES CLASSIFIER

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

Citation Check

Make a Submission

system

TOOLS

tanggal_penting

Important Date

template2

certificate

histats

purcase_contact

Purchase Contact

Information

SENTIMENT ANALYSIS OF LEAGUE OF LEGENDS: WILD RIFT REVIEWS ON GOOGLE PLAY USING NAÃVE BAYES CLASSIFIER

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

Citation Check

Make a Submission

system

TOOLS

tanggal_penting

Important Date

template2

certificate

histats

purcase_contact

Purchase Contact

Information

SENTIMENT ANALYSIS OF LEAGUE OF LEGENDS: WILD RIFT REVIEWS ON GOOGLE PLAY USING NAÃVE BAYES CLASSIFIER