CAN K-NEAREST NEIGHBOR METHOD BE USED TO PREDICT SUCCESS IN INDONESIA STATE UNIVERSITY STUDENT SELECTION

Harits Ar Roysid; Aris Maulana; Utomo Pujianto

doi:10.28961/kursor.v9i4.186

Authors

Harits Ar Roysid Electrical Engineering Department, State University of Malang, Indonesia
Aris Maulana Universitas Negeri Malang, Indonesia
Utomo Pujianto Universitas Negeri Malang, Indonesia

DOI:

https://doi.org/10.28961/kursor.v9i4.186

Keywords:

SNMPTN, KNN, SMOTE, CLASSIFICATION

Abstract

Seleksi Nasional Masuk Perguruan Tinggi Negeri (SNMPTN) is one of the selection pathways for student admissions to enter state universities (PTN) in Indonesia. This study aims to predict the chance of being accepted in the desired PTN and the lack of early monitoring of students for SNMPTN. The data source from the grades reports card of SMAN 1 Pakong, SMAN 8 Kediri, and SMAN 1 Pamekasan by using the average input of compulsory subjects, majors (Science / Social Sciences) and semester 1 to semester 5 which later the output to be accepted or not accepted An imbalanced dataset potentially affect the performance of the classification method used. Hence, we need to eliminate the imbalance class using SMOTE. Using 10-fold cross validation, this study compared K-Nearest Neighbor (KNN) without SMOTE and K-NN with SMOTE. The goal is to find the best prediction model between the two methods. The prediction model is applied to software for teachers to monitor student grades and ensuring students to pass the SNMPTN. The results show that KNN without SMOTE has higher accuracy than KNN with SMOTE. However, KNN with SMOTE outperform than KNN without SMOTE in precision and recall, KNN with SMOTE with K = 3 reached 80.08% Accuracy, 74.42% Precision and 91.68% Recall.

Downloads

Download data is not yet available.

References

[1] A. T. Wibowo and D. Fitrianah, â€œa K-Nearest Algorithm Based Application To Predict Snmptn Acceptance for High School,â€ Int. Res. J. Comput. Sci., vol. 5, no. 1, pp. 9â€“20, 2018.
[2] R. Siringoringo, â€œKlasifikasi Data Tidak Seimbang Menggunakan Algoritma SMOTE Dan K-Nearest Neighbor,â€ J. ISD, vol. 3, no. 1, pp. 44â€“49, 2018.
[3] P. A. Santoso, A. P. Wibawa, and U. Pujianto, â€œInternship recommendation system using simple additive weighting,â€ Bull. Soc. Informatics Theory Appl., vol. 2, no. 1, pp. 15â€“21, 2018.
[4] M. Vahdat, L. Oneto, D. Anguita, M. Funk, and M. Rauterberg, â€œCan Machine Learning explain Human Learningâ€¯?,â€ Neurocomputing, 2015.
[5] H. Y. Chen, C. H. Chuang, Y. J. Yang, and T. P. Wu, â€œExploring the risk factors of preterm birth using data mining,â€ Expert Syst. Appl., vol. 38, no. 5, pp. 5384â€“5387, 2011.
[6] H. Ar Rosyid, M. Palmerlee, and K. Chen, â€œDeploying learning materials to game content for serious education game developmentâ€¯: A case study,â€ Entertain. Comput., vol. 26, no. March 2017, pp. 1â€“9, 2018.
[7] M. D. Jaelani, A. P. Wibawa, and U. Pujianto, â€œTechnology acceptance model of student ability and tendency classification system,â€ Bull. Soc. Informatics Theory Appl., vol. 2, no. 2, pp. 47â€“57, 2018.
[8] A. S. B. Asmoro, W. S. G. Irianto, and U. Pujianto, â€œPerbandingan Kinerja Hasil Seleksi Fitur pada Prediksi Kinerja Akademik Siswa,â€ J. Edukasi dan Penelit. Inform., vol. 4, no. 2, pp. 84â€“89, 2018.
[9] R. A. Mollineda, V. Garcia, J. S. Sanchez, and R. Martin-felez, â€œSurrounding neighborhood-based SMOTE for learning from imbalanced data sets,â€ Prog Artif Intell, vol. 1, pp. 347â€“362, 2012.
[10] N. V Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, â€œSMOTEâ€¯: Synthetic Minority Over-sampling Technique,â€ J. Artif. Intell. Res. 16, vol. 16, pp. 321â€“357, 2002.
[11] H. Li, D. Pi, and C. Wang, â€œThe Prediction of Protein-Protein Interaction Sites Based on RBF Classifier Improved by SMOTE,â€ Math. Probl. Eng., vol. 2014, pp. 1â€“7, 2014.
[12] H. S. Khamis, K. W. Cheruiyot, and S. Kimani, â€œApplication of k- Nearest Neighbour Classification in Medical Data Mining,â€ Int. J. Inf. Commun. Technol. Res., vol. 4, no. 4, pp. 121â€“128, 2014.
[13] A. Giri, M. V. V. Bhagavath, B. Pruthvi, and N. Dubey, â€œA Placement Prediction System using k-nearest neighbors classifier,â€ Proc. - 2016 2nd Int. Conf. Cogn. Comput. Inf. Process. CCIP 2016, pp. 3â€“6, 2016.
[14] P.-N. Tan, M. Steinbach, and Vipin Kumar, Introduction to data mining. 2006.
[15] M. Junker, R. Hoch, and A. Dengel, â€œOn the Evaluation of Document Analysis Components by Recall, Precision, and Accuracy,â€ Proc. Fifth Int. Conf. Doc. Anal. Recognit., 1999.
[16] D. M. W. Powers, â€œEvaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness & Correlation,â€ J. Mach. Learn. Technol., vol. 2, no. 1, pp. 37â€“63, 2011.

CAN K-NEAREST NEIGHBOR METHOD BE USED TO PREDICT SUCCESS IN INDONESIA STATE UNIVERSITY STUDENT SELECTION

Authors

DOI:

Keywords:

Abstract

Downloads

References

Additional Files

Published

Issue

Section

Citation Check

Make a Submission

system

TOOLS

tanggal_penting

Important Date

template2

certificate

histats

purcase_contact

Purchase Contact

Information