Main Article Content

Harits Ar Roysid Aris Maulana Utomo Pujianto


Seleksi Nasional Masuk Perguruan Tinggi Negeri (SNMPTN) is one of the selection pathways for student admissions to enter state universities (PTN) in Indonesia. This study aims to predict the chance of being accepted in the desired PTN and the lack of early monitoring of students for SNMPTN. The data source from the grades reports card of SMAN 1 Pakong, SMAN 8 Kediri, and SMAN 1 Pamekasan by using the average input of compulsory subjects, majors (Science / Social Sciences) and semester 1 to semester 5 which later the output to be accepted or not accepted An imbalanced dataset potentially affect the performance of the classification method used. Hence, we need to eliminate the imbalance class using SMOTE. Using 10-fold cross validation, this study compared K-Nearest Neighbor (KNN) without SMOTE and K-NN with SMOTE. The goal is to find the best prediction model between the two methods. The prediction model is applied to software for teachers to monitor student grades and ensuring students to pass the SNMPTN. The results show that KNN without SMOTE has higher accuracy than KNN with SMOTE. However, KNN with SMOTE outperform than KNN without SMOTE in precision and recall, KNN with SMOTE with K = 3 reached 80.08% Accuracy, 74.42% Precision and 91.68% Recall.

Article Details

How to Cite
ROYSID, Harits Ar; MAULANA, Aris; PUJIANTO, Utomo. CAN K-NEAREST NEIGHBOR METHOD BE USED TO PREDICT SUCCESS IN INDONESIA STATE UNIVERSITY STUDENT SELECTION. Jurnal Ilmiah Kursor, [S.l.], v. 9, n. 4, oct. 2019. ISSN 2301-6914. Available at: <>. Date accessed: 26 may 2020. doi:


[1] A. T. Wibowo and D. Fitrianah, “a K-Nearest Algorithm Based Application To Predict Snmptn Acceptance for High School,” Int. Res. J. Comput. Sci., vol. 5, no. 1, pp. 9–20, 2018.
[2] R. Siringoringo, “Klasifikasi Data Tidak Seimbang Menggunakan Algoritma SMOTE Dan K-Nearest Neighbor,” J. ISD, vol. 3, no. 1, pp. 44–49, 2018.
[3] P. A. Santoso, A. P. Wibawa, and U. Pujianto, “Internship recommendation system using simple additive weighting,” Bull. Soc. Informatics Theory Appl., vol. 2, no. 1, pp. 15–21, 2018.
[4] M. Vahdat, L. Oneto, D. Anguita, M. Funk, and M. Rauterberg, “Can Machine Learning explain Human Learning ?,” Neurocomputing, 2015.
[5] H. Y. Chen, C. H. Chuang, Y. J. Yang, and T. P. Wu, “Exploring the risk factors of preterm birth using data mining,” Expert Syst. Appl., vol. 38, no. 5, pp. 5384–5387, 2011.
[6] H. Ar Rosyid, M. Palmerlee, and K. Chen, “Deploying learning materials to game content for serious education game development : A case study,” Entertain. Comput., vol. 26, no. March 2017, pp. 1–9, 2018.
[7] M. D. Jaelani, A. P. Wibawa, and U. Pujianto, “Technology acceptance model of student ability and tendency classification system,” Bull. Soc. Informatics Theory Appl., vol. 2, no. 2, pp. 47–57, 2018.
[8] A. S. B. Asmoro, W. S. G. Irianto, and U. Pujianto, “Perbandingan Kinerja Hasil Seleksi Fitur pada Prediksi Kinerja Akademik Siswa,” J. Edukasi dan Penelit. Inform., vol. 4, no. 2, pp. 84–89, 2018.
[9] R. A. Mollineda, V. Garcia, J. S. Sanchez, and R. Martin-felez, “Surrounding neighborhood-based SMOTE for learning from imbalanced data sets,” Prog Artif Intell, vol. 1, pp. 347–362, 2012.
[10] N. V Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, “SMOTE : Synthetic Minority Over-sampling Technique,” J. Artif. Intell. Res. 16, vol. 16, pp. 321–357, 2002.
[11] H. Li, D. Pi, and C. Wang, “The Prediction of Protein-Protein Interaction Sites Based on RBF Classifier Improved by SMOTE,” Math. Probl. Eng., vol. 2014, pp. 1–7, 2014.
[12] H. S. Khamis, K. W. Cheruiyot, and S. Kimani, “Application of k- Nearest Neighbour Classification in Medical Data Mining,” Int. J. Inf. Commun. Technol. Res., vol. 4, no. 4, pp. 121–128, 2014.
[13] A. Giri, M. V. V. Bhagavath, B. Pruthvi, and N. Dubey, “A Placement Prediction System using k-nearest neighbors classifier,” Proc. - 2016 2nd Int. Conf. Cogn. Comput. Inf. Process. CCIP 2016, pp. 3–6, 2016.
[14] P.-N. Tan, M. Steinbach, and Vipin Kumar, Introduction to data mining. 2006.
[15] M. Junker, R. Hoch, and A. Dengel, “On the Evaluation of Document Analysis Components by Recall, Precision, and Accuracy,” Proc. Fifth Int. Conf. Doc. Anal. Recognit., 1999.
[16] D. M. W. Powers, “Evaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness & Correlation,” J. Mach. Learn. Technol., vol. 2, no. 1, pp. 37–63, 2011.