Clustering Multi-Indicator Learning Outcomes of Vocational High School Students: A Comparison of K-Means and DBSCAN

Authors

  • Muhammad Fikri Aqil Universitas Negeri Makassar
  • Irwansyah Suwahyu UIN Sunan Kalijaga Yogyakarta

DOI:

https://doi.org/10.66053/aieds.v1i2.24

Keywords:

K-Means algorithm, DBSCAN, Student clustering, Vocational education, Learning outcomes

Abstract

Purpose – This study aims to compare the performance of K-Means and DBSCAN algorithms in clustering vocational high school students’ learning outcomes in the Network Administration subject to support data-driven educational decision making.
Methods – A quantitative experimental approach was employed using secondary academic data from vocational students. The variables analyzed included final examination scores, midterm examination scores, assignments, attendance, attitudes, and learning activities. Clustering was conducted using K-Means and DBSCAN algorithms implemented through data analysis software. Cluster quality and separation were evaluated using silhouette coefficients to assess the effectiveness of each algorithm in grouping student learning outcomes.
Findings – The results show that K-Means produces relatively stable and interpretable clusters when student performance data exhibit more uniform distributions. In contrast, DBSCAN demonstrates stronger capability in handling noisy data and identifying students with extreme performance levels as outliers. Both algorithms successfully reveal meaningful patterns in student learning outcomes, but differ in their sensitivity to data distribution and noise.
Research limitations – This study is limited to a single vocational subject and one institutional context, which may restrict the generalizability of the findings to other vocational domains.
Originality – This study provides empirical evidence on the comparative performance of partition-based and density-based clustering algorithms using multi-indicator learning outcome data in vocational education.

References

Aldowah, H., Al-Samarraie, H., & Fauzy, W. M. (2020). Educational data mining and learning analytics for 21st century higher education: A review and synthesis. Telematics and Informatics, 37, 13–49. https://doi.org/10.1016/j.tele.2019.01.007

Baig, M. I., Shuib, L., & Yadegaridehkordi, E. (2020). Big data in education: A state of the art, limitations, and future research directions. International Journal of Educational Technology in Higher Education, 17(1), 44. https://doi.org/10.1186/s41239-020-00223-0

Cahapin, E. L., Malabag, B. A., Santiago Jr., C. S., Reyes, J. L., Legaspi, G. S., & Adrales, K. L. (2023). Clustering of students admission data using k-means, hierarchical, and DBSCAN algorithms. Bulletin of Electrical Engineering and Informatics, 12(6), 3647–3656. https://doi.org/10.11591/eei.v12i6.4849

Daniel, B. (2020). B ig D ata and analytics in higher education: Opportunities and challenges. British Journal of Educational Technology, 46(5), 904–920. https://doi.org/10.1111/bjet.12230

DeFreitas, K., & Bernard, M. (2015). Comparative performance analysis of clustering techniques in educational data mining. Educational Data Mining, 10, 65–67.

Divayana, D. G. H., & Adiarta, A. (2024). Development of cse-ucla evaluation model modified by using weighted product in order to optimize digital library services in higher education of computer in bali. Jurnal Pendidikan Vokasi, 7(3).

Dutt, A., Ismail, M. A., & Herawan, T. (2023). A Systematic Review on Educational Data Mining. IEEE Access, 5, 15991–16005. https://doi.org/10.1109/ACCESS.2017.2654247

Enjelika, D., Hariani, N. P., & Sutabri, T. (2025). Perbandingan kinerja algoritma k-means dan dbscan dalam pengelompokan data nilai kelas viii a pada SMPN 01 PALEMBANG. 5(1).

Fawzia Omer, A., Mohammed, H. A., Awadallah, M. A., Khan, Z., Abrar, S. U., & Shah, M. D. (2022). Big Data Mining Using K-Means and DBSCAN Clustering Techniques. In M. Ouaissa, Z. Boulouard, M. Ouaissa, I. U. Khan, & M. Kaosar (Eds.), Big Data Analytics and Computational Intelligence for Cybersecurity (Vol. 111, pp. 231–246). Springer International Publishing. https://doi.org/10.1007/978-3-031-05752-6_15

Fenglei Ma, Weifeng Liu, & Tao Ma. (2025). Automatic differentiation application on stochastic finite element. 2010 International Conference on Computer, Mechatronics, Control and Electronic Engineering, 5609726. https://doi.org/10.1109/CMCE.2010.5609726

Gros, B., & García-Peñalvo, F. J. (2023). Future Trends in the Design Strategies and Technological Affordances of E-learning. In J. M. Spector, B. B. Lockee, & M. D. Childress (Eds.), Learning, Design, and Technology (pp. 345–367). Springer International Publishing. https://doi.org/10.1007/978-3-319-17461-7_67

Hasibuan, M. S., Lubis, A. H., & Sari, M. N. (2024). Perbandingan algoritma clustering DBSCAN dan K-Means dalam pengelompokan siswa terbaik. INFOTECH : Jurnal Informatika & Teknologi, 5(2), 301–309. https://doi.org/10.37373/infotech.v5i2.1457

Hooshyar, D., Yang, Y., Pedaste, M., & Huang, Y.-M. (2023). Clustering Algorithms in an Educational Context: An Automatic Comparative Approach. IEEE Access, 8, 146994–147014. https://doi.org/10.1109/ACCESS.2020.3014948

Irnanda, K. F., Windarto, A. P., Hartama, D., & Wanto, A. (2025). ANALISA METODE DATA MINING PADA PENGELOMPOKAN LAPANGAN KERJA INFORMAL SEKTOR NON-PERTANIAN. KOMIK (Konferensi Nasional Teknologi Informasi Dan Komputer), 3(1). https://doi.org/10.30865/komik.v3i1.1673

Luqman Ibrahim, A., & Mohammed, M. G. (2022). A new three-term conjugate gradient method for training neural networks with global convergence. Indonesian Journal of Electrical Engineering and Computer Science, 28(1), 551. https://doi.org/10.11591/ijeecs.v28.i1.pp551-558

Mohamed Nafuri, A. F., Sani, N. S., Zainudin, N. F. A., Rahman, A. H. A., & Aliff, M. (2022). Clustering Analysis for Classifying Student Academic Performance in Higher Education. Applied Sciences, 12(19), 9467. https://doi.org/10.3390/app12199467

Narang, H., Wu, F., & Ogunniyan, A. (2024). Numerical Solutions of Heat and Mass Transfer with the First Kind Boundary and Initial Conditions in Capillary Porous Cylinder Using Programmable Graphics Hardware. International Journal of Advanced Computer Science and Applications, 7(6). https://doi.org/10.14569/IJACSA.2016.070607

Oyelade, O. J., Oladipupo, O. O., & Obagbuwa, I. C. (2020). Application of k Means Clustering algorithm for prediction of Students Academic Performance. https://doi.org/10.48550/ARXIV.1002.2425

Picciano, A. G. (2022). The Evolution of Big Data and Learning Analytics in American Higher Education. Online Learning, 16(3). https://doi.org/10.24059/olj.v16i3.267

Rahman, G. A., Hasbi, I., Tenriawaru, A., Surimi, L., & Alfiyan, A. N. (2025). Penggunaan Algoritma Dbscan Dalam Pengelompokan Kabupaten/Kota Di Sulawesi Tenggara Berdasarkan Indikator Pendidikan. Simtek : Jurnal Sistem Informasi Dan Teknik Komputer, 10(1), 184–193. https://doi.org/10.51876/simtek.v10i1.1546

Romero, C., & Ventura, S. (2023). Educational Data Mining: A Review of the State of the Art. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 40(6), 601–618. https://doi.org/10.1109/TSMCC.2010.2053532

Safii, M., Wahyudi, M., Solikhun, Zarlis, M., & Effendi, S. (2021). Food Self-sufficiency Decision Support Model Based on Provinces in Indonesia Using the Clustering Method. Journal of Physics: Conference Series, 1255(1), 012068. https://doi.org/10.1088/1742-6596/1255/1/012068

Salman, M. D., Rahmaddeni, R., Pratama, N. R., A, M. N. F., Setiawan, A. A., Zalianti, F., & Huda, I. B. (2025). Perbandingan Kinerja Algoritma Clustering K-Means dan K-Medoids dalam Pengelompokan Sekolah di Provinsi Riau Berdasarkan Ketersediaan Sarana dan Prasarana. MALCOM: Indonesian Journal of Machine Learning and Computer Science, 5(3), 797–806. https://doi.org/10.57152/malcom.v5i3.1950

Shahiri, A. M., Husain, W., & Rashid, N. A. (2022). A Review on Predicting Student’s Performance Using Data Mining Techniques. Procedia Computer Science, 72, 414–422. https://doi.org/10.1016/j.procs.2015.12.157

Utomo, M. H. T. (2023). Clustering Data Siswa Putus Sekolah Dengan Algoritma K-Means Dan DBSCAN. 18(2).

Xu, D., & Tian, Y. (2015). A Comprehensive Survey of Clustering Algorithms. Annals of Data Science, 2(2), 165–193. https://doi.org/10.1007/s40745-015-0040-1

Downloads

Published

2026-02-07