Comparison of C4.5 and Naïve Bayes Algorithms for Predicting Student Achievement

Fadli Dwi Yulianto

Abstract


This study analyzes and predicts student achievement using two data mining methods, C4.5 and Naïve Bayes. The data cover factors that affect students' academic performance, such as previous grades, attendance, and parents' income. C4.5, a decision tree algorithm, is used to identify patterns in the data and derive rule-based decisions. Naïve Bayes, a probabilistic classifier, is used to estimate the probability of achievement from the distribution of the features. The C4.5 model classified students into the “Underachieving” and “Achieving” categories with perfect accuracy and a perfect F1-score for both classes. The Naïve Bayes model performed markedly worse, especially in recognizing “Achieving” students: although it correctly predicted all “Underachieving” students, it failed to detect any “Achieving” students, as shown by its F1-score of zero for that class.
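The per-class F1-score reported in the abstract can be illustrated with a minimal sketch. The labels below are hypothetical and invented purely for illustration; they are not the study's data, and the helper name `per_class_f1` is an assumption rather than anything taken from the paper. The sketch shows why a classifier that assigns every student to “Underachieving” recovers all students of that class yet receives an F1-score of zero for “Achieving”.

```python
def per_class_f1(y_true, y_pred, label):
    """Return (precision, recall, F1) treating `label` as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical ground truth: 6 "Underachieving" and 4 "Achieving" students.
y_true = ["Underachieving"] * 6 + ["Achieving"] * 4

# A degenerate model that predicts "Underachieving" for every student.
y_pred_all_under = ["Underachieving"] * len(y_true)

# A model that reproduces the ground truth exactly.
y_pred_perfect = list(y_true)

for name, y_pred in [("all-Underachieving", y_pred_all_under),
                     ("perfect", y_pred_perfect)]:
    for label in ("Underachieving", "Achieving"):
        precision, recall, f1 = per_class_f1(y_true, y_pred, label)
        print(f"{name:>18} | {label:<14} | "
              f"precision={precision:.2f} recall={recall:.2f} F1={f1:.2f}")
```

In this toy setting the degenerate model reaches full recall on “Underachieving” while its precision, recall, and F1 for “Achieving” are all zero, which matches the pattern the abstract describes for Naïve Bayes. It says nothing about why the model behaved that way on the actual data.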



DOI: https://doi.org/10.55311/aiocsit.v4i2.311
