Klasterisasi Tingkat Kemiskinan di Indonesia menggunakan Algoritma K-Means
Abstract
Poverty is one of the deep social challenges around the world and is a major focus in the global development agenda. This article discusses the role of clustering methods in analyzing and understanding poverty issues. We use data from Statistics Indonesia (BPS) on 34 provinces in Indonesia to classify groups of people who are vulnerable to poverty. Clustering analysis helps us identify characteristics that may be overlooked by conventional approaches, which in turn enables the development of more targeted and effective solutions to poverty. We use the K-Means method in our analysis and present it within the framework of the CRISP-DM methodology. The results show that almost 95% of the poor in Indonesia belong to the 'Poor' group. Therefore, we recommend effective actions based on indicators that are the main factors of poverty, as well as designing specific policies for regions with similar characteristics. This article aims to contribute to the global effort to end poverty and achieve the vision of equitable and inclusive sustainable development.
References
E. Tania, “Upaya United Nations Development Programmed Melalui Proyek SWARGA dalam Meningkatkan Jumlah Perempuan sebagai Pengambil Keputusan di DPRD Provinsi DKI Jakarta,” Universitas Katolik Parahyangan, Bandung, 2017. Accessed: Sep. 30, 2023. [Online]. Available: https://repository.unpar.ac.id
A. S. Alisjahbana and E. Murniningtyas, Tujuan pembangunan berkelanjutan di Indonesia : konsep, target, dan strategi implementasi. 2018.
A. Silomba, Muh. Massyat, and Muh. S. Tajuddin, “Peranan Badan Perencanaan Pembangunan, Penelitian Dan Pengembangan Daerah Dalam Proses Perumusan Kebijakan Pembangunan di Kabupaten Mamasa,” Journal Peqguruang: Conference Series, vol. 2, no. 2, p. 423, Nov. 2020, doi: 10.35329/jp.v2i2.1519.
N. I. Febianto and N. Palasara, “Analisa Clustering K-Means Pada Data Informasi Kemiskinan Di Jawa Barat Tahun 2018,” Jurnal Sisfokom (Sistem Informasi dan Komputer), vol. 8, no. 2, pp. 130–140, Aug. 2019, doi: 10.32736/sisfokom.v8i2.653.
H. Liu, Y. Liu, R. Zhang, and X. Wu, “A Clustering Algorithm via Density Perception and Hierarchical Aggregation Based on Urban Multimodal Big Data for Identifying and Analyzing Categories of Poverty-Stricken Households in China,” Sci Program, vol. 2021, 2021, doi: 10.1155/2021/6692975.
N. T. Luchia et al., “Perbandingan K-Means dan K-Medoids Pada Pengelompokan Data Miskin di Indonesia,” MALCOM: Indonesian Journal of Machine Learning and Computer Science, vol. 2, no. 2, pp. 35–41, 2022.
M. A. Rahman, N. S. Sani, R. Hamdan, Z. A. Othman, and A. A. Bakar, “A clustering approach to identify multidimensional poverty indicators for the bottom 40 percent group,” PLoS One, vol. 16, no. 8 August, Aug. 2021, doi: 10.1371/journal.pone.0255312.
A. Y. Clark, N. Blumenfeld, E. Lal, S. Darbari, S. Northwood, and A. Wadpey, “Using k-means cluster analysis and decision trees to highlight significant factors leading to homelessness,” Mathematics, vol. 9, no. 17, Sep. 2021, doi: 10.3390/math9172045.
R. Y. Endartyana, “Pengelompokan Kabupaten/Kota Di Provinsi Papua Berdasarkan Indikator Angka Kemiskinan,” ,” Institut Teknologi Sepuluh Nopember , Surabaya, 2019.
M. P. Repollo, R. Aurelius, and C. Robielos, “Applying Clustering Algorithm on Poverty Analysis in a Community in the Philippines,” IEOM Society International, pp. 1511–1521, 2021.
N. Sepriyanti, R. Sani Nahampun, M. H. Zikri, I. Ambarani, and A. Rahmadeyan, “Penerapan K-Means Clustering Untuk Mengelompokkan Tingkat Kemiskinan di Provinsi Riau,” SENTIMAS: Seminar Nasional Penelitian dan Pengabdian Masyarakat, pp. 59–65, 2022, [Online]. Available: https://journal.irpi.or.id/index.php/sentimas
M. Tadi and B. Arnawisuda Ningsi, “Analisis Klaster Kemiskinan Kabupaten Kota Di Provinsi Banten Menggunakan Metode K-Means,” Lebesgue: Jurnal Ilmiah Pendidikan Matematika, Matematika dan Statistika, vol. 4, no. 1, pp. 374–385, 2023, doi: 10.46306/lb.v4i1.
F. Dwi Ratna Sari and S. Partiwi Ediwijojo, “Clustering Analysis Using K-Medoids on Poverty Level Problems in Central Java by District/City,” KnE Social Sciences, pp. 78–87, May 2023, doi: 10.18502/kss.v8i9.13321.
H. Sofyan, M. Iqbal, M. Marzuki, and M. Muhammad, “The comparison of k-modes clustering and ROCK clustering to the poverty indicator in Samadua Subdistrict, South Aceh,” IOP Conf Ser Mater Sci Eng, vol. 1087, no. 1, p. 012085, Feb. 2021, doi: 10.1088/1757-899x/1087/1/012085.
Z. Sitorus, “Penerapan Data Mining Untuk Clustering Penduduk Miskin Di Kota Tanjungbalai Menggunakan Metode Algoritma K-Means,” Journal of Science and Social Research, no. 1, pp. 212–218, 2024, [Online]. Available: http://jurnal.goretanpena.com/index.php/JSSR
A. Bahauddin, “Analisis Clustering Provinsi Di Indonesia Berdasarkan Tingkat Kemiskinan Menggunakan Algoritma K-Means,” MISI (Jurnal Manajemen informatika & Sistem Informasi), vol. 4, no. 1, Jan. 2021.
K. Niswatin, “Clustering Of Districts And Cities In Indonesia Based On Poverty Indicators Using The K-Means Method,” International Conference on Computing, Mathematics and Statistics, 2021.
N. Istiqamah, O. Soesanto, and D. Anggraini, “Application of the K-Means algorithm to determine poverty status in Hulu Sungai Tengah,” J Phys Conf Ser, vol. 2106, no. 1, Nov. 2021, doi: 10.1088/1742-6596/2106/1/012027.
R. Novita et al., “Penerapan Algoritma K-Means dan Analisisnya untuk Menentukan Kebijakan Strategis Penyelesaian Studi Mahasiswa,” Jurnal SAINTIKOM, vol. 22, pp. 401–413, 2023, [Online]. Available: https://ojs.trigunadharma.ac.id/index.php/jis/index
M. Atiqul Mutaqin and W. Andriyani, “KLASTERISASI DATA DISABILITAS MENGGUNAKAN ALGORITMA K-MEANS,” IJIR, vol. 3, no. 1, pp. 25–35, 2022.
P.-N. Tan, M. Steinbach, and V. Kumar, Introduction to data mining.
M. Cui, “Introduction to the K-Means Clustering Algorithm Based on the Elbow Method,” Clausius Scientific Press, Canada, vol. 1, pp. 5–8, 2020, doi: 10.23977/accaf.2020.010102.
V. Annisa Ekasetya and A. Jananto, “Klusterisasi Optimal Dengan Elbow Method Untuk Pengelompokan Data Kecelakaan Lalu Lintas Di Kota Semarang,” Dinamika Informatika, vol. 12, no. 1, pp. 20–28, 2020.
N. Syahfitri, E. Budianita, A. Nazir, and I. Afrianty, “Pengelompokan Produk Berdasarkan Data Persediaan Barang Menggunakan Metode Elbow dan K-Medoid,” KLIK: Kajian Ilmiah Informatika dan Komputer , vol. 4, no. 3, pp. 1668–1675, 2023, doi: 10.30865/klik.v4i3.1525.
G. Ogbuabor and U. F. N, “Clustering Algorithm for a Healthcare Dataset Using Silhouette Score Value,” International Journal of Computer Science and Information Technology, vol. 10, no. 2, pp. 27–37, Apr. 2018, doi: 10.5121/ijcsit.2018.10203.
A. Naghizadeh and D. N. Metaxas, “Condensed silhouette: An optimized filtering process for cluster selection in K-means,” in Procedia Computer Science, Elsevier B.V., 2020, pp. 205–214. doi: 10.1016/j.procs.2020.08.022.
K. R. Shahapure and C. Nicholas, “Cluster quality analysis using silhouette score,” in Proceedings - 2020 IEEE 7th International Conference on Data Science and Advanced Analytics, DSAA 2020, Institute of Electrical and Electronics Engineers Inc., Oct. 2020, pp. 747–748. doi: 10.1109/DSAA49011.2020.00096.
Copyright (c) 2024 Assyifa Khalif, Anisa Nur Hasanah, Muhammad Hafizh Ridwan, Betha Nurina Sari
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
1. Copyright on any article is retained by the author(s).
2. The author grants the journal, right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work’s authorship and initial publication in this journal.
3. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal’s published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
4. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
5. The article and any associated published material is distributed under the Creative Commons Attribution-ShareAlike 4.0 International License