Exploring an Efficient Machine Learning Technique for Diabetes Mellitus Prediction
DOI:
https://doi.org/10.63682/jns.v14i6.5343Keywords:
Machine learning, Kappa statistic, Bagging, Diabetes, Confusion matrixAbstract
Diabetes Mellitus is the most common disease characterized by hyperglycemia resulting from a deficit in the production or action of insulin. Untreated diabetes can lead to a myriad of complications. In 2021, it was estimated that 537 million individuals have diabetes, which is expected to increase to over 783 million by 2045. Preventing and managing this epidemic poses a significant challenge owing to various issues and barriers such as inadequate access to healthcare and lack of surveillance data. However, the advent of machine-learning (ML) techniques can easily address these critical problems. The main goal of this project is to develop a model capable of accurately predicting diabetes in patients. Five machine-learning classification algorithms were employed in this analysis to detect diabetes. The experiments were performed using a diabetes prediction dataset sourced from Kaggle. The efficiency of all five algorithms was assessed using various metrics such as the Kappa statistic, F-measure, precision, true positive rate, and recall. Accuracy was evaluated based on correctly classified and incorrectly classified instances. The results specify that the bagging algorithm outperformed the various other algorithms with the highest accuracy of 97.16% compared to other different algorithms..
Downloads
Metrics
References
Satyanarayanan. S, Srikanta Murthy K, S. Chitnis. “A comprehensive survey of analysis of heart sounds using machine learning techniques to detect heart diseases.” J. Popul.Ther. Clin. Pharmacol, vol.30, no. May, pp.375-384, 2023.
Swaminathan. S, Krishnamurthy, S. M., Gudada, C., Mallappa, S. K., and Ail, N. “Heart Sound Analysis with Machine Learning Using Audio Features for Detecting Heart Diseases”. International Journal of Computer Information Systems and Industrial Management Applications, 16(2), 17-17 (2024).
Sathyanarayanan. S and Sanjay Chitnis. “A survey of machine learning in healthcare.” Artificial Intelligence Applications for Health Care, I., M.K. Ahirwal, N.D. Londhe, and A. Kumar, Eds. Boca Raton: Taylor and Francis Ltd, (2022): 1-22.
Haleem, Abid, Mohd Javaid and Ibrahim Haleem Khan. “Current status and applications of Artificial Intelligence (AI) in medical field: An overview.” Current Medicine Research and Practice 9.6 (2019): 231-237.
Mujumdar, Aishwarya and Vaidehi. V. “Diabetes prediction using machine learning algorithms.” Procedia Computer Science 165 (2019): 292-299.
Nielen, Johannes TH, et al. “Severity of diabetes mellitus and total hip or knee replacement: a population-based case–control study.” Medicine 95.20 (2016): e3739.
Lemp, George F, et al. “Association between the severity of diabetes mellitus and coronary arterial atherosclerosis.” The American journal of cardiology 60.13 (1987): 1015-1019.
Rani, Jyoti. KM. “Diabetes prediction using machine learning.” International
Journal of Scientific Research in Computer Science Engineering and Information Technology 6 (2020): 294-305
Osuwa, Abdulhafis Abdulazeez and Hu¨seyin Oztoprak. “Importance of continuous¨ improvement of machine learning algorithms From A Health Care Management and Management Information Systems Perspective.” 2021 International Conference on Engineering and Emerging Technologies (ICEET). IEEE, (2021).
S. Swaminathan, S.M.K, C. Gudada, and S.K. Mallappa, “Use of Audio Transfer Learning To Analyse Heart Sounds For Detecting Heart Diseases”, Nanotechnology Perceptions, vol. 20, no. S11 (2024), pp. 1863-1878, 2024
Bekkering, Pjotr, et al. “The intricate association between gut microbiota and development of type 1, type 2 and type 3 diabetes.” Expert Review of Clinical Immunology 9.11 (2013): 1031-1041.
Karunasingha, Dulakshi Santhusitha Kumari. “Root mean square error or mean absolute error? Use their ratio as well.” Information Sciences 585 (2022): 609-629.
Pandey, Anand Kishor, and Dharmveer Singh Rajpoot. “A comparative study of classification techniques by utilizing WEKA.” 2016 International Conference on Signal Processing and Communication (ICSC). Ieee, (2016).
Obuandike, Georgina. N, Isah Audu and Alhasan John. “Analytical study of some selected classification algorithms in weka using real crime data.” (2015)
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
Terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.