Data Mining-Driven Multi-Feature Selection for Chronic Disease Forecasting

Authors

  • B Rama Ganesh
  • Praveen B M
  • Krishna Prasad K
  • G. Swapna
  • Viswanath G

DOI:

https://doi.org/10.52783/jns.v14.1993

Keywords:

Synergistic Feature Engineering, chronic disease, early prediction, machine learning, Voting Classifier, Ensemble learning

Abstract

a major worldwide health difficulty, continual sicknesses call for higher predictive fashions for early diagnosis and individualized treatment. the usage of a synergistic blend of strategies, this technique combines Recursive characteristic removal with pass-Validation (RFECV) and support Vector machine (SVM) for best characteristic selection throughout 8 wonderful datasets: Breast cancer, chronic Kidney, Diabetes danger, Erbil heart sickness, coronary heart disorder, Kidney disease, Pima Indians, and Wisconsin Breast. The technique stresses efficient dimensionality reduction so that the most pertinent facts is applied to improve model overall performance. With a voting Classifier combining AdaBoost decision Tree and ExtraTree obtaining outstanding performance across all datasets, ensemble learning is absolutely important. High-performance results from this era show its dependability and relevance for early continual disorder prediction. The method provides brilliant possibility for enhancing diagnostic accuracy and allowing spark off remedies by tackling feature selection issues and imposing ensemble learning, thereby enhancing healthcare management and results.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References

W. H. O. Diet, ‘‘Chronic diseases,’’ World Health Org., Geneva, Switzerland, 2003.

R. Sawhney, A. Malik, S. Sharma, and V. Narayan, ‘‘A comparative assessment of artificial intelligence models used for early prediction and evaluation of chronic kidney disease,’’ Decis. Anal. J., vol. 6, Mar. 2023, Art. no. 100169.

G. Viswanath “Machine-Learning-Based Cloud Intrusion Detection”, in International Journal of Mechanical Engineering Research and Technology, 2024, Vol.16, pp 38-52.

U. Ullah and B. Garcia-Zapirain, ‘‘Quantum machine learning revolution in healthcare: A systematic review of emerging perspectives and applica tions,’’ IEEE Access, vol. 12, pp. 11423–11450, 2024.

G.Viswanath & Dr.G.Swapna “Diabetes Diagnosis Using Machine Learning with Cloud Security” in Cuestiones de Fisioterapia, 2025, Vol.54, No.2, pp 417-431.

X. Xiong, X. Guo, P. Zeng, R. Zou, and X. Wang, ‘‘A short-term wind power forecast method via XGBoost hyper-parameters optimization,’’ Frontiers Energy Res., vol. 10, May 2022, Art. no. 905155.

X.-W. Chen and J. C. Jeong, ‘‘Enhanced recursive feature elimination,’’ in Proc. 6th Int. Conf. Mach. Learn. Appl. (ICMLA), Dec. 2007, pp. 429–435.

G.Viswanath “Improved Light GBM Model Performance Analysis and Comparison for Coronary Heart Disease Prediction”, in International Journal of Information Technology and Computer Engineering, 2024, Vol.12, pp.658-672.

A. Bohr and K. Memarzadeh, Artificial Intelligence in Healthcare. New York, NY, USA: Academic, 2020.

G.Viswanath & Dr.G.Swapna “Health Prediction Using Machine Learning with Drive HQ Cloud Security” in Frontiersin Health Informatics, 2024. Vol. 13, No. 8, pp. 2755-2761.

S. Agarwal and D. M. G. C. Prabha, ‘‘Chronic diseases prediction using machine learning—A review,’’ Ann. Rom. Soc. Cell Biol., vol. 25, no. 1, pp. 3495–3511, 2021.

A. Maleki, M. Raahemi, and H. Nasiri, ‘‘Breast cancer diagnosis from histopathology images using deep neural network and XGBoost,’’ Biomed. Signal Process. Control, vol. 86, Sep. 2023, Art. no. 105152.

M. J. Raihan, M. A.-M. Khan, S.-H. Kee, and A.-A. Nahid, ‘‘Detection of the chronic kidney disease using XGBoost classifier and explaining the influence of the attributes on the model using SHAP,’’ Sci. Rep., vol. 13, no. 1, p. 6263, Apr. 2023.

S. Hegde and M. R. Mundada, ‘‘Early prediction of chronic disease using an efficient machine learning algorithm through adaptive probabilistic divergence based feature selection approach,’’ Int. J. Pervasive Comput. Commun., vol. 17, no. 1, pp. 20–36, Feb. 2021.

E. Maini, B. Venkateswarlu, D. Marwaha, and B. Maini, ‘‘Upgrading the performance of machine learning based chronic disease prediction systems using stacked generalization technique,’’ Int. J. Comput. Digit. Syst., vol. 10, no. 1, pp. 1031–1039, Nov. 2021.

T. H. H.Aldhyani, A. S. Alshebami, and M. Y. Alzahrani, ‘‘Soft clustering for enhancing the diagnosis of chronic diseases over machine learning algorithms,’’ J. Healthcare Eng., vol. 2020, pp. 1–16, Mar. 2020.

O. A. Jongbo, A. O. Adetunmbi, R. B. Ogunrinde, and B. Badeji-Ajisafe, ‘‘Development of an ensemble approach to chronic kidney disease diagno sis,’’ Sci. Afr., vol. 8, Jul. 2020, Art. no. e00456.

E. Listiana, R. Muzayanah, M. A. Muslim, and E. Sugiharti, ‘‘Optimiza tion of support vector machine using information gain and AdaBoost to improve accuracy of chronic kidney disease diagnosis,’’ J. Soft Comput. Explor., vol. 4, no. 3, pp. 152–158, 2023.

Breast Cancer Wisconsin (Diagnostic) Data Set. Accessed: Nov. 2023. [Online]. Available: cancer-wisconsin-data https://www.kaggle.com/datasets/uciml/breast

L. O. D. Chicco and C. A. Lovejoy. (2021). Chronic Kidney Disease EHRs Abu Dhabi. [Online]. Available: https://www.kaggle.com/datasets/davidechicco/chronic-kidney-disease-ehrs-abu-dhabi

Early Stage Diabetes Risk Prediction Dataset. Accessed: Nov. 2023. [Online]. Available: https://archive.ics.uci.edu/dataset/529/early

Erbil Heart Disease Dataset. Accessed: Nov. 2023. [Online]. Available: https://www.kaggle.com/datasets/hangawqadir/erbil-heart-disease-dataset

Heart Disease Dataset. Accessed: Nov. 2023. [Online]. Available: https://www.kaggle.com/datasets/johnsmith88/heart-disease-dataset?select=heart.csv

Kidney Disease Dataset. Accessed: Nov. 2023. [Online]. Available: https://www.kaggle.com/datasets/akshayksingh/kidney-disease-dataset

Pima Indians Diabetes. Accessed: Nov. 2023. [Online]. Available: https://data.world/uci/pima-indians-diabetes

G.Swapna & K Bhaskar “Malaria Diagnosis Using Double Hidden Layer Extreme Learning Machine Algorithm With Cnn Feature Extraction And Parasite Inflator” International Journal of Information Technology and Computer Engineering, Vol. 12 , 2024, pp536-547,DOI: https://ijitce.org/index.php/ijitce/article/view/704

G.Swapna, “A Drug-Target Interaction Prediction Based on Supervised Probabilistic Classification” Journal of Computer Science, Vol.19, 2023, pp.1203-1211, DOI: https://doi.org/10.3844/jcssp.2023.1203.1211

Viswanath Gudditi, “A Smart Recommendation System for Medicine using Intelligent NLP Techniques”, 2022 International Conference on Automation, Computing and Renewable Systems (ICACRS), 2022, pp.1081-1084.

M. Chen, Y. Hao, K. Hwang, L. Wang, and L. Wang, ‘‘Disease prediction by machine learning over big data from healthcare communities,’’ IEEE Access, vol. 5, pp. 8869–8879, 2017.

G.Swapna & K Bhaskar “Early-Stage Autism Spectrum Disorder Detection Using Machine Learning “International Journal of HRM and Organizational Behavior, pp269-283, DOI: https://ijhrmob.org/index.php/ijhrmob/article/view/242

Wisconsin Breast Cancer Database. Accessed: Nov.2023. [Online]. Avail able: https://www.kaggle.com/datasets/roustekbio/breast-cancer-csv

Viswanath “Multiple Cancer Types Classified Using CTMRI Images Based On Learning Without Forgetting Powered Deep Learning Models”, in International Journal of HRM and Organizational Behavior, 2024, Vol.12, pp 243-253.

S. Palaniappan and R. Awang, ‘‘Intelligent heart disease prediction system using data miningtechniques,’’ in Proc. IEEE/ACSInt. Conf. Comput.Syst. Appl., Apr. 2008, pp. 108–115.

P. Hamet and J. Tremblay, ‘‘Artificial intelligence in medicine,’’ Metabolism, vol. 69, pp. S36–S40, Jan. 2017.

O. Khan, J. H. Badhiwala, G. Grasso, and M. G. Fehlings, ‘‘Use of machine learning and artificial intelligence to drive personalized medicine approaches for spine care,’’ World Neurosurgery, vol. 140, pp. 512–518, Aug. 2020.

Downloads

Published

2025-03-08

How to Cite

1.
Ganesh BR, B M P, Prasad K K, Swapna G, G V. Data Mining-Driven Multi-Feature Selection for Chronic Disease Forecasting. J Neonatal Surg [Internet]. 2025Mar.8 [cited 2025Mar.20];14(5S):108-24. Available from: https://jneonatalsurg.com/index.php/jns/article/view/1993