A Data-Driven Ensemble Learning Model For Heart Disease Prediction Using Feature Representation And Classification

Authors

  • Nithya Shree A. P
  • R. Kannan

Keywords:

heart disease, prediction, machine learning, feature representation, accuracy

Abstract

Heart disease is one of the most common causes of death in the general population. The prognosis of patients with heart conditions is greatly impacted by early detection. Several recognized factors can contribute to life-threatening cardiac problems, such as Age, sex, heart rate, cholesterol, and sugar. However, an expert may find it challenging to assess each patient while considering these factors due to the large number of variables. The work suggest assessing patients' risk of cardiovascular disease by combining Machine Learning (ML) and Deep Learning (DL) with feature augmentation techniques to form an ensemble model. The DenseNet, Gated Network Model (GNM) and Multi-Layer Perceptron’s (MLP) are combined to form the ensemble model. The results of the proposed methods demonstrate a significant improvement, particularly for a condition that impacts a large population, surpassing previous methods by 4.4% and achieving a 95.89% accuracy rate

Downloads

Download data is not yet available.

References

Jensen, L. J. Jensen, and S. Brunak, ‘‘Mining electronic health records: Towards better research applications and clinical care,’’ Nature Rev. Genet., vol. 13, no. 6, pp. 395–405, Jun. 2012.

Li, C. Bai, and C. K. Reddy, ‘‘A distributed ensemble approach for mining healthcare data under privacy constraints,’’ Inf. Sci., vol. 330, pp. 245–259, Feb. 2016.

] Khedr, Z. A. L. Aghbari, and I. Kamel, ''Privacy-preserving decomposable mining association rules on distributed data,'' Int. J. Eng. Technol., vol. 7, nos. 3–13, pp. 157–162, 2018.

Khedr and R. Bhatnagar, ‘‘New algorithm for clustering distributed data using K-means,’’ Comput. Information., vol. 33, pp. 1001–1022, Oct. 2014.

Subhashini and M. K. Jeyakumar, ‘‘OF-KNN technique: An approach for chronic kidney disease prediction,’’ Int. J. Pure Appl. Math., vol. 116, no. 24, pp. 331–348, 2017.

Nathan, P. M. Kumar, P. Panchatcharam, G. Manogaran, and R. Varadharajan, ''A novel Gini index decision tree data mining method with neural network classifiers for prediction of heart disease,'' Des. Automat. Embedded Syst., vol. 22, no. 3, p. 225, 2018

Hsu, G. Manogaran, P. Panchatcharam, and S. Vivekanandan, ''A new approach for prediction of lung carcinoma using backpropagation neural network with decision tree classifiers,'' in Proc. IEEE 8th Int. Symp. Cloud Service Comput. (SC), Nov. 2018, pp. 111–115

Ramasamy and K. Nirmala, ''Disease prediction in data mining using association rule mining and keyword-based clustering algorithms,'' Int. J. Comput. Appl., vol. 42, no. 1, pp. 1–8, Jan. 2020

Bakar, Z. Kefli, S. Abdullah, and M. Sahani, ‘‘Predictive models for dengue outbreak using multiple rulebase classifiers,’’ in Proc. Int. Conf. Electr. Eng. Informat., Jul. 2011, pp. 1–6

Hariharan, R. Umadevi, T. Stephen, and S. Pradeep, ‘‘Burden of diabetes and hypertension among people attending health camps in an urban area of Kancheepuram district,’’ Int. J. Community Med. Public Health, vol. 5, no. 1, p. 140, Dec. 2017.

[Tun, G. Arunagirinathan, S. K. Munshi, and J. M. Pappachan, ‘‘Diabetes mellitus and stroke: A clinical update,’’ World J. Diabetes, vol. 8, no. 6, pp. 235–248, Jun. 2017

Ley, O. Hamdy, V. Mohan, and F. B. Hu, ‘‘Prevention and management of type 2 diabetes: Dietary components and nutritional strategies,’’ Lancet, vol. 383, no. 9933, pp. 1999–2007, Jun. 2014

Meng, Y.-X. Huang, D.-P. Rao, Q. Zhang, and Q. Liu, ‘‘Comparison of three data mining models for predicting diabetes or prediabetes by risk factors,’’ Kaohsiung J. Med. Sci., vol. 29, no. 2, pp. 93–99, Feb. 2013

Bashir, U. Qamar, and F. H. Khan, ‘‘IntelliHealth: A medical decision support application using a novel weighted multi-layer classifier ensemble framework,’’ J. Biomed. Inform., vol. 59, pp. 185–200, Feb. 201

Nai-Arun and R. Moungmai, ''Comparison of classifiers for the risk of diabetes prediction,'' Procedia Comput. Sci., vol. 69, pp. 132–142, Nov. 2015

Calheiros, K. Ramamohanarao, R. Buyya, C. Leckie, and S. Versteeg, ‘‘On the effectiveness of isolation-based anomaly detection in cloud data centers,’’ Concurrency Comput., Pract. Exper., vol. 29, no. 18, p. e4169, Sep. 2017.

Chen, X. Shi, and Y. D. Wong, ‘‘Key feature selection and risk prediction for lane-changing behaviors based on vehicles’ trajectory data,’’ Accident Anal. Prevention, vol. 129, pp. 156–169, Aug. 2019.

Shin, A. Abraham, and S. Y. Han, ‘‘Improving kNN text categorization by removing outliers from training set,’’ in Computational Linguistics and Intelligent Text Processing, vol. 3878, A. Gelbukh, Ed. Berlin, Germany: Springer, 2006, pp. 563–566.

[Alfian, M. Syafrudin, B. Yoon, and J. Rhee, ''False-positive RFID detection using classification models,'' Appl. Sci., vol. 9, no. 6, p. 1154, Mar. 2019

] Farquad and I. Bose, ‘‘Preprocessing unbalanced data using support vector machine,’’ Decis. Support Syst., vol. 53, no. 1, pp. 226–233, Apr. 2012.

Hardiman and K. Uchida, ''Data- and algorithm-hybrid approach for imbalanced data problems in deep neural network,'' Int. J. Mach. Learn. Comput., vol. 8, no. 3, pp. 208–213, Jun. 2018

Lemaitre, F. Nogueira, and C. K. Aridas, ‘‘Imbalanced-learn: A Python toolbox to tackle the curse of imbalanced datasets in machine learning,’’ J. Mach. Learn. Res., vol. 18, no. 1, pp. 559–563, 2017

Gupta, R. Kumar, H. Singh Arora, and B. Raman, ‘‘MIFH: A machine intelligence framework for heart disease diagnosis,’’ IEEE Access, vol. 8, pp. 14659–14674, 2020.

] Pasha and E. S. Mohamed, ''Bio-inspired ensemble feature selection (BEFS) model with machine learning and data mining algorithms for disease risk prediction,'' in Proc. 5th Int. Conf. Comput., Commun., Control Autom. (ICCUBEA), Pune, India, Sep. 2019, pp. 1–

Sabahi, ‘‘Bimodal fuzzy analytic hierarchy process (BFAHP) for coronary heart disease risk assessment,’’ J. Biomed. Informat., vol. 83, pp. 204–216, Jul. 2018.

Maji and S. Arora, ‘‘Decision tree algorithms for prediction of heart disease,’’ in Information and Communication Technology for Competitive Strategies, vol. 40. Singapore: Springer, 2019, pp. 447–454.

Reddy, M. P. K. Reddy, K. Lakshmana, D. S. Rajput, R. Kaluri, and G. Srivastava, ''Hybrid genetic algorithm and a fuzzy logic classifier for heart disease diagnosis,'' Evol. Intell., vol. 13, no. 2, pp. 185–196, Jun. 2020.

[Zhu, Z. Ni, L. Ni, F. Jin, M. Cheng, and J. Li, ‘‘Improved discrete artificial fish swarm algorithm combined with margin distance minimization for ensemble pruning,’’ Comput. Ind. Eng., vol. 128, pp. 32–46, Feb. 2019.

Tanveer, C. Gautam, and P. N. Suganthan, ‘‘Comprehensive evaluation of twin SVM based classifiers on UCI datasets,’’ Appl. Soft Comput., vol. 83, Oct. 2019, Art. no. 105617.

[Paul, P. C. Shill, M. R. I. Rabin, and K. Murase, ‘‘Adaptive weighted fuzzy rule-based system for the risk level assessment of heart disease,’’ Int. J. Speech Technol., vol. 48, no. 7, pp. 1739–1756, Jul. 2018.

UCI Machine Learning Repository. (2015). Chronic_Kidney_Disease Data Set. [Online]. Available: https://archive.ics.uci.edu/ml/datasets/ chronic_kidney_disease.

Downloads

Published

2025-07-15

How to Cite

1.
Shree A. P N, Kannan R. A Data-Driven Ensemble Learning Model For Heart Disease Prediction Using Feature Representation And Classification. J Neonatal Surg [Internet]. 2025Jul.15 [cited 2025Oct.14];14(32S):5301-1. Available from: https://jneonatalsurg.com/index.php/jns/article/view/8291