Conditional Generative Adversarial Network (CGAN) Based Data Oversampling And Gaussian Bobcat Optimization Algorithm (GBOA) For Heart Diseases Prediction

Authors

  • M. MadhanGiri
  • M. Santhalakshmi

Abstract

Early detection of cardiac problems and continuous patient monitoring by physicians can help reduce death rates. The classification of imbalanced datasets is an important task in machine learning. The number of samples in each class is not uniformly distributed; one class contains a large number of samples while the other has a small number. This often leads to higher classification accuracy for the majority category and lower classification accuracy for the minority category. In this paper, Conditional Generative Adversarial Network (CGAN) is a type of neural network to generate new data samples similar to a given training sample. CGAN is introduced to translate unbalanced samples from one form to another resulting in balanced samples. Gaussian Bobcat Optimization Algorithm (GBOA) is introduced for the importance of features and provides valuable insights into the relevance and predictive power of each feature in a heart disease dataset. GBOA is derived from the hunting strategy of bobcats during the attack towards the prey and the chase process between them. Natural behaviors of the bobcat in the wild, the strategy of this animal during hunting is much more prominent. Memory-Augmented Deep Autoencoder (MADAE) classifier consists of a fully connected three-layer neural network where the encoder contains input and hidden layers and the decoder part comprises hidden and output layers. By allowing early detection and treatment, accurate heart disease prediction can greatly reduce mortality. Cleveland Heart Disease Database (CHDD) is collected from University of California Irvine (UCI) repository. Experimental results show the effectiveness of the classifiers in terms of Precision, Recall, F-Measure, and Accuracy.

Downloads

Download data is not yet available.

References

Indrakumari, R., Poongodi, T. and Jena, S.R., 2020. Heart disease prediction using exploratory data analysis. Procedia Computer Science, 173, pp.130-139.

Fitriyani, N.L., Syafrudin, M., Alfian, G. and Rhee, J., 2020. HDPM: an effective heart disease prediction model for a clinical decision support system. IEEE Access, 8, pp.133034-133050.

Pathan, M.S., Nag, A., Pathan, M.M. and Dev, S., 2022. Analyzing the impact of feature selection on the accuracy of heart disease prediction. Healthcare Analytics, 2, pp.1-9.

Mohan, S., Thirumalai, C. and Srivastava, G., 2019. Effective heart disease prediction using hybrid machine learning techniques. IEEE access, 7, pp.81542-81554.

Bhatt, C.M., Patel, P., Ghetia, T. and Mazzeo, P.L., 2023. Effective heart disease prediction using machine learning techniques. Algorithms, 16(2), pp.1-14.

Yahaya, L., Oye, N.D. and Garba, E.J., 2020. A comprehensive review on heart disease prediction using data mining and machine learning techniques. American Journal of Artificial Intelligence, 4(1), pp.20-29.

Aremu, O.O., Hyland-Wood, D. and McAree, P.R., 2020. A machine learning approach to circumventing the curse of dimensionality in discontinuous time series machine data. Reliability Engineering & System Safety, 195, p.106706.

Pavithra, V. and Jayalakshmi, V., 2020, Review of feature selection techniques for predicting diseases. In 2020 5th International Conference on Communication and Electronics Systems (ICCES), pp. 1213-1217.

Cai, J., Luo, J., Wang, S. and Yang, S., 2018. Feature selection in machine learning: A new perspective. Neurocomputing, 300, pp.70-79.

Wang, G., Lauri, F. and El Hassani, A.H., 2021, A study of dimensionality reduction’s influence on heart disease prediction. In 2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA), pp. 1-6.

Rostami, M., Berahmand, K., Nasiri, E. and Forouzandeh, S., 2021. Review of swarm intelligence-based feature selection methods. Engineering Applications of Artificial Intelligence, 100, pp.1-31.

Nguyen, B.H., Xue, B. and Zhang, M., 2020. A survey on swarm intelligence approaches to feature selection in data mining. Swarm and Evolutionary Computation, 54, pp.1-27.

Saw, T. and Myint, P.H., 2019. Swarm intelligence based feature selection for high dimensional classification: a literature survey. Int. J. Comput, 33(1), pp.69-83.

Alrefai, N. and Ibrahim, O., 2022. Optimized feature selection method using particle swarm intelligence with ensemble learning for cancer classification based on microarray datasets. Neural Computing and Applications, 34(16), pp.13513-13528.

Maldonado, S. and López, J., 2018. Dealing with high-dimensional class-imbalanced datasets: Embedded feature selection for SVM classification. Applied Soft Computing, 67, pp.94-105.

Liu, H., Zhou, M. and Liu, Q., 2019. An embedded feature selection method for imbalanced data classification. IEEE/CAA Journal of Automatica Sinica, 6(3), pp.703-715.

Kamalov, F., Thabtah, F. and Leung, H.H., 2023. Feature selection in imbalanced data. Annals of Data Science, 10(6), pp.1527-1541.

Johnson, J.M. and Khoshgoftaar, T.M., 2019. Survey on deep learning with class imbalance. Journal of big data, 6(1), pp.1-54.

Rendon, E., Alejo, R., Castorena, C., Isidro-Ortega, F.J. and Granda-Gutierrez, E.E., 2020. Data sampling methods to deal with the big data multi-class imbalance problem. Applied Sciences, 10(4), pp.1-57.

Ghosh, K., Bellinger, C., Corizzo, R., Branco, P., Krawczyk, B. and Japkowicz, N., 2024. The class imbalance problem in deep learning. Machine Learning, 113(7), pp.4845-4901.

Chen, Z., Duan, J., Kang, L. and Qiu, G., 2021. Class-imbalanced deep learning via a class-balanced ensemble. IEEE transactions on neural networks and learning systems, 33(10), pp.5626-5640.

Sampath, V., Maurtua, I., Aguilar Martin, J.J. and Gutierrez, A., 2021. A survey on generative adversarial networks for imbalance problems in computer vision tasks. Journal of big Data, 8, pp.1-59.

Douzas, G. and Bacao, F., 2018. Effective data generation for imbalanced learning using conditional generative adversarial networks. Expert Systems with applications, 91, pp.464-471.

Ghosh, P., Azam, S., Jonkman, M., Karim, A., Shamrat, F.J.M., Ignatious, E., Shultana, S., Beeravolu, A.R. and De Boer, F., 2021. Efficient prediction of cardiovascular disease using machine learning algorithms with relief and LASSO feature selection techniques. IEEE Access, 9, pp.19304-19326.

Li, J.P., Haq, A.U., Din, S.U., Khan, J., Khan, A. and Saboor, A., 2020. Heart disease identification method using machine learning classification in e-healthcare. IEEE access, 8, pp.107562-107582.

Ishaq, A., Sadiq, S., Umer, M., Ullah, S., Mirjalili, S., Rupapara, V. and Nappi, M., 2021. Improving the prediction of heart failure patients’ survival using SMOTE and effective data mining techniques. IEEE access, 9, pp.39707-39716.

Wankhede, J., Kumar, M. and Sambandam, P., 2020. Efficient heart disease prediction‐based on optimal feature selection using DFCSS and classification by improved Elman‐SFO. IET systems biology, 14(6), pp.380-390.

Saranya, G. and Pravin, A., 2023. A novel feature selection approach with integrated feature sensitivity and feature correlation for improved prediction of heart disease. Journal of Ambient Intelligence and Humanized Computing, 14(9), pp.12005-12019.

Wadhawan, S. and Maini, R., 2022. ETCD: An effective machine learning based technique for cardiac disease prediction with optimal feature subset selection. Knowledge-Based Systems, 255, p.109709.

Alghamdi, F.A., Almanaseer, H., Jaradat, G., Jaradat, A., Alsmadi, M.K., Jawarneh, S., Almurayh, A.S., Alqurni, J. and Alfagham, H., 2024. Multilayer perceptron neural network with arithmetic optimization algorithm-based feature selection for cardiovascular disease prediction. Machine Learning and Knowledge Extraction, 6(2), pp.987-1008.

Dwarakanath, B., Latha, M., Annamalai, R., Kallimani, J.S., Walia, R. and Belete, B., 2022. A novel feature selection with hybrid deep learning based heart disease detection and classification in the e-healthcare environment. Computational Intelligence and Neuroscience, vol.2022, no.1167494, pp.1-12.

Zhou, G., Fan, Y., Shi, J., Lu, Y. and Shen, J., 2022. Conditional generative adversarial networks for domain transfer: a survey. Applied Sciences, 12(16), pp.1-26.

Benmamoun, Z., Khlie, K., Bektemyssova, G., Dehghani, M. and Gherabi, Y., 2024. Bobcat Optimization Algorithm: an effective bio-inspired metaheuristic algorithm for solving supply chain optimization problems. Scientific Reports, 14(1), pp.1-62.

Xie, R., Wen, J., Quitadamo, A., Cheng, J. and Shi, X., 2017. A deep auto-encoder model for gene expression prediction. BMC genomics, 18, pp.39-49.

Pan, Y., He, F. and Yu, H., 2020. Learning social representations with deep autoencoder for recommender system. World Wide Web, 23(4), pp.2259-2279.

Min, B., Yoo, J., Kim, S., Shin, D. and Shin, D., 2021. Network anomaly detection using memory-augmented deep autoencoder. IEEE Access, 9, pp.104695-104706.

Downloads

Published

2025-07-30

How to Cite

1.
MadhanGiri M, Santhalakshmi M. Conditional Generative Adversarial Network (CGAN) Based Data Oversampling And Gaussian Bobcat Optimization Algorithm (GBOA) For Heart Diseases Prediction. J Neonatal Surg [Internet]. 2025Jul.30 [cited 2025Oct.14];14(32S):6606-21. Available from: https://jneonatalsurg.com/index.php/jns/article/view/8628