Enhanced Cancer Classification Using Optimized Deep Learning Approaches: A Novel Framework Integrating RNA Sequence Analysis and AI Techniques
DOI:
https://doi.org/10.52783/jns.v14.3091Keywords:
Lung Cancer Classification, Optimized Deep Learning, Convolutional Neural Network, K-Means Clustering, BRCA, UCEC, particle swarm optimizationAbstract
Lung cancer is one of the most aggressive diseases globally, responsible for over 9 million deaths annually. Early staging is critical for improving recovery rates, and RNA sequence analysis has emerged as a vital technique in this process. Recent advancements in AI have significantly enhanced the efficiency and accuracy of human genomics analysis. This study proposes Lung Cancer Classification Using Optimized Deep Learning Approaches(CC-ODLA) to classify various lung cancer typesspecifically kidney renal clear cell carcinoma (KIRC), breast invasive carcinoma (BRCA), lung squamous cell carcinoma (LUSC), lung adenocarcinoma (LUAD), and uterine corpus endometrial carcinoma (UCEC)—using deep learning methods. The first approach combines BPSO-DT and CNN to analyzetumor RNA-seq gene expression data. The proposed methodology consists of three phases: (1) preprocessing, which optimizes high-dimensional RNA-seq data to select optimal features and converts them into 2D images; (2) data augmentation, which increases the dataset size from 2,086 samples to five times larger to mitigate overfitting; and (3) deep CNN architecture, employing a two-layer convolutional framework for feature extraction and classification, achieving a testing accuracy of 96.90%. The second approach introduces an AFOECNN, designed to enhance classification accuracy in lung cancer datasets. This approach also involves three phases: (1) preprocessing using K-Means Clustering (KMC) to reduce noise and handle missing data; (2) feature subset selection via the AFO algorithm to identify significant features based on fitness values; and (3) classification using the Enhanced CNN (ECNN) algorithm, which demonstrates superior precision, recall, F-measure, and accuracy compared to existing algorithms while maintaining lower time complexity. The comparative results indicate that the proposed methods not only improve classification performance but also offer efficient computational resource management, underscoring the transformative potential of AI in smart healthcare solutions.Using Adaptive Firefly Optimisation, the Enhanced CNN improves classification performance with 98% testing accuracy, 93.21% precision, 98% recall, and 91.82% F1-score. However, the combination BPSO-DT and CNN method achieves 96.90% testing accuracy.
Downloads
Metrics
References
Rukhsar, L., Bangyal, W. H., Ali Khan, M. S., Ag Ibrahim, A. A., Nisar, K., & Rawat, D. B. (2022). Analyzing RNA-seq gene expression data using deep learning approaches for cancer classification. Applied Sciences, 12(4), 1850.
Alharbi, F., &Vakanski, A. (2023). Machine learning methods for cancer classification using gene expression data: A review. Bioengineering, 10(2), 173.
Ramamurthy, M., Krishnamurthi, I., Vimal, S., & Robinson, Y. H. (2020). Deep learning based genome analysis and NGS-RNA LL identification with a novel hybrid model. Biosystems, 197, 104211.
Mazlan, A. U., Sahabudin, N. A., Remli, M. A., Ismail, N. S. N., Mohamad, M. S., Nies, H. W., & Abd Warif, N. B. (2021). A review on recent progress in machine learning and deep learning methods for cancer classification on gene expression data. Processes, 9(8), 1466.
Joshi, A. A., & Aziz, R. M. (2024). Deep learning approach for brain tumor classification using metaheuristic optimization with gene expression data. International Journal of Imaging Systems and Technology, 34(2), e23007.
Joshi, A. A., & Aziz, R. M. (2024). Deep learning approach for brain tumor classification using metaheuristic optimization with gene expression data. International Journal of Imaging Systems and Technology, 34(2), e23007.
Avila Santos, A. P., de Almeida, B. L., Bonidia, R. P., Stadler, P. F., Stefanic, P., Mandic-Mulec, I., ... & de Carvalho, A. C. (2024). BioDeepfuse: a hybrid deep learning approach with integrated feature extraction techniques for enhanced non-coding RNA classification. RNA biology, 21(1), 1-12.
Mahto, R., Ahmed, S. U., Rahman, R. U., Aziz, R. M., Roy, P., Mallik, S., ... & Shah, M. A. (2023). A novel and innovative cancer classification framework through a consecutive utilization of hybrid feature selection. BMC bioinformatics, 24(1), 479.
Park, Y., Heider, D., & Hauschild, A. C. (2021). Integrative analysis of next-generation sequencing for next-generation cancer research toward artificial intelligence. Cancers, 13(13), 3148.
Tran, K. A., Kondrashova, O., Bradley, A., Williams, E. D., Pearson, J. V., & Waddell, N. (2021). Deep learning in cancer diagnosis, prognosis and treatment selection. Genome Medicine, 13, 1-17.
Gupta, S., Gupta, M. K., Shabaz, M., & Sharma, A. (2022). Deep learning techniques for cancer classification using microarray gene expression data. Frontiers in Physiology, 13, 952709.
Tran, T. O., Vo, T. H., & Le, N. Q. K. (2024). Omics-based deep learning approaches for lung cancer decision-making and therapeutics development. Briefings in Functional Genomics, 23(3), 181-192.
Chowdhury, Shanta, Xishuang Dong, and Xiangfang Li. "Recurrent neural network based feature selection for high dimensional and low sample size micro-array data." 2019 IEEE International Conference on Big Data (Big Data). IEEE, 2019.
Danaee, Padideh, Reza Ghaeini, and David A. Hendrix. "A deep learning approach for cancer detection and relevant gene identification." Pacific symposium on biocomputing 2017. 2017.
Sawhney, Ramit, Puneet Mathur, and Ravi Shankar. "A firefly algorithm based wrapper-penalty feature selection method for cancer diagnosis." Computational Science and Its Applications–ICCSA 2018: 18th International Conference, Melbourne, VIC, Australia, July 2-5, 2018, Proceedings, Part I 18. Springer International Publishing, 2018.
Pfeffer, Maximilian Achim, and Sai Ho Ling. "Evolving Optimised Convolutional Neural Networks for Lung Cancer Classification." Signals 3.2 (2022): 284-295.
Lopez-Garcia, Guillermo, et al. "Transfer learning with convolutional neural networks for cancer survival prediction using gene-expression data." PloS one 15.3 (2020): e0230536.
Vadapalli, Sreya, et al. "Artificial intelligence and machine learning approaches using gene expression and variant data for personalized medicine." Briefings in bioinformatics 23.5 (2022).
Gawade, P., Nandi, S., Meena, C., & Sarkar, R. R. (2022). Artificial intelligence and machine learning techniques using omics data for cancer diagnosis and treatment. In Systems Biomedicine Approaches in Cancer Research (pp. 23-50). Singapore: Springer Nature Singapore.
Mohamed, T. I., Ezugwu, A. E., Fonou-Dombeu, J. V., Ikotun, A. M., & Mohammed, M. (2023). A bio-inspired convolution neural network architecture for automatic breast cancer detection and classification using RNA-Seq gene expression data. Scientific Reports, 13(1), 14644.
Sokouti, M., &Sokouti, B. (2024). Cancer genetics and deep learning applications for diagnosis, prognosis, and categorization. Journal of Biological Methods| Volume, 11(3), e99010017.
Alanazi, S. A., Alshammari, N., Alruwaili, M., Junaid, K., Abid, M. R., & Ahmad, F. (2024). Integrative analysis of RNA expression data unveils distinct cancer types through machine learning techniques. Saudi Journal of Biological Sciences, 31(3), 103918.
Hasan, M. M., Tsukiyama, S., Cho, J. Y., Kurata, H., Alam, M. A., Liu, X., ... & Deng, H. W. (2022). Deepm5C: a deep-learning-based hybrid framework for identifying human RNA N5-methylcytosine sites using a stacking strategy. Molecular Therapy, 30(8), 2856-2867.
Yuan, L., Zhao, J., Sun, T., & Shen, Z. (2021). A machine learning framework that integrates multi-omics data predicts cancer-related LncRNAs. BMC bioinformatics, 22(1), 332.
Khalsan, M., Machado, L. R., Al-Shamery, E. S., Ajit, S., Anthony, K., Mu, M., & Agyeman, M. O. (2022). A survey of machine learning approaches applied to gene expression analysis for cancer prediction. IEEE Access, 10, 27522-27534.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
Terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.