Advances in Computational Approaches for Disease Prediction: A Comprehensive Survey of Candidate Gene Identification Using Protein-Protein Interaction (PPI) Networks and Multi-Omics Integration
Keywords:
Candidate gene prediction, protein-protein interaction (PPI) networks, disease prediction, network-based algorithms, machine learning, multi-omics integration, gene-disease association, computational biology, gene analysis, bioinformaticsAbstract
Disease prediction plays a crucial role in understanding the underlying genetic factors responsible for various conditions and improving early diagnosis and treatment. Computational methods have become essential tools in identifying genes linked to diseases, aiding in the discovery of new therapeutic targets. One of the key areas of research in this field is the use of protein-protein interaction (PPI) networks and gene analysis to predict candidate genes involved in disease processes. This research focuses on the prediction of candidate genes using PPI networks, examining various computational approaches to identify genes potentially involved in specific biological processes, diseases, or traits. By leveraging the structural and functional insights provided by PPI networks, these methods infer gene involvement based on their interactions with known disease-associated proteins. Techniques range from network-based algorithms, which analyze the topological properties of PPI networks, to machine learning models that predict gene-disease associations using complex data patterns. Additionally, integrative approaches combine PPI networks with multi-omics data, such as genomics, transcriptomics, and epigenomics, to enhance the biological relevance of predictions. This survey highlights the strengths and limitations of each method, offering a comprehensive overview of the evolving landscape of candidate gene prediction.
Downloads
Metrics
References
Zhao, X., Li, Z., Yang, M., & Fang, J. (2020). Predicting candidate disease genes by integrating tissue-specific gene expression data into a random walk with restart model on protein–protein interaction networks. BMC Bioinformatics, 21(1), 283.
Morshed, M. N., Mollah, M. N. H., & Mazumder, H. (2021). An integrative approach to identifying candidate genes for Alzheimer's disease using protein–protein interaction network and machine learning. Scientific Reports, 11(1), 17822.
Köhler, S., Gargano, M., Matentzoglu, N., & Robinson, P. N. (2021). Network propagation approaches for prioritizing candidate genes in rare disease research. American Journal of Human Genetics, 108(3), 439-452.
Zhu, X., Liao, Z., Li, Y., & Liu, S. (2020). Integrating epigenomic data and protein-protein interaction network for candidate gene prioritization of complex diseases. Frontiers in Genetics, 11, 582749.
Dong, Z., Cui, J., Liu, Q., & Shen, Y. (2022). Dynamic clustering of protein interaction networks improves candidate gene prioritization for stage-specific diseases. PLoS Computational Biology, 18(3), e1009803.
Jiang, P., Zhang, Y., & Chen, J. (2020). Unsupervised deep learning for detecting gene modules in protein–protein interaction networks. Bioinformatics, 36(9), 2639-2645.
Liu, Z., Wang, C., & Wang, Y. (2021). Predicting disease genes using weighted shortest paths in protein-protein interaction networks. Bioinformatics, 37(12), 1730-1738.
Sun, Y., Liu, L., & Huang, T. (2020). Pathway-centric shortest path approach for prioritizing candidate genes for Parkinson's disease. BMC Bioinformatics, 21(1), 527.
Chen, Y., Yang, P., & Sun, X. (2020). Prediction of candidate genes for complex diseases based on protein–protein interaction networks and gene expression data. BMC Genomics, 21, 512.
Li, L., Gao, Z., & Zhang, X. (2021). Prediction of cardiovascular disease-related candidate genes based on protein interaction networks and gene function annotations. Scientific Reports, 11(1), 9736.
Zitnik, M., Wang, C., & Leskovec, J. (2021). Predicting gene-disease associations with graph neural networks and multi-layer PPI networks. Nature Communications, 12(1), 1257.
Peng, J., Tang, Y., & Liu, Q. (2022). DeepGraphGO: Predicting candidate genes in neurodegenerative diseases by integrating PPI networks with graph neural networks and gene ontology annotations. Bioinformatics, 38(6), 899-906.
Cao, Y., Zhu, Z., & Li, J. (2021). A feature-based Random Forest approach for predicting cancer candidate genes using protein-protein interaction networks. BMC Medical Genomics, 14(1), 214.
Sun, Y., Zhang, Y., & Ma, Z. (2020). A hybrid machine learning model for predicting cardiovascular disease genes based on PPI network features. Journal of Translational Medicine, 18, 300.
Chen, L., Zhang, X., & Liu, M. (2020). Integration of PPI networks and gene expression data reveals candidate genes for Alzheimer’s disease. Journal of Alzheimer's Disease, 75(3), 801-812.
Wang, Y., Zhao, Z., & Zhang, D. (2021). Integrating gene expression data with protein-protein interaction networks for breast cancer candidate gene prediction. BMC Bioinformatics, 22(1), 312.
Wu, H., Zhao, Y., & Xu, J. (2021). Multi-omics integration with protein interaction networks for prioritizing candidate genes in cardiovascular diseases. Scientific Reports, 11(1), 10988.
Liu, B., Luo, X., & Lin, D. (2020). Multi-omics data integration with PPI networks for predicting candidate genes in metabolic disorders. BMC Medical Genomics, 13, 172.
Xu, L., Wang, Y., & Sun, C. (2021). Integrating protein-protein interaction networks and chromatin accessibility data for predicting cancer-related genes. Bioinformatics, 37(5), 678-684.
Zhang, M., Shi, W., & Wang, L. (2020). Integrative analysis of DNA methylation and protein interaction networks identifies candidate genes in breast cancer. BMC Genomics, 21(1), 492.
.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
Terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.