Pathway-Capsule Neural Networks
Pathway-Capsule Neural Networks
Abstract
The progress made in Next-Generation Sequencing technology has resulted in many genetic databases, which offer the potential for more effective diagnostic tools. Recent literature highlights Artificial Intelligence (AI) as one of the most potent methods. Nonetheless, when it comes to analyzing genetic data, Machine Learning (ML) and Deep Learning (DL) face three significant challenges due to: (1) multi-modality analysis, (2) model explainability, and (3) extension to patient-specific biomarker identification. In this paper, we introduce Pathway-Capsule Neural Networks (PCNN) and its Bayesian version (BayesPCNN) to address the three limitations. We demonstrate the model using Big Data of 7, 372 samples with 57 protein expressions from the TCGA dataset. In the best setting, the proposed PCNN outperforms tree-based ML models and KNN with a significant gap of 10% − 30% in accuracy for Pan-Cancer detection; while only outperformed by kernel methods with under 2%. Another contribution of BayesPCNN is highlighted as Explainable AI (XAI), which reveals tumors with potential metastasis and patient-specific biomarkers.
References
S. Sabour, N. Frosst, and G. E. Hinton, “Dynamic routing between capsules,” Advances in neural information processing systems, vol. 30, 2017.
G. Hinton, “How to represent part-whole hierarchies in a neural network,” Neural Computation, pp. 1–40, 2022.
C. Peng, Y. Zheng, and D.-S. Huang, “Capsule network based modeling of multi-omics data for discovery of breast cancerrelated genes,” IEEE/ACM transactions on computational biology and bioinformatics, vol. 17, no. 5, pp. 1605–1612, 2019.
P. Zheng, Y. Qi, X. Li, Y. Liu, Y. Yao, and G. Huang, “A capsule network-based method for identifying transcription factors,” Frontiers in Microbiology, vol. 13, p. 1048478, 2022.
Y. Wang, L. Huang, S. Jiang, Y. Wang, J. Zou, H. Fu, and S. Yang, “Capsule networks showed excellent performance in the classification of herg blockers/nonblockers,” Frontiers in pharmacology, vol. 10, p. 1631, 2020.
L. Wang, X. Miao, R. Nie, Z. Zhang, J. Zhang, and J. Cai, “Multicapsnet: a general framework for data integration and interpretable classification,” Frontiers in genetics, p. 2237, 2022.
X. Luo, X. Kang, and A. Schonhuth, “Predicting the prevalence of complex genetic diseases from individual genotype profiles using capsule networks,” Nature Machine Intelligence, vol. 5, no. 2, pp. 114–125, 2023.
L. Wang, R. Nie, Z. Yu, R. Xin, C. Zheng, Z. Zhang, J. Zhang, and J. Cai, “An interpretable deep-learning architecture of capsule networks for identifying cell-type gene expression programs from single-cell rna-sequencing data,” Nature Machine Intelligence, vol. 2, no. 11, pp. 693–703, 2020.
G. Hadash, E. Kermany, B. Carmeli, O. Lavi, G. Kour, and A. Jacovi, “Estimate and replace: A novel approach to integrating deep neural networks with existing applications,” arXiv preprint arXiv:1804.09028, 2018.
J. Linares-Blanco, A. Pazos, and C. Fernandez-Lozano, “Machine learning analysis of tcga cancer data,” PeerJ Computer Science, vol. 7, p. e584, 2021.
M. M. Bronstein, J. Bruna, Y. LeCun, A. Szlam, and P. Vandergheynst, “Geometric deep learning: going beyond euclidean data,” IEEE Signal Processing Magazine, vol. 34, no. 4, pp. 18–42, 2017.
K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778.
D. C. C. B. R. . J. M. A. . K. A. . P. T. . P. D. . W. Y. 53 and T. S. S. L. D. A. 68, “The cancer genome atlas pancancer analysis project,” Nature genetics, vol. 45, no. 10, pp. 1113–1120, 2013.
J. Bergstra, R. Bardenet, Y. Bengio, and B. Kégl, “Algorithms for hyper-parameter optimization,” Advances in neural information processing systems, vol. 24, 2011.
N. Nguyen and K.-C. Chen, “Quantum embedding search for quantum machine learning,” IEEE Access, vol. 10, pp. 41 444–41 456, 2022.
N. Nguyen and J. M. Chang, “Csnas: Contrastive selfsupervised learning neural architecture search via sequential model-based optimization,” IEEE Transactions on Artificial Intelligence, vol. 3, no. 4, pp. 609–624, 2021.
I. Loshchilov and F. Hutter, “Decoupled weight decay regularization,” arXiv preprint arXiv:1711.05101, 2017.
B. Lakshminarayanan, A. Pritzel, and C. Blundell, “Simple and scalable predictive uncertainty estimation using deep ensembles,” Advances in neural information processing systems, vol. 30, 2017.
C. D. Putnam, R. D. Kolodner et al., “Dna mismatch repair: Mechanisms and cancer genetics,” 2019.
M.-J. Nokin, J. Bellier, F. Durieux, O. Peulen, G. Rademaker, M. Gabriel, C. Monseur, B. Charloteaux, L. Verbeke, S. van Laere et al., “Methylglyoxal, a glycolysis metabolite, triggers metastasis through mek/erk/smad1 pathway activation in breast cancer,” Breast Cancer Research, vol. 21, no. 1, pp. 1–19, 2019.
T. Xu, J. Li, Z. Wang, X. Zhang, J. Zhou, Z. Lu, L. Shen, and X. Wang, “Real-world treatment and outcomes of patients with metastatic braf mutant colorectal cancer,” Cancer Medicine, 2023.
P.-P. Wang, C. Lin, J. Wang, G. A. Margonis, and B. Wu, “Braf mutations in colorectal liver metastases: Prognostic implications and potential therapeutic strategies,” Cancers, vol. 14, no. 17, p. 4067, 2022.
Y. LeCun, “A path towards autonomous machine intelligence version 0.9. 2, 2022-06-27,” Open Review, vol. 62, 2022.