Phân hạng gen gây bệnh sử dụng học tăng cường kết hợp với xác suất tiền nghiệm

  • Đặng Vũ Tùng Vietnam Youth Academy
  • Dương Anh Trà
  • Lê Đức Hậu
  • Từ Minh Phương

Abstract

Disease gene prioritization is the process of ranking candidate genes according to their relevance to a disease phenotype, thus facilitating the identification of disease genes by narrowing down the set of genes to be tested experimentally. Many methods have been proposed for disease gene prioritization based on relationships between proteins encoded in protein-protein interaction networks using various graph-based algorithms. In this paper, we propose a novel method for prioritizing candidate disease genes by combining reinforcement learning with PageRank algorithm and assigning priors for known disease genes. We experimentally evaluate the proposed method on a human protein interaction network and compared its performance with a state-of-the-art methods, namely PageRank with priors, Random Walk with Restart and K-Step Markov. The experiment results show that our method achieves relatively high performance in terms of AUC values and outperforms comparative methods.

Author Biography

Đặng Vũ Tùng, Vietnam Youth Academy
Bộ môn Tin học

References

. PEREZ-IRATXETA, C., BORK, P. and ANDRADE, M.A, "Association of genes to genetically inherited diseases using data mining", Nat Genet, vol. 31, pp. 316-319, 2002.

CARLOS ROBERTO ARIAS, HSIANG-YUAN YEH,VON-WUN SOO, "Disease Gene Prioritization", Selected Works in Bioinformatics, Dr. Xuhua Xia (Ed.), ISBN: 978-953-307-281-4, InTech, Available from http://www.intechopen.com/books/selected-works-in-bioinformatics/disease-gene-prioritization, 2011.

XIUJUAN WANG, NATALI GULBAHCE and HAIYUANYU, "Network-based methods for human disease gene prediction", Briefings in Functional Genomics, vol. 10, no. 5, pp. 280-239, 2011.

D. J. WATTS, S. H. STROGATZ, "Collective dynamics of small-world networks", Nature 393(1), pp. 440-442, 1998.

JUNKER BH, KOSCHUTZKI D, SCHREIBER F, "Exploration of biological network centralities with CentiBiN", BMC Bioinformatics, p. 219, July 2006.

LOVASZ, L., "Random walks on graphs: A survey", Combinatorics, Paul Erdos is Eighty, vol. 2, pp. 353-398, 1996.

KOHLER, S., BAUER, S., HORN, D. and ROBINSON, P., "Walking the Interactome for Prioritization of Candidate Disease Genes", The American Journal of Human Genetics, vol. 82, pp. 949-958, 2008.

DUC-HAU LE, YUNG-KEUN KWON, "Neighbor-favoring weight reinforcement to improve random walk-based disease gene prioritization", Computational Biology and Chemistry, vol. 44, pp. 1–8, 2013.

DUC-HAU LE, YUNG-KEUN KWON, "GPEC: A Cytoscape plug-in for random walk-based gene prioritization and biomedical evidence collection", Computational Biology and Chemistry, vol. 37, pp. 17–23, 2012.

CHEN, J., ARONOW, B. and JEGGA, A, "Disease candidate gene identification and prioritization using protein interaction networks", BMC Bioinformatics, vol. 10, p. 73, 2009.

VANUNU O, MAGGER O, RUPPIN E, et al., "Associating genes and protein complexes with disease via network propagation", PLoSComput Biol, vol. 6:e1000641, 2010.

VALI DERHAMI, ELAHE KHODADADIAN, MOHAMMAD GHASEMZADEH, ALI MOHAMMAD ZAREH BIDOKI, "Applying reinforcement learning for web pages ranking algorithms", Applied Soft Computing, vol. 13, pp. 1686–1692, 2013.

ELAHE KHODADADIAN, MOHAMMAD GHASEMZADEH, VALI DERHAMI, S.ALI MIRSOLEIMANI, "A Novel Ranking Algorithm Based on Reinforcement Learning", Artificial Intelligence and Signal Processing (AISP), 2012 16th CSI International Symposium on, pp. 546-551, 2012.

HAVELIWALA, T., "Topic-sensitive PageRank", Proceedings of the 11th International World Wide Web Conference, Honolulu, Hawaii, pp. 517-526, 2002.

JEH, G. and WIDOM J., "Scaling personalized Web search", Stanford University, Computer Science Department Technical Report, 2002.

Published
2015-06-30
Section
Bài báo