ABSTRACT
DNA methylation, a well-studied mechanism of epigenetic regulation, plays important roles in cancer. Increased levels of global DNA methylation is observed in primary solid tumors including endometrial carcinomas and is generally associated with silencing of tumor suppressor genes. The role of DNA methylation in cancer recurrence after therapeutic intervention is not clear. Here, we developed a novel computational method to analyze whole-genome DNA methylation data for endometrial tumors within the context of a human protein-protein interaction (PPI) network, in order to identify subnetworks as potential epigenetic biomarkers for predicting tumor recurrence. Our method consists of the following steps. First, differentially methylated (DM) genes between recurrent and non-recurrent tumors are identified and mapped onto a human PPI network. Then, a PPI subnetwork consisting of DM genes and genes that are topologically important for connecting the DMs on the PPI network, termed epigenetic connectors (ECs), are extracted using a Steiner-tree based algorithm. Finally, a random-walk based machine learning method is used to propagate the DNA methylation scores from the DMs to the ECs, which enables the ECs to be used as features in a support vector machine classifier for predicting recurrence. Remarkably, we found that while the DMs are not enriched in any cancer-related pathways, the ECs are enriched in many well-known tumorgenesis and metastasis pathways and include known epigenetic regulators. Moreover, combining the DMs and ECs significantly improves the prediction accuracy of cancer recurrence and outperforms several alternative methods. Therefore, the network-based method is effective in identifying gene subnetworks that are crucial both for the understanding and prediction of tumor recurrence.
- M.A. Shipp, K. N. Ross, P. Tamayo, et al. Diffuse large B-cell lymphoma outcome prediction by geneexpression profiling and supervised machine learning. Nat Med, 8:68--74, 2002.Google ScholarCross Ref
- T. Sorlie, C. M. Perou, R. Tibshirani, et al. Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci U S A, 98:10869--74, 2001.Google ScholarCross Ref
- P. Radivojac, K. Peng, W. T. Clark, et al. An integrated approach to inferring gene-disease associations in humans. Proteins, 72:1030--7, 2008.Google ScholarCross Ref
- T.H. Huang and M. Esteller. Chromatin remodeling in mammary gland differentiation and breast tumorigenesis. Cold Spring Harb Perspect Biol, 2:a004515, 2010.Google ScholarCross Ref
- M. Kulis and M. Esteller. DNA methylation and cancer. Adv Genet, 70:27--56, 2010.Google ScholarCross Ref
- K. Dobbin and R. Simon. Sample size determination in microarray experiments for class comparison and prognostic classification. Biostatistics, 6:27--38, 2005.Google ScholarCross Ref
- L. Ein-Dor, O. Zuk, and E. Domany. Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer. Proc Natl Acad Sci U S A, 103:5923--8, 2006.Google ScholarCross Ref
- J. Li, A. E. Lenferink, Y. Deng, et al. Identification of high-quality cancer prognostic markers and metastasis network modules. Nat Commun, 1:34, 2010.Google ScholarCross Ref
- M.L. Gatza, Joseph E. Lucas, William T. Barry, et al. A pathway-based classification of human breast cancer. Proc Natl Acad Sci U S A, 107:6994--6999, 2010.Google ScholarCross Ref
- M. Vidal, M. E. Cusick, and A. L. Barabasi. Interactome networks and human disease. Cell, 144:986--98, 2011.Google ScholarCross Ref
- A.L. Barabasi, N. Gulbahce, and J. Loscalzo. Network medicine: a network-based approach to human disease. Nat Rev Genet, 12:56--68, 2011.Google ScholarCross Ref
- E. Lee, Han-Yu Chuang, Jong-Won Kim, Trey Ideker, and Doheon Lee. Inferring pathway activity toward precise disease classification. PLoS Comput Biol, 4:e1000217+, 2008.Google Scholar
- A. Keller, C. Backes, A. Gerasch, et al. A novel algorithm for detecting differentially regulated paths based on gene set enrichment analysis. Bioinformatics, 25:2787--94, 2009. Google ScholarDigital Library
- Y.A. Kim, S. Wuchty, and T. M. Przytycka. Identifying causal genes and dysregulated pathways in complex diseases. PLoS Comput Biol, 7:e1001095, 2011.Google ScholarCross Ref
- H Y Chuang, E Lee, Y T Liu, D Lee, and T Ideker. Network-based classification of breast cancer metastasis. Mol Syst Biol, 3:140, 2007.Google ScholarCross Ref
- M. Liu, A. Liberzon, S. W. Kong, et al. Network-based analysis of affected biological processes in type 2 diabetes models. PLoS Genet, 3:e96, 2007.Google ScholarCross Ref
- T. Hwang, H. Sicotte, Ze Tian, et al. Robust and efficient identification of biomarkers by classifying features on graphs. Bioinformatics, 24:2023--2029, 2008. Google ScholarDigital Library
- J.H. Hung, T. W. Whitfield, T. H. Yang, et al. Identification of functional modules that correlate with phenotypic difference: the influence of network topology. Genome Biol, 11:R23, 2010.Google ScholarCross Ref
- I. Ulitsky, A. Krishnamurthy, R. M. Karp, and R. Shamir. DEGAS: de novo discovery of dysregulated pathways in human diseases. PLoS One, 5:e13367, 2010.Google ScholarCross Ref
- S.A. Chowdhury, R. K. Nibbe, M. R. Chance, and M. Koyuturk. Subnetwork state functions define dys-regulated subnetworks in cancer. J Comput Biol, 18:263--281, 2011.Google ScholarCross Ref
- L. Geistlinger, G. Csaba, R. Kuffner, N. Mulder, and R. Zimmer. From sets to graphs: towards a realistic enrichment analysis of transcriptomic systems. Bioinformatics, 27:i366--i373, 2011. Google ScholarDigital Library
- F. Vandin, E. Upfal, and B. J. Raphael. Algorithms for detecting significantly mutated pathways in cancer. RECOMB'10, pages 506--521, Berlin, Heidelberg, 2010. Google ScholarDigital Library
- P. Dao, K. Wang, C. Collins, et al. Optimally discriminative subnetwork markers predict response to chemotherapy. Bioinformatics, 27:205--213, 2011. Google ScholarDigital Library
- C.S. Awtrey, M. G. Cadungog, Mario M. Leitao, et al. Surgical resection of recurrent endometrial carcinoma. Gynecol Oncol, 102:480--8, 2006.Google ScholarCross Ref
- Y.-W. Huang, J. Luo, Yu-I Weng, et al. Promoter hypermethylation of CIDEA, HAAO and RXFP3 associated with microsatellite instability in endometrial carcinomas. Gynecol Oncol, 117:239--247, 2010.Google ScholarCross Ref
- T.A. Rauch and G. P. Pfeifer. Dna methylation profiling using the methylated-CpG island recovery assay (MIRA). Methods, 52:213--7, 2010.Google ScholarCross Ref
- D. Serre, B. H. Lee, and A. H. Ting. MBD-isolated genome sequencing provides a high-throughput and comprehensive survey of DNA methylation in the human genome. Nucleic Acids Res, 38:391--9, 2010.Google ScholarCross Ref
- T. S. Keshava Prasad, Renu Goel, Kumaran Kandasamy, et al. Human protein reference database - 2009 update. Nucleic Acids Res, 37:D767--772, 2009.Google ScholarCross Ref
- S. Voß. Steiner's problem in graphs: heuristic methods. Discrete Appl Math, 40:45--72, 1992. Google ScholarDigital Library
- V. J Rayward-Smith. The computation of nearly minimal Steiner trees in graphs. Internat J. Math. Ed. Sci. Tech, 14:15--23, 1983.Google ScholarCross Ref
- H. Tong, C. Faloutsos, and J.-Y. Pan. Fast random walk with restart and its applications. In Proceedings of the Sixth International Conference on Data Mining, ICDM '06, pages 613--622, Washington, DC, USA, 2006. Google ScholarDigital Library
- M. Hall, E. Frank, G. Holmes, et al. The weka data mining software: an update. SIGKDD Explor. Newsl., 11(1):10--18, 2009. Google ScholarDigital Library
- J.R. Landis and G. G. Koch. The measurement of observer agreement for categorical data. Biometrics, 33:159--74, 1977.Google ScholarCross Ref
- H.B. Nair, R. P. Perla, N. B. Kirma, et al. Estrogen receptor-beta mediates the protective effects of aromatase induction in the MMTV-Her-2/neu x aromatase double transgenic mice. Hormones and Cancer, 3:26--36, 2012.Google ScholarCross Ref
- G.E. Konecny, L. Santos, B. Winterhoff, et al. HER2 gene amplification and EGFR expression in a large cohort of surgically staged patients with nonendometrioid (type II) endometrial cancer. Br J Cancer, 100:89--95, 2009.Google ScholarCross Ref
- B. Sitohy, J. A. Nagy, and H. F. Dvorak. Anti-VEGF/VEGFR therapy for cancer: Reassessing the target. Cancer Research, 72:1909--1914, 2012.Google ScholarCross Ref
- S.A. Ali, S. K. Zaidi, J. R. Dobson, et al. Transcriptional corepressor TLE1 functions with Runx2 in epigenetic repression of ribosomal RNA genes. Proc Natl Acad Sci U S A, 107:4165--9, 2010.Google ScholarCross Ref
- F.R. Althaus. Poly(ADP-ribose): a co-regulator of DNA methylation? Oncogene, 24:11--12, 2005.Google ScholarCross Ref
- P. Caiafa, T. Guastafierro, and M. Zampieri. Epigenetics: poly(ADP-ribosyl)ation of PARP-1 regulates genomic methylation patterns. FASEB J, 23:672--8, 2009.Google ScholarCross Ref
- M.J. Jahid and J. Ruan. Identification of biomarkers in breast cancer metastasis by integrating protein-protein interaction network and gene expression data. BMC Genomics, S5:S8, 2012.Google Scholar
Index Terms
- Network-based classification of recurrent endometrial cancers using high-throughput DNA methylation data
Recommendations
DNA methylation-regulated microRNA pathways in ovarian serous cystadenocarcinoma
Perform an integrate analysis of DNA methylation, microRNA expression and mRNA expression data.Meta-analysis was performed to reduce the effects of biological heterogeneity among different batches of data.Proposed a systematic strategy to construct ...
Characterizing mutation-expression network relationships in multiple cancers
BackgroundData made available through large cancer consortia like The Cancer Genome Atlas make for a rich source of information to be studied across and between cancers. In recent years, network approaches have been applied to such data in uncovering ...
Integration of DNA Methylation, Copy Number Variation, and Gene Expression for Gene Regulatory Network Inference and Application to Psychiatric Disorders
BIBE '14: Proceedings of the 2014 IEEE International Conference on Bioinformatics and BioengineeringBiological network inference is a crucial problem to solve in Bioinformatics as most of biological process are based on bio molecular interactions. Many researchers have worked on especially the inference of gene regulatory networks where a node and ...
Comments