PLOS ONE  2012 

Linking the Epigenome to the Genome: Correlation of Different Features to DNA Methylation of CpG Islands

DOI: 10.1371/journal.pone.0035327

Abstract:

DNA methylation of CpG islands plays a crucial role in the regulation of gene expression. More than half of all human promoters contain CpG islands with a tissue-specific methylation pattern in differentiated cells. How DNA methyltransferases determine which regions to methylate is still not fully understood, and many hypotheses about which genomic features correlate with the epigenome have not yet been evaluated. Furthermore, many exploratory approaches to measuring DNA methylation are limited to a subset of the genome and thus cannot be employed, e.g., for genome-wide biomarker prediction. In this study, we evaluated the correlation of genetic, epigenetic and hypothesis-driven features with DNA methylation of CpG islands. To this end, various binary classifiers were trained and evaluated by cross-validation on a dataset comprising DNA methylation data for 190 CpG islands in HEPG2, HEK293, fibroblasts and leukocytes. We achieved an accuracy of up to 91% with a Matthews correlation coefficient (MCC) of 0.8, using ten-fold cross-validation with ten repetitions. With these models, we extended the existing dataset to the whole genome and thus predicted the methylation landscape for the given cell types. The method used for these predictions was also validated on another, external whole-genome dataset. Our results reveal features correlated with DNA methylation and confirm or disprove various hypotheses about DNA methylation-related features. This study confirms correlations between DNA methylation and histone modifications, DNA structure, DNA sequence, genomic attributes and CpG island properties. Furthermore, the method has been validated on a genome-wide dataset from the ENCODE consortium. The developed software, the predicted datasets and a web service for comparing methylation states of CpG islands are available at http://www.cogsys.cs.uni-tuebingen.de/software/dna-methylation/.
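The evaluation protocol named in the abstract (accuracy and MCC under ten-fold cross-validation with ten repetitions) can be illustrated with a minimal Python sketch. This is not the authors' software: the function names, the random seed and the confusion-matrix counts are hypothetical, chosen only to show how the two metrics relate and how 190 samples would be split.

```python
import math
import random


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Common convention when a row or column of the matrix is empty.
        return 0.0
    return (tp * tn - fp * fn) / denom


def accuracy(tp, tn, fp, fn):
    """Fraction of correctly classified samples."""
    return (tp + tn) / (tp + tn + fp + fn)


def repeated_kfold(n_samples, k=10, repeats=10, seed=0):
    """Yield (repeat, fold, train_idx, test_idx) for repeated k-fold CV.

    Each repetition reshuffles the sample order, then assigns every
    k-th sample of the shuffled order to the same test fold.
    """
    rng = random.Random(seed)  # seed is an arbitrary illustrative choice
    for r in range(repeats):
        order = list(range(n_samples))
        rng.shuffle(order)
        for f in range(k):
            test_idx = order[f::k]
            held_out = set(test_idx)
            train_idx = [i for i in order if i not in held_out]
            yield r, f, train_idx, test_idx


if __name__ == "__main__":
    # Hypothetical balanced confusion matrix, not data from the study.
    tp, tn, fp, fn = 45, 45, 5, 5
    print("accuracy:", accuracy(tp, tn, fp, fn))
    print("MCC:", mcc(tp, tn, fp, fn))

    # 190 samples mirrors the number of CpG islands in the abstract.
    splits = list(repeated_kfold(190, k=10, repeats=10))
    print("train/test splits:", len(splits))
```

For a balanced two-class problem, MCC equals 2 × accuracy − 1 when errors are symmetric, which is why an accuracy near 90% and an MCC near 0.8 are mutually consistent. Unlike accuracy, MCC stays informative when the methylated and unmethylated classes are unequal in size.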
