Background The length of the huntingtin (HTT) CAG repeat is strongly correlated with both age at onset of Huntington’s disease (HD) symptoms and age at death of HD patients. Dichotomous analysis comparing HD to controls is widely used to study the effects of HTT CAG repeat expansion. However, a potentially more powerful approach is a continuous analysis strategy that takes advantage of all of the different CAG lengths, to capture effects that are expected to be critical to HD pathogenesis. Methodology/Principal Findings We used continuous and dichotomous approaches to analyze microarray gene expression data from 107 human control and HD lymphoblastoid cell lines. Of all probes found to be significant in a continuous analysis by CAG length, only 21.4% were so identified by a dichotomous comparison of HD versus controls. Moreover, of probes significant by dichotomous analysis, only 33.2% were also significant in the continuous analysis. Simulations revealed that the dichotomous approach would require substantially more than 107 samples to either detect 80% of the CAG-length correlated changes revealed by continuous analysis or to reduce the rate of significant differences that are not CAG length-correlated to 20% (n = 133 or n = 206, respectively). Given the superior power of the continuous approach, we calculated the correlation structure between HTT CAG repeat lengths and gene expression levels and created a freely available searchable website, “HD CAGnome,” that allows users to examine continuous relationships between HTT CAG and expression levels of ~20,000 human genes. Conclusions/Significance Our results reveal limitations of dichotomous approaches compared to the power of continuous analysis to study a disease where human genotype-phenotype relationships strongly support a role for a continuum of CAG length-dependent changes. The compendium of HTT CAG length-gene expression level relationships found at the HD CAGnome now provides convenient routes for discovery of candidates influenced by the HD mutation.
References
[1]
HDCRG (1993) A novel gene containing a trinucleotide repeat that is expanded and unstable on Huntington’s disease chromosomes. The Huntington’s Disease Collaborative Research Group. Cell 72: 971–983. doi: 10.1016/0092-8674(93)90585-e
[2]
Andrew SE, Goldberg YP, Kremer B, Telenius H, Theilmann J, et al. (1993) The relationship between trinucleotide (CAG) repeat length and clinical features of Huntington’s disease. Nat Genet 4: 398–403. doi: 10.1038/ng0893-398
[3]
Duyao M, Ambrose C, Myers R, Novelletto A, Persichetti F, et al. (1993) Trinucleotide repeat length instability and age of onset in Huntington’s disease. Nat Genet 4: 387–392. doi: 10.1038/ng0893-387
[4]
Lee JM, Ramos EM, Lee JH, Gillis T, Mysore JS, et al. (2012) CAG repeat expansion in Huntington disease determines age at onset in a fully dominant fashion. Neurology 78: 690–695. doi: 10.1212/wnl.0b013e318249f683
[5]
Persichetti F, Srinidhi J, Kanaley L, Ge P, Myers RH, et al. (1994) Huntington’s disease CAG trinucleotide repeats in pathologically confirmed post-mortem brains. Neurobiol Dis 1: 159–166. doi: 10.1006/nbdi.1994.0019
[6]
Snell RG, MacMillan JC, Cheadle JP, Fenton I, Lazarou LP, et al. (1993) Relationship between trinucleotide repeat expansion and phenotypic variation in Huntington’s disease. Nat Genet 4: 393–397. doi: 10.1038/ng0893-393
[7]
Seong IS, Ivanova E, Lee JM, Choo YS, Fossale E, et al. (2005) HD CAG repeat implicates a dominant property of huntingtin in mitochondrial energy metabolism. Hum Mol Genet 14: 2871–2880. doi: 10.1093/hmg/ddi319
[8]
Consortium THi (2012) Induced pluripotent stem cells from patients with Huntington’s disease show CAG-repeat-expansion-associated phenotypes. Cell Stem Cell 11: 264–278. doi: 10.1016/j.stem.2012.04.027
[9]
Lee JM, Galkina EI, Levantovsky RM, Fossale E, Anne Anderson M, et al. (2013) Dominant effects of the Huntington’s disease HTT CAG repeat length are captured in gene-expression data sets by a continuous analysis mathematical modeling strategy. Hum Mol Genet 22: 3227–3238. doi: 10.1093/hmg/ddt176
[10]
Perlis RH, Smoller JW, Mysore J, Sun M, Gillis T, et al. Prevalence of incompletely penetrant Huntington’s disease alleles among individuals with major depressive disorder. Am J Psychiatry 167: 574–579. doi: 10.1176/appi.ajp.2009.09070973
[11]
Johnson WE, Li C, Rabinovic A (2007) Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 8: 118–127. doi: 10.1093/biostatistics/kxj037
[12]
Borovecki F, Lovrecic L, Zhou J, Jeong H, Then F, et al. (2005) Genome-wide expression profiling of human blood reveals biomarkers for Huntington’s disease. Proc Natl Acad Sci U S A 102: 11023–11028. doi: 10.1073/pnas.0504921102
[13]
Hodges A, Strand AD, Aragaki AK, Kuhn A, Sengstag T, et al. (2006) Regional and cellular gene expression changes in human Huntington’s disease brain. Hum Mol Genet 15: 965–977. doi: 10.1093/hmg/ddl013
[14]
Kuhn A, Goldstein DR, Hodges A, Strand AD, Sengstag T, et al. (2007) Mutant huntingtin’s effects on striatal gene expression in mice recapitulate changes observed in human Huntington’s disease brain and do not differ with mutant huntingtin length or wild-type huntingtin dosage. Hum Mol Genet 16: 1845–1861. doi: 10.1093/hmg/ddm133
[15]
Luthi-Carter R, Hanson SA, Strand AD, Bergstrom DA, Chun W, et al. (2002) Dysregulation of gene expression in the R6/2 model of polyglutamine disease: parallel changes in muscle and brain. Hum Mol Genet 11: 1911–1926. doi: 10.1093/hmg/11.17.1911
[16]
Luthi-Carter R, Strand AD, Hanson SA, Kooperberg C, Schilling G, et al. (2002) Polyglutamine and transcription: gene expression changes shared by DRPLA and Huntington’s disease mouse models reveal context-independent effects. Hum Mol Genet 11: 1927–1937. doi: 10.1093/hmg/11.17.1927
[17]
Runne H, Kuhn A, Wild EJ, Pratyaksha W, Kristiansen M, et al. (2007) Analysis of potential transcriptomic biomarkers for Huntington’s disease in peripheral blood. Proc Natl Acad Sci U S A 104: 14424–14429. doi: 10.1073/pnas.0703652104
[18]
Strand AD, Baquet ZC, Aragaki AK, Holmans P, Yang L, et al. (2007) Expression profiling of Huntington’s disease models suggests that brain-derived neurotrophic factor depletion plays a major role in striatal degeneration. J Neurosci 27: 11758–11768. doi: 10.1523/jneurosci.2461-07.2007