A variety of methods have been proposed for studying the association of multiple genes thought to be involved in a common pathway for a particular disease. Here, we present an extension of a Bayesian hierarchical modeling strategy that allows for multiple SNPs within each gene, with external prior information at either the SNP or gene level. The model involves variable selection at the SNP level through latent indicator variables and Bayesian shrinkage at the gene level towards a prior mean vector and covariance matrix that depend on external information. The entire model is fitted using Markov chain Monte Carlo methods. Simulation studies show that the approach is capable of recovering many of the truly causal SNPs and genes, depending upon their frequency and size of their effects. The method is applied to data on 504?SNPs in 38 candidate genes involved in DNA damage response in the WECARE study of second breast cancers in relation to radiotherapy exposure. 1. Introduction The Women’s Environment, Cancer And Radiation Epidemiology (WECARE) study [1] is aimed at a comprehensive examination of genes involved in particular functional pathways. The study is a population-based nested case-control study of 708 contralateral breast cancers (CBC) within a notional cohort of about 65,000 survivors of a first breast cancer, 1401 of whom serve as controls, and the primary exposure of interest is ionizing radiation dose to the contralateral breast from radiotherapy for treatment of the first cancer. Ionizing radiation is known to cause double strand breaks (DSBs) in DNA, which can invoke any of several DNA damage response mechanisms, primarily DSB repair via either homologous recombination or nonhomologous end joining, cell cycle checkpoint regulation, or apoptosis. The original study focused on mutations in the ATM gene, which plays a central role in the recognition of DSBs. The study was then extended to include BRCA1, BRCA2, and CHEK2, which are all involved in homologous recombination repair (HRR), and later still to include a broader set of 38 candidate genes involved in this and other pathways for DSB damage response. We have previously reported on the main effects of ionizing radiation [2, 3], ATM [4–6], BRCA1/2 [7–12], CHEK2 [13], and the interactions of radiation with ATM [14] and BRCA1/2 [15] as well as with other treatments and reproductive factors [16, 17], amongst other risk factors. The aim of this paper is to provide a comprehensive modeling strategy for examining the effects of all genes in a pathway and to apply the approach to candidate genes
References
[1]
J. L. Bernstein, B. Langholz, R. W. Haile et al., “Study design: evaluating gene-environment interactions in the etiology of breast cancer-the WECARE study,” Breast Cancer Research, vol. 6, no. 3, pp. R199–R214, 2004.
[2]
B. Langholz, D. C. Thomas, M. Stovall et al., “Statistical methods for analysis of radiation effects with tumor and dose location-specific information with application to the wecare study of asynchronous contralateral breast cancer,” Biometrics, vol. 65, no. 2, pp. 599–608, 2009.
[3]
M. Stovall, S. A. Smith, B. M. Langholz et al., “Dose to the contralateral breast from radiotherapy and risk of second primary breast cancer in the WECARE study,” International Journal of Radiation Oncology Biology Physics, vol. 72, no. 4, pp. 1021–1030, 2008.
[4]
J. L. Bernstein, S. Teraoka, M. C. Southey et al., “Population-based estimates of breast cancer risks associated with ATM gene variants c.7271T > G and c.1066-6T > G (IVS10-6T > G) from the breast cancer family registry,” Human Mutation, vol. 27, no. 11, pp. 1122–1128, 2006.
[5]
P. Concannon, R. W. Haile, A. L. B?orresen-Dale et al., “Variants in the ATM gene associated with a reduced risk of contralateral breast cancer,” Cancer Research, vol. 68, no. 16, pp. 6486–6491, 2008.
[6]
B. Langholz, J. L. Bernstein, L. Bernstein et al., “On the proposed association of the ATM variants 5557G>A and IVS38-8T>C and bilateral breast cancer,” International Journal of Cancer, vol. 119, no. 3, pp. 724–725, 2006.
[7]
C. B. Begg, R. W. Haile, ?. Borg et al., “Variation of breast cancer risk among BRCA1/2 carriers,” Journal of the American Medical Association, vol. 299, no. 2, pp. 194–201, 2008.
[8]
A. Borg, R. W. Haile, K. E. Malone et al., “Characterization of BRCA1 and BRCA2 deleterious mutations and variants of unknown clinical significance in unilateral and bilateral breast cancer: the WECARE study,” Human Mutation, vol. 31, no. 3, pp. E1200–E1240, 2010.
[9]
M. Capanu, P. Concannon, R. W. Haile et al., “Assessment of rare BRCA1 and BRCA2 variants of unknown significance using hierarchical modeling,” Genetic Epidemiology, vol. 35, no. 5, pp. 389–397, 2011.
[10]
J. C. Figueiredo, J. D. Brooks, D. V. Conti et al., “Risk of contralateral breast cancer associated with common variants in BRCA1 and BRCA2: potential modifying effect of BRCA1/BRCA2 mutation carrier status,” Breast Cancer Research and Treatment, vol. 127, no. 3, pp. 819–829, 2011.
[11]
M. A. Quintana, J. L. Berstein, D. C. Thomas, and D. V. Conti, “Incorporating model uncertainty in detecting rare variants: the Bayesian risk index,” Genetic Epidemiology, vol. 35, no. 7, pp. 638–649, 2011.
[12]
M. A. Quintana, F. R. Schumacher, G. Casey, J. L. Bernstein, L. Li, and D. V. Conti, “Incorporating prior biologic information for high-dimensional rare variant association studies,” Human Heredity, vol. 74, pp. 184–195, 2012.
[13]
L. Mellemkj?r, C. Dahl, J. H. Olsen et al., “Risk for contralateral breast cancer among carriers of the CHEK2*1100delC mutation in the WECARE Study,” British Journal of Cancer, vol. 98, no. 4, pp. 728–733, 2008.
[14]
J. L. Bernstein, R. W. Haile, M. Stovall et al., “Radiation exposure, the ATM gene, and contralateral breast cancer in the women's environmental cancer and radiation epidemiology study,” Journal of the National Cancer Institute, vol. 102, no. 7, pp. 475–483, 2010.
[15]
J. L. Bernstein, D. C. Thomas, R. E. Shore et al., “Contralateral breast cancer after radiotherapy among BRCA1 and BRCA2 mutation carriers: a WECARE study report,” European Journal of Cancer, vol. 49, no. 14, pp. 2979–2985, 2013.
[16]
J. N. Poynter, B. Langholz, J. Largent et al., “Reproductive factors and risk of contralateral breast cancer by BRCA1 and BRCA2 mutation status: results from the WECARE study,” Cancer Causes and Control, vol. 21, no. 6, pp. 839–846, 2010.
[17]
K. W. Reding, J. L. Bernstein, B. M. Langholz et al., “Adjuvant systemic therapy for breast cancer in BRCA1/BRCA2 mutation carriers in a population-based study of risk of contralateral breast cancer,” Breast Cancer Research and Treatment, vol. 123, no. 2, pp. 491–498, 2010.
[18]
K. Wang, M. Li, and M. Bucan, “Pathway-based approaches for analysis of genomewide association studies,” American Journal of Human Genetics, vol. 81, no. 6, pp. 1278–1283, 2007.
[19]
D. Thomas, “Methods for investigating gene-environment interactions in candidate pathway and genome-wide association studies,” Annual Review of Public Health, vol. 31, pp. 21–36, 2010.
[20]
D. Thomas, “Gene-environment-wide association studies: emerging approaches,” Nature Reviews Genetics, vol. 11, no. 4, pp. 259–272, 2010.
[21]
S. Basu and W. Pan, “Comparison of statistical tests for disease association with rare variants,” Genetic Epidemiology, vol. 35, no. 7, pp. 606–619, 2011.
[22]
M. A. Quintana and D. V. Conti, “Integrative variable selection via Bayesian model uncertainty,” Statistics in Medicine, 2013.
[23]
T. J. Hoffmann, N. J. Marini, and J. S. Witte, “Comprehensive approach to analyzing rare genetic variants,” PLoS ONE, vol. 5, no. 11, Article ID e13584, 2010.
[24]
L. S. Chen, C. M. Hutter, J. D. Potter et al., “Insights into colon cancer etiology via a regularized approach to gene set analysis of GWAS data,” American Journal of Human Genetics, vol. 86, no. 6, pp. 860–871, 2010.
[25]
J. D. Brooks, S. N. Teraoka, A. S. Reiner et al., “Variants in activators and downstream targets of ATM, radiation exposure, and contralateral breast cancer risk in the WECARE study,” Human Mutation, vol. 33, no. 1, pp. 158–164, 2012.
[26]
Gene Ontology Consortium, “The Gene Ontology in 2010: extensions and refinements,” Nucleic Acids Research, vol. 38, supplement 1, pp. 331–335, 2010.
[27]
R. Kass and A. Raftery, “Bayes factors,” Journal of the American Statistical Association, vol. 90, no. 430, pp. 773–795, 1995.
[28]
P. Concannon, R. W. Haile, A. L. B?orresen-Dale et al., “Variants in the ATM gene associated with a reduced risk of contralateral breast cancer,” Cancer Research, vol. 68, no. 16, pp. 6486–6491, 2008.
[29]
A. C. Antoniou, O. M. Sinilnikova, J. Simard et al., “RAD51 135?G→C modifies breast cancer risk among BRCA2 mutation carriers: results from a combined analysis of 19 studies,” American Journal of Human Genetics, vol. 81, no. 6, pp. 1186–1200, 2007.
[30]
A. C. Antoniou, O. M. Sinilnikova, J. Simard et al., “RAD51 135?G→C modifies breast cancer risk among BRCA2 mutation carriers: results from a combined analysis of 19 studies,” American Journal of Human Genetics, vol. 81, no. 6, pp. 1186–1200, 2007.
[31]
K. D. Yu, C. Yang, L. Fan, A. X. Chen, and Z. M. Shao, “RAD51 135?G>C does not modify breast cancer risk in non-BRCA1/2 mutation carriers: evidence from a meta-analysis of 12 studies,” Breast Cancer Research and Treatment, vol. 126, no. 2, pp. 365–371, 2011.
[32]
P. C. Ng and S. Henikoff, “SIFT: predicting amino acid changes that affect protein function,” Nucleic Acids Research, vol. 31, no. 13, pp. 3812–3814, 2003.
[33]
T. Xi, I. M. Jones, and H. W. Mohrenweiser, “Many amino acid substitution variants identified in DNA repair genes during human population screenings are predicted to impact protein function,” Genomics, vol. 83, no. 6, pp. 970–979, 2004.