PLOS ONE  2013 

Bayesian Detection of Causal Rare Variants under Posterior Consistency

DOI: 10.1371/journal.pone.0069633

Abstract:

Identifying causal rare variants associated with complex traits is a central challenge in genome-wide association studies. However, most current research focuses only on testing the global association, that is, whether the rare variants in a given genomic region are collectively associated with the trait. Although some recent work, e.g., the Bayesian risk index method, has tried to address this problem, it is unclear whether such methods can consistently identify the causal rare variants in the small-n-large-P situation. We develop a new Bayesian method, the Bayesian Rare Variant Detector (BRVD), to tackle this problem. The new method simultaneously addresses two questions: (i) (Global association test) Are any of the variants associated with the disease? and (ii) (Causal variant detection) Which variants, if any, are driving the association? The BRVD ensures that the causal rare variants are consistently identified in the small-n-large-P situation by imposing appropriate prior distributions on the model and the model-specific parameters. The numerical results indicate that the BRVD is more powerful for testing the global association than existing methods, such as the combined multivariate and collapsing test, the weighted sum statistic test, RARECOVER, the sequence kernel association test, and the Bayesian risk index, and that it is also more powerful than the Bayesian risk index method for identifying causal rare variants. The BRVD has also been successfully applied to the Early-Onset Myocardial Infarction (EOMI) Exome Sequence Data, where it identified a few causal rare variants that have been verified in the literature.
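
The two questions in the abstract can both be read off a posterior distribution over models, where each model is a subset of variants. The sketch below is a generic Bayesian model-space summary, not the BRVD algorithm itself: the function name `summarize_model_posterior`, the toy posterior probabilities, and the 0.5 inclusion threshold are all assumptions made for illustration and do not come from the paper.

```python
def summarize_model_posterior(model_probs, n_variants):
    """Summarize a posterior over variant subsets (hypothetical helper).

    model_probs maps a frozenset of variant indices to that model's
    posterior probability; the empty set is the null (no-association) model.
    """
    total = sum(model_probs.values())
    probs = {m: p / total for m, p in model_probs.items()}  # renormalize

    # (i) Global association: posterior probability that some variant matters.
    global_assoc = 1.0 - probs.get(frozenset(), 0.0)

    # (ii) Causal variant detection: marginal inclusion probability of each
    # variant, i.e. the total probability of the models that contain it.
    inclusion = [
        sum(p for m, p in probs.items() if j in m) for j in range(n_variants)
    ]
    return global_assoc, inclusion


# Toy posterior over models for 3 variants (made-up numbers, not paper results).
toy_posterior = {
    frozenset(): 0.10,
    frozenset({0}): 0.50,
    frozenset({0, 2}): 0.30,
    frozenset({1}): 0.10,
}

global_assoc, inclusion = summarize_model_posterior(toy_posterior, 3)

# Assumed decision rule: flag variants whose inclusion probability exceeds 0.5.
selected = [j for j, p in enumerate(inclusion) if p > 0.5]
```

In this toy setting a single posterior answers both questions: the mass away from the null model measures the global association, and the per-variant sums indicate which variants drive it.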
