Quantifying the Underestimation of Relative Risks from Genome-Wide Association Studies

English version České info

Genome-wide association studies (GWAS) have identified hundreds of associated loci across many common diseases. Most risk variants identified by GWAS will merely be tags for as-yet-unknown causal variants. It is therefore possible that identification of the causal variant, by fine mapping, will identify alleles with larger effects on genetic risk than those currently estimated from GWAS replication studies. We show that under plausible assumptions, whilst the majority of the per-allele relative risks (RR) estimated from GWAS data will be close to the true risk at the causal variant, some could be considerable underestimates. For example, for an estimated RR in the range 1.2–1.3, there is approximately a 38% chance that it exceeds 1.4 and a 10% chance that it is over 2. We show how these probabilities can vary depending on the true effects associated with low-frequency variants and on the minor allele frequency (MAF) of the most associated SNP. We investigate the consequences of the underestimation of effect sizes for predictions of an individual's disease risk and interpret our results for the design of fine mapping experiments. Although these effects mean that the amount of heritability explained by known GWAS loci is expected to be larger than current projections, this increase is likely to explain a relatively small amount of the so-called “missing” heritability.

Vyšlo v časopise: Quantifying the Underestimation of Relative Risks from Genome-Wide Association Studies. PLoS Genet 7(3): e32767. doi:10.1371/journal.pgen.1001337
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1001337

Souhrn

Zdroje

1. ManolioTA

BrooksLD

CollinsFS

2008 A HapMap harvest of insights into the genetics of common disease. J Clin Invest 118 1590 1605

2. FrazerKA

BallingerDG

CoxDR

HindsDA

StuveLL

2007 A second generation human haplotype map of over 3.1 million SNPs. Nature 449 851 861

3. MarchiniJ

HowieB

MyersS

McVeanG

DonnellyP

2007 A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet 39 906 913

4. ZollnerS

PritchardJK

2007 Overcoming the winner's curse: estimating penetrance parameters from case-control data. Am J Hum Genet 80 605 615

5. IlesMM

2008 What can genome-wide association studies tell us about the genetics of common disease? PLoS Genet 4 e33 doi:10.1371/journal.pgen.0040033

6. 2005 A haplotype map of the human genome. Nature 437 1299 1320

7. WakefieldJ

2008 Bayes factors for genome-wide association studies: comparison with P-values. Genet Epidemiol 33 79 86

8. WangWY

BarrattBJ

ClaytonDG

ToddJA

2005 Genome-wide association studies: theoretical and practical concerns. Nat Rev Genet 6 109 118

9. PritchardJK

2001 Are rare variants responsible for susceptibility to complex diseases? Am J Hum Genet 69 124 137

10. ZondervanKT

CardonLR

2004 The complex interplay among factors that influence allelic association. Nat Rev Genet 5 89 100

11. BarrettJC

HansoulS

NicolaeDL

ChoJH

DuerrRH

2008 Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease. Nat Genet 40 955 962

12. ZegginiE

ScottLJ

SaxenaR

VoightBF

MarchiniJL

2008 Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes. Nat Genet 40 638 645

13. PharoahPD

AntoniouAC

EastonDF

PonderBA

2008 Polygenes, risk prediction, and targeted prevention of breast cancer. N Engl J Med 358 2796 2803

14. RischN

1990 Linkage strategies for genetically complex traits. II. The power of affected relative pairs. Am J Hum Genet 46 229 241

15. ThomasDC

2004 Statistical methods in genetic epidemiology Oxford Oxford University Press xxi, 435

16. PharoahPD

DayNE

DuffyS

EastonDF

PonderBA

1997 Family history and the risk of breast cancer: a systematic review and meta-analysis. Int J Cancer 71 800 809

17. WeijnenCF

RichSS

MeigsJB

KrolewskiAS

WarramJH

2002 Risk of diabetes in siblings of index cases with Type 2 diabetes: implications for genetic studies. Diabet Med 19 41 50

18. OrholmM

MunkholmP

LangholzE

NielsenOH

SorensenTI

1991 Familial occurrence of inflammatory bowel disease. N Engl J Med 324 84 88

19. The Wellcome Trust Case Control Consortium 2007 Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447 661 678

20. MaherB

2008 Personal genomes: The case of the missing heritability. Nature 456 18 21

21. ClaytonDG

2009 Prediction and interaction in complex disease genetics: experience in type 1 diabetes. PLoS Genet 5 e1000540 doi:10.1371/journal.pgen.1000540

22. WallaceC

ClaytonD

2003 Estimating the relative recurrence risk ratio using a global cross-ratio model. Genet Epidemiol 25 293 302

23. 2004 The ENCODE (ENCyclopedia Of DNA Elements) Project. Science 306 636 640

24. 2003 The International HapMap Project. Nature 426 789 796

25. MyersS

BottoloL

FreemanC