-
Články
- Časopisy
- Kurzy
- Témy
- Kongresy
- Videa
- Podcasty
Power and Predictive Accuracy of Polygenic Risk Scores
Polygenic scores have recently been used to summarise genetic effects among an ensemble of markers that do not individually achieve significance in a large-scale association study. Markers are selected using an initial training sample and used to construct a score in an independent replication sample by forming the weighted sum of associated alleles within each subject. Association between a trait and this composite score implies that a genetic signal is present among the selected markers, and the score can then be used for prediction of individual trait values. This approach has been used to obtain evidence of a genetic effect when no single markers are significant, to establish a common genetic basis for related disorders, and to construct risk prediction models. In some cases, however, the desired association or prediction has not been achieved. Here, the power and predictive accuracy of a polygenic score are derived from a quantitative genetics model as a function of the sizes of the two samples, explained genetic variance, selection thresholds for including a marker in the score, and methods for weighting effect sizes in the score. Expressions are derived for quantitative and discrete traits, the latter allowing for case/control sampling. A novel approach to estimating the variance explained by a marker panel is also proposed. It is shown that published studies with significant association of polygenic scores have been well powered, whereas those with negative results can be explained by low sample size. It is also shown that useful levels of prediction may only be approached when predictors are estimated from very large samples, up to an order of magnitude greater than currently available. Therefore, polygenic scores currently have more utility for association testing than predicting complex traits, but prediction will become more feasible as sample sizes continue to grow.
Vyšlo v časopise: Power and Predictive Accuracy of Polygenic Risk Scores. PLoS Genet 9(3): e32767. doi:10.1371/journal.pgen.1003348
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1003348Souhrn
Polygenic scores have recently been used to summarise genetic effects among an ensemble of markers that do not individually achieve significance in a large-scale association study. Markers are selected using an initial training sample and used to construct a score in an independent replication sample by forming the weighted sum of associated alleles within each subject. Association between a trait and this composite score implies that a genetic signal is present among the selected markers, and the score can then be used for prediction of individual trait values. This approach has been used to obtain evidence of a genetic effect when no single markers are significant, to establish a common genetic basis for related disorders, and to construct risk prediction models. In some cases, however, the desired association or prediction has not been achieved. Here, the power and predictive accuracy of a polygenic score are derived from a quantitative genetics model as a function of the sizes of the two samples, explained genetic variance, selection thresholds for including a marker in the score, and methods for weighting effect sizes in the score. Expressions are derived for quantitative and discrete traits, the latter allowing for case/control sampling. A novel approach to estimating the variance explained by a marker panel is also proposed. It is shown that published studies with significant association of polygenic scores have been well powered, whereas those with negative results can be explained by low sample size. It is also shown that useful levels of prediction may only be approached when predictors are estimated from very large samples, up to an order of magnitude greater than currently available. Therefore, polygenic scores currently have more utility for association testing than predicting complex traits, but prediction will become more feasible as sample sizes continue to grow.
Zdroje
1. VisscherPM, BrownMA, McCarthyMI, YangJ (2012) Five years of GWAS discovery. Am J Hum Genet 90 : 7–24.
2. WrayNR, GoddardME, VisscherPM (2007) Prediction of individual genetic risk to disease from genome-wide association studies. Genome Res 17 : 1520–1528.
3. PurcellSM, WrayNR, StoneJL, VisscherPM, O'DonovanMC, et al. (2009) Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature 460 : 748–752.
4. RipkeS, SandersAR, KendlerKS, LevinsonDF, SklarP, et al. (2011) Genome-wide association study identifies five new schizophrenia loci. Nat Genet 43 : 969–976.
5. HamshereML, O'DonovanMC, JonesIR, JonesL, KirovG, et al. (2011) Polygenic dissection of the bipolar phenotype. Br J Psychiatry 198 : 284–288.
6. BushWS, SawcerSJ, de JagerPL, OksenbergJR, McCauleyJL, et al. (2010) Evidence for polygenic susceptibility to multiple sclerosis–the shape of things to come. Am J Hum Genet 86 : 621–625.
7. Lango AllenH, EstradaK, LettreG, BerndtSI, WeedonMN, et al. (2010) Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 467 : 832–838.
8. SimonsonMA, WillsAG, KellerMC, McQueenMB (2011) Recent methods for polygenic analysis of genome-wide data implicate an important effect of common variants on cardiovascular disease risk. BMC Med Genet 12 : 146.
9. StahlEA, WegmannD, TrynkaG, Gutierrez-AchuryJ, DoR, et al. (2012) Bayesian inference analyses of the polygenic architecture of rheumatoid arthritis. Nat Genet 44 : 483–489.
10. SpeliotesEK, WillerCJ, BerndtSI, MondaKL, ThorleifssonG, et al. (2010) Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nat Genet 42 : 937–948.
11. PetersonRE, MaesHH, HolmansP, SandersAR, LevinsonDF, et al. (2011) Genetic risk sum score comprised of common polygenic variation is associated with body mass index. Hum Genet 129 : 221–230.
12. CarayolJ, SchellenbergGD, ToresF, HagerJ, ZieglerA, et al. (2010) Assessing the impact of a combined analysis of four common low-risk genetic variants on autism risk. Mol Autism 1 : 4.
13. KangJ, KugathasanS, GeorgesM, ZhaoH, ChoJH (2011) Improved risk prediction for Crohn's disease with a multi-locus approach. Hum Mol Genet 20 : 2435–2442.
14. MachielaMJ, ChenCY, ChenC, ChanockSJ, HunterDJ, et al. (2011) Evaluation of polygenic risk scores for predicting breast and prostate cancer risk. Genet Epidemiol 35 : 506–514.
15. WitteJS, HoffmannTJ (2011) Polygenic modeling of genome-wide association studies: an application to prostate and breast cancer. OMICS 15 : 393–398.
16. PharoahPD, AntoniouAC, EastonDF, PonderBA (2008) Polygenes, risk prediction, and targeted prevention of breast cancer. N Engl J Med 358 : 2796–2803.
17. ClaytonDG (2009) Prediction and interaction in complex disease genetics: experience in type 1 diabetes. PLoS Genet 5: e1000540 doi:10.1371/journal.pgen.1000540
18. SawcerS, BanM, WasonJ, DudbridgeF (2010) What role for genetics in the prediction of multiple sclerosis? Ann Neurol 67 : 3–10.
19. EvansDM, VisscherPM, WrayNR (2009) Harnessing the information contained within genome-wide association studies to improve individual prediction of complex disease risk. Hum Mol Genet 18 : 3525–3531.
20. PharoahPD, AntoniouA, BobrowM, ZimmernRL, EastonDF, et al. (2002) Polygenic susceptibility to breast cancer and implications for prevention. Nat Genet 31 : 33–36.
21. WrayNR, YangJ, GoddardME, VisscherPM (2010) The genetic interpretation of area under the ROC curve in genomic profiling. PLoS Genet 6: e1000864 doi:10.1371/journal.pgen.1000864
22. JanssensAC, AulchenkoYS, ElefanteS, BorsboomGJ, SteyerbergEW, et al. (2006) Predictive testing for complex diseases using multiple genes: fact or fiction? Genet Med 8 : 395–400.
23. DaetwylerHD, VillanuevaB, WoolliamsJA (2008) Accuracy of predicting the genetic risk of disease using a genome-wide approach. PLoS ONE 3: e3395 doi:10.1371/journal.pone.0003395
24. SoHC, ShamPC (2010) A unifying framework for evaluating the predictive power of genetic variants based on the level of heritability explained. PLoS Genet 6: e1001230 doi:10.1371/journal.pgen.1001230
25. LeeSH, GoddardME, WrayNR, VisscherPM (2012) A better coefficient of determination for genetic profile analysis. Genet Epidemiol 36 : 214–224.
26. ShiJ, LevinsonDF, DuanJ, SandersAR, ZhengY, et al. (2009) Common variants on chromosome 6p22.1 are associated with schizophrenia. Nature 460 : 753–757.
27. LeeSH, DeCandiaTR, RipkeS, YangJ, SullivanPF, et al. (2012) Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs. Nat Genet 44 : 247–250.
28. SklarP, RipkeS, ScottLJ, AndreassenOA, CichonS, et al. (2011) Large-scale genome-wide association analysis of bipolar disorder identifies a new susceptibility locus near ODZ4. Nat Genet 43 : 977–983.
29. RischN (2001) The genetic epidemiology of cancer: interpreting family and twin studies and their implications for molecular genetic approaches. Cancer Epidemiol Biomarkers Prev 10 : 733–741.
30. LeeSH, WrayNR, GoddardME, VisscherPM (2011) Estimating missing heritability for disease from genome-wide association studies. Am J Hum Genet 88 : 294–305.
31. JanssensAC, MoonesingheR, YangQ, SteyerbergEW, van DuijnCM, et al. (2007) The impact of genotype frequencies on the clinical validity of genomic profiling for predicting common chronic diseases. Genet Med 9 : 528–535.
32. DudbridgeF, GusnantoA (2008) Estimation of significance thresholds for genomewide association scans. Genet Epidemiol 32 : 227–234.
33. YangJ, WeedonMN, PurcellS, LettreG, EstradaK, et al. (2011) Genomic inflation factors under polygenic inheritance. Eur J Hum Genet 19 : 807–812.
34. WacholderS, HartgeP, PrenticeR, Garcia-ClosasM, FeigelsonHS, et al. (2010) Performance of common genetic variants in breast-cancer risk models. N Engl J Med 362 : 986–993.
35. GoddardME, WrayNR, VerbylaK, VisscherPM (2009) Estimating Effects and Making Predictions from Genome-Wide Marker Data. Statistical Science 24 : 517–529.
36. GreenlandS (2000) Principles of multilevel modelling. Int J Epidemiol 29 : 158–167.
37. YangJ, BenyaminB, McEvoyBP, GordonS, HendersAK, et al. (2010) Common SNPs explain a large proportion of the heritability for human height. Nat Genet 42 : 565–569.
38. WuTT, ChenYF, HastieT, SobelE, LangeK (2009) Genome-wide association analysis by lasso penalized logistic regression. Bioinformatics 25 : 714–721.
39. HoggartCJ, WhittakerJC, De IorioM, BaldingDJ (2008) Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies. PLoS Genet 4: e1000130 doi:10.1371/journal.pgen.1000130
40. Falconer DS, Mackay TFC (1996) Introduction to Quantitative Genetics: Longman.
41. Ruppert D, Wand MP, Carroll RJ (2003) Semiparametric regression: Cambridge University Press.
Štítky
Genetika Reprodukčná medicína
Článek Ubiquitous Polygenicity of Human Complex Traits: Genome-Wide Analysis of 49 Traits in KoreansČlánek Alternative Splicing and Subfunctionalization Generates Functional Diversity in Fungal ProteomesČlánek RFX Transcription Factor DAF-19 Regulates 5-HT and Innate Immune Responses to Pathogenic Bacteria inČlánek Surveillance-Activated Defenses Block the ROS–Induced Mitochondrial Unfolded Protein ResponseČlánek Deficiency Reduces Adipose OXPHOS Capacity and Triggers Inflammation and Insulin Resistance in Mice
Článok vyšiel v časopisePLOS Genetics
Najčítanejšie tento týždeň
2013 Číslo 3- Gynekologové a odborníci na reprodukční medicínu se sejdou na prvním virtuálním summitu
- Je „freeze-all“ pro všechny? Odborníci na fertilitu diskutovali na virtuálním summitu
-
Všetky články tohto čísla
- Power and Predictive Accuracy of Polygenic Risk Scores
- Rare Copy Number Variants Are a Common Cause of Short Stature
- Coordination of Flower Maturation by a Regulatory Circuit of Three MicroRNAs
- Ubiquitous Polygenicity of Human Complex Traits: Genome-Wide Analysis of 49 Traits in Koreans
- Genomic Evidence for Island Population Conversion Resolves Conflicting Theories of Polar Bear Evolution
- Mechanistic Insight into the Pathology of Polyalanine Expansion Disorders Revealed by a Mouse Model for X Linked Hypopituitarism
- Genome-Wide Association Study and Gene Expression Analysis Identifies as a Predictor of Response to Etanercept Therapy in Rheumatoid Arthritis
- Problem Solved: An Interview with Sir Edwin Southern
- Long Interspersed Element–1 (LINE-1): Passenger or Driver in Human Neoplasms?
- Mouse HFM1/Mer3 Is Required for Crossover Formation and Complete Synapsis of Homologous Chromosomes during Meiosis
- Alternative Splicing and Subfunctionalization Generates Functional Diversity in Fungal Proteomes
- A WRKY Transcription Factor Recruits the SYG1-Like Protein SHB1 to Activate Gene Expression and Seed Cavity Enlargement
- Microhomology-Mediated Mechanisms Underlie Non-Recurrent Disease-Causing Microdeletions of the Gene or Its Regulatory Domain
- Ancient Evolutionary Trade-Offs between Yeast Ploidy States
- Differential Evolutionary Fate of an Ancestral Primate Endogenous Retrovirus Envelope Gene, the EnvV , Captured for a Function in Placentation
- A Feed-Forward Loop Coupling Extracellular BMP Transport and Morphogenesis in Wing
- The Tomato Yellow Leaf Curl Virus Resistance Genes and Are Allelic and Code for DFDGD-Class RNA–Dependent RNA Polymerases
- The U-Box E3 Ubiquitin Ligase TUD1 Functions with a Heterotrimeric G α Subunit to Regulate Brassinosteroid-Mediated Growth in Rice
- Role of the DSC1 Channel in Regulating Neuronal Excitability in : Extending Nervous System Stability under Stress
- –Independent Phenotypic Switching in and a Dual Role for Wor1 in Regulating Switching and Filamentation
- Pax6 Regulates Gene Expression in the Vertebrate Lens through miR-204
- Blood-Informative Transcripts Define Nine Common Axes of Peripheral Blood Gene Expression
- Genetic Architecture of Skin and Eye Color in an African-European Admixed Population
- Fine Characterisation of a Recombination Hotspot at the Locus and Resolution of the Paradoxical Excess of Duplications over Deletions in the General Population
- Estrogen Mediated-Activation of miR-191/425 Cluster Modulates Tumorigenicity of Breast Cancer Cells Depending on Estrogen Receptor Status
- Complex Patterns of Genomic Admixture within Southern Africa
- Yap- and Cdc42-Dependent Nephrogenesis and Morphogenesis during Mouse Kidney Development
- Molecular Networks of Human Muscle Adaptation to Exercise and Age
- Alp/Enigma Family Proteins Cooperate in Z-Disc Formation and Myofibril Assembly
- Polycomb Group Gene Regulates Rice () Seed Development and Grain Filling via a Mechanism Distinct from
- RFX Transcription Factor DAF-19 Regulates 5-HT and Innate Immune Responses to Pathogenic Bacteria in
- Distinct Molecular Strategies for Hox-Mediated Limb Suppression in : From Cooperativity to Dispensability/Antagonism in TALE Partnership
- A Natural Polymorphism in rDNA Replication Origins Links Origin Activation with Calorie Restriction and Lifespan
- TDP2–Dependent Non-Homologous End-Joining Protects against Topoisomerase II–Induced DNA Breaks and Genome Instability in Cells and
- Recurrent Rearrangement during Adaptive Evolution in an Interspecific Yeast Hybrid Suggests a Model for Rapid Introgression
- Genome-Wide Association Study in Mutation Carriers Identifies Novel Loci Associated with Breast and Ovarian Cancer Risk
- Coincident Resection at Both Ends of Random, γ–Induced Double-Strand Breaks Requires MRX (MRN), Sae2 (Ctp1), and Mre11-Nuclease
- Identification of a -Specific Modifier Locus at 6p24 Related to Breast Cancer Risk
- A Novel Function for the Hox Gene in the Male Accessory Gland Regulates the Long-Term Female Post-Mating Response in
- Tdp2: A Means to Fixing the Ends
- A Novel Role for the RNA–Binding Protein FXR1P in Myoblasts Cell-Cycle Progression by Modulating mRNA Stability
- Association Mapping and the Genomic Consequences of Selection in Sunflower
- Histone Deacetylase 2 (HDAC2) Regulates Chromosome Segregation and Kinetochore Function via H4K16 Deacetylation during Oocyte Maturation in Mouse
- A Novel Mutation in the Upstream Open Reading Frame of the Gene Causes a MEN4 Phenotype
- Ataxin1L Is a Regulator of HSC Function Highlighting the Utility of Cross-Tissue Comparisons for Gene Discovery
- Human Spermatogenic Failure Purges Deleterious Mutation Load from the Autosomes and Both Sex Chromosomes, including the Gene
- A Conserved Upstream Motif Orchestrates Autonomous, Germline-Enriched Expression of piRNAs
- Statistical Analysis Reveals Co-Expression Patterns of Many Pairs of Genes in Yeast Are Jointly Regulated by Interacting Loci
- Matefin/SUN-1 Phosphorylation Is Part of a Surveillance Mechanism to Coordinate Chromosome Synapsis and Recombination with Meiotic Progression and Chromosome Movement
- A Role for the Malignant Brain Tumour (MBT) Domain Protein LIN-61 in DNA Double-Strand Break Repair by Homologous Recombination
- The Population and Evolutionary Dynamics of Phage and Bacteria with CRISPR–Mediated Immunity
- Long Noncoding RNA MALAT1 Controls Cell Cycle Progression by Regulating the Expression of Oncogenic Transcription Factor B-MYB
- Surveillance-Activated Defenses Block the ROS–Induced Mitochondrial Unfolded Protein Response
- DNA Topoisomerase III Localizes to Centromeres and Affects Centromeric CENP-A Levels in Fission Yeast
- Genome-Wide Control of RNA Polymerase II Activity by Cohesin
- Divergent Selection Drives Genetic Differentiation in an R2R3-MYB Transcription Factor That Contributes to Incipient Speciation in
- NODULE INCEPTION Directly Targets Subunit Genes to Regulate Essential Processes of Root Nodule Development in
- Spreading of a Prion Domain from Cell-to-Cell by Vesicular Transport in
- Deficiency in Origin Licensing Proteins Impairs Cilia Formation: Implications for the Aetiology of Meier-Gorlin Syndrome
- Deficiency Reduces Adipose OXPHOS Capacity and Triggers Inflammation and Insulin Resistance in Mice
- The Conserved SKN-1/Nrf2 Stress Response Pathway Regulates Synaptic Function in
- Functional Genomic Analysis of the Regulatory Network in
- Astakine 2—the Dark Knight Linking Melatonin to Circadian Regulation in Crustaceans
- CRL2 E3-Ligase Regulates Proliferation and Progression through Meiosis in the Germline
- Both the Caspase CSP-1 and a Caspase-Independent Pathway Promote Programmed Cell Death in Parallel to the Canonical Pathway for Apoptosis in
- PRMT4 Is a Novel Coactivator of c-Myb-Dependent Transcription in Haematopoietic Cell Lines
- A Copy Number Variant at the Locus Likely Confers Risk for Canine Squamous Cell Carcinoma of the Digit
- Evidence of Gene–Environment Interactions between Common Breast Cancer Susceptibility Loci and Established Environmental Risk Factors
- HIV Infection Disrupts the Sympatric Host–Pathogen Relationship in Human Tuberculosis
- Trans-Ethnic Fine-Mapping of Lipid Loci Identifies Population-Specific Signals and Allelic Heterogeneity That Increases the Trait Variance Explained
- A Gene Transfer Agent and a Dynamic Repertoire of Secretion Systems Hold the Keys to the Explosive Radiation of the Emerging Pathogen
- The Role of ATM in the Deficiency in Nonhomologous End-Joining near Telomeres in a Human Cancer Cell Line
- Dynamic Circadian Protein–Protein Interaction Networks Predict Temporal Organization of Cellular Functions
- Nuclear Myosin 1c Facilitates the Chromatin Modifications Required to Activate rRNA Gene Transcription and Cell Cycle Progression
- Robust Prediction of Expression Differences among Human Individuals Using Only Genotype Information
- A Single Cohesin Complex Performs Mitotic and Meiotic Functions in the Protist
- The Role of the Arabidopsis Exosome in siRNA–Independent Silencing of Heterochromatic Loci
- Elevated Expression of the Integrin-Associated Protein PINCH Suppresses the Defects of Muscle Hypercontraction Mutants
- Twist1 Controls a Cell-Specification Switch Governing Cell Fate Decisions within the Cardiac Neural Crest
- Genome-Wide Testing of Putative Functional Exonic Variants in Relationship with Breast and Prostate Cancer Risk in a Multiethnic Population
- Heteroduplex DNA Position Defines the Roles of the Sgs1, Srs2, and Mph1 Helicases in Promoting Distinct Recombination Outcomes
- PLOS Genetics
- Archív čísel
- Aktuálne číslo
- Informácie o časopise
Najčítanejšie v tomto čísle- Fine Characterisation of a Recombination Hotspot at the Locus and Resolution of the Paradoxical Excess of Duplications over Deletions in the General Population
- Molecular Networks of Human Muscle Adaptation to Exercise and Age
- Recurrent Rearrangement during Adaptive Evolution in an Interspecific Yeast Hybrid Suggests a Model for Rapid Introgression
- Genome-Wide Association Study and Gene Expression Analysis Identifies as a Predictor of Response to Etanercept Therapy in Rheumatoid Arthritis
Prihlásenie#ADS_BOTTOM_SCRIPTS#Zabudnuté hesloZadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.
- Časopisy