-
Články
- Časopisy
- Kurzy
- Témy
- Kongresy
- Videa
- Podcasty
Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor
Despite important advances from Genome Wide Association Studies (GWAS), for most complex human traits and diseases, a sizable proportion of genetic variance remains unexplained and prediction accuracy (PA) is usually low. Evidence suggests that PA can be improved using Whole-Genome Regression (WGR) models where phenotypes are regressed on hundreds of thousands of variants simultaneously. The Genomic Best Linear Unbiased Prediction (G-BLUP, a ridge-regression type method) is a commonly used WGR method and has shown good predictive performance when applied to plant and animal breeding populations. However, breeding and human populations differ greatly in a number of factors that can affect the predictive performance of G-BLUP. Using theory, simulations, and real data analysis, we study the performance of G-BLUP when applied to data from related and unrelated human subjects. Under perfect linkage disequilibrium (LD) between markers and QTL, the prediction R-squared (R2) of G-BLUP reaches trait-heritability, asymptotically. However, under imperfect LD between markers and QTL, prediction R2 based on G-BLUP has a much lower upper bound. We show that the minimum decrease in prediction accuracy caused by imperfect LD between markers and QTL is given by (1−b)2, where b is the regression of marker-derived genomic relationships on those realized at causal loci. For pairs of related individuals, due to within-family disequilibrium, the patterns of realized genomic similarity are similar across the genome; therefore b is close to one inducing small decrease in R2. However, with distantly related individuals b reaches very low values imposing a very low upper bound on prediction R2. Our simulations suggest that for the analysis of data from unrelated individuals, the asymptotic upper bound on R2 may be of the order of 20% of the trait heritability. We show how PA can be enhanced with use of variable selection or differential shrinkage of estimates of marker effects.
Vyšlo v časopise: Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor. PLoS Genet 9(7): e32767. doi:10.1371/journal.pgen.1003608
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1003608Souhrn
Despite important advances from Genome Wide Association Studies (GWAS), for most complex human traits and diseases, a sizable proportion of genetic variance remains unexplained and prediction accuracy (PA) is usually low. Evidence suggests that PA can be improved using Whole-Genome Regression (WGR) models where phenotypes are regressed on hundreds of thousands of variants simultaneously. The Genomic Best Linear Unbiased Prediction (G-BLUP, a ridge-regression type method) is a commonly used WGR method and has shown good predictive performance when applied to plant and animal breeding populations. However, breeding and human populations differ greatly in a number of factors that can affect the predictive performance of G-BLUP. Using theory, simulations, and real data analysis, we study the performance of G-BLUP when applied to data from related and unrelated human subjects. Under perfect linkage disequilibrium (LD) between markers and QTL, the prediction R-squared (R2) of G-BLUP reaches trait-heritability, asymptotically. However, under imperfect LD between markers and QTL, prediction R2 based on G-BLUP has a much lower upper bound. We show that the minimum decrease in prediction accuracy caused by imperfect LD between markers and QTL is given by (1−b)2, where b is the regression of marker-derived genomic relationships on those realized at causal loci. For pairs of related individuals, due to within-family disequilibrium, the patterns of realized genomic similarity are similar across the genome; therefore b is close to one inducing small decrease in R2. However, with distantly related individuals b reaches very low values imposing a very low upper bound on prediction R2. Our simulations suggest that for the analysis of data from unrelated individuals, the asymptotic upper bound on R2 may be of the order of 20% of the trait heritability. We show how PA can be enhanced with use of variable selection or differential shrinkage of estimates of marker effects.
Zdroje
1. GuttmacherAE, CollinsFS (2002) Genomic medicine—a primer. New England Journal of Medicine 347 : 1512–1520.
2. National Institutes of Health, National Human Genome Research Institute (n.d.) A catalog of published genome-wide association studies. Available: http://www.genome.gov/gwastudies/.
3. MaherB (2008) Personal genomes: The case of the missing heritability. Nature 456 : 18.
4. ManolioTA, CollinsFS, CoxNJ, GoldsteinDB, HindorffLA, et al. (2009) Finding the missing heritability of complex diseases. Nature 461 : 747–753.
5. Lango AllenH, EstradaK, LettreG, BerndtSI, WeedonMN, et al. (2010) Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 467 : 832–838 doi:10.1038/nature09410
6. de los CamposG, GianolaD, AllisonDB (2010) Predicting genetic predisposition in humans: the promise of whole-genome markers. Nat Rev Genet 11 : 880–886 doi:10.1038/nrg2898
7. YangJ, BenyaminB, McEvoyBP, GordonS, HendersAK, et al. (2010) Common SNPs explain a large proportion of the heritability for human height. Nature genetics 42 : 565–569.
8. MakowskyR, PajewskiNM, KlimentidisYC, VazquezAI, DuarteCW, et al. (2011) Beyond Missing Heritability: Prediction of Complex Traits. PLoS Genet 7: e1002051.
9. MeuwissenTH, HayesBJ, GoddardME (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157 : 1819–1829.
10. BenjaminDJ, CesariniD, van der LoosMJHM, DawesCT, KoellingerPD, et al. (2012) The genetic architecture of economic and political preferences. Proceedings of the National Academy of Sciences 109 : 8026–8031.
11. HabierD, FernandoRL, DekkersJCM (2007) The impact of genetic relationship information on genome-assisted breeding values. Genetics 177 : 2389–2397.
12. HendersonCR (1975) Best linear unbiased estimation and prediction under a selection model. Biometrics 31 : 423–447.
13. PszczolaM, StrabelT, MulderHA, CalusMPL (2012) Reliability of direct genomic values for animals with different relationships within and to the reference population. Journal of dairy science 95 : 389–400.
14. DawberTR, MeadorsGF, MooreFEJr (1951) Epidemiological Approaches to Heart Disease: The Framingham Study*. American Journal of Public Health and the Nations Health 41 : 279–286.
15. CornelisMC, AgrawalA, ColeJW, HanselNN, BarnesKC, et al. (2010) The Gene, Environment Association Studies consortium (GENEVA): maximizing the knowledge obtained from GWAS by collaboration across studies of multiple conditions. Genetic epidemiology 34 : 364–372.
16. de los CamposG, HickeyJM, DaetwylerHD, Pong-WongR, CalusMPL (2012) Whole Genome Regression and Prediction Methods Applied to Plant and Animal Breeding. Genetics 193 : 327–345.
17. HoerlAE, KennardRW (1970) Ridge regression: Biased estimation for nonorthogonal problems. Technometrics 12 : 55–67.
18. FisherRA (1918) The correlation between relatives on the supposition of Mendelian inheritance. Transactions of the Royal Society of Edinburgh 52 : 399–433.
19. WrightS (1921) Systems of mating. II. The effects of inbreeding on the genetic composition of a population. Genetics 6 : 124.
20. HillWG, WeirBS (2011) Variation in actual relationship as a consequence of Mendelian sampling and linkage. Genetics Research 93 : 47–64 doi:10.1017/S0016672310000480
21. RitlandK (1996) A marker-based method for inferences about quantitative inheritance in natural populations. Evolution 1062–1073.
22. LynchM, RitlandK (1999) Estimation of pairwise relatedness with molecular markers. Genetics 152 : 1753.
23. VanRadenP (2007) Genomic measures of relationship and inbreeding. Interbull bull 37 : 33–36.
24. HayesBJ, VisscherP, GoddardM (2009) Increased accuracy of artificial selection by using the realized relationship matrix. Genet Res 91 : 47–60.
25. StrandénI, ChristensenOF (2011) Allele coding in genomic evaluation. GSE 43 : 25.
26. ZhangZ, LiuJ, DingX, BijmaP, De KoningDJ, et al. (2010) Best linear unbiased prediction of genomic breeding values using a trait-specific marker-derived relationship matrix. PloS one 5: e12648.
27. VanRadenPM, Van TassellCP, WiggansGR, SonstegardTS, SchnabelRD, et al. (2009) Invited review: reliability of genomic predictions for North American Holstein bulls. Journal of Dairy Science 92 : 16–24.
28. GoddardM (2009) Genomic selection: prediction of accuracy and maximisation of long term response. Genetica 136 : 245–257.
29. DaetwylerHD, VillanuevaB, WoolliamsJA (2008) Accuracy of predicting the genetic risk of disease using a genome-wide approach. PLoS One 3: e3395.
30. DaetwylerHD, Pong-WongR, VillanuevaB, WoolliamsJA (2010) The Impact of Genetic Architecture on Genome-Wide Evaluation Methods. Genetics 185 : 1021–1031 doi:10.1534/genetics.110.116855
31. VisscherPM (2010) A commentary on ‘common SNPs explain a large proportion of the heritability for human height’ by Yang et al.(2010). Twin Research and Human Genetics 13 : 517.
32. JanssL, de los CamposG, SheehanN, SorensenDA (2012) Inferences from Genomic Models in Stratifi_ed Populations. Genetics 693–704 doi:10.1534/genetics.112.141143
33. GoddardME, HayesBJ (2009) Mapping genes for complex traits in domestic animals and their use in breeding programmes. Nature Reviews Genetics 10 : 381–391.
Štítky
Genetika Reprodukčná medicína
Článek Independent Evolution of Transcriptional Inactivation on Sex Chromosomes in Birds and MammalsČlánek The bHLH Subgroup IIId Factors Negatively Regulate Jasmonate-Mediated Plant Defense and DevelopmentČlánek Selective Pressures to Maintain Attachment Site Specificity of Integrative and Conjugative ElementsČlánek Reassembly of Nucleosomes at the Promoter Initiates Resilencing Following Decitabine ExposureČlánek Hepatocyte Growth Factor Signaling in Intrapancreatic Ductal Cells Drives Pancreatic Morphogenesis
Článok vyšiel v časopisePLOS Genetics
Najčítanejšie tento týždeň
2013 Číslo 7- Gynekologové a odborníci na reprodukční medicínu se sejdou na prvním virtuálním summitu
- Je „freeze-all“ pro všechny? Odborníci na fertilitu diskutovali na virtuálním summitu
-
Všetky články tohto čísla
- An Solution for Crossover Formation
- Genome-Wide Association Mapping in Dogs Enables Identification of the Homeobox Gene, , as a Genetic Component of Neural Tube Defects in Humans
- Independent Evolution of Transcriptional Inactivation on Sex Chromosomes in Birds and Mammals
- Stepwise Activation of the ATR Signaling Pathway upon Increasing Replication Stress Impacts Fragile Site Integrity
- Genomic Analysis of Natural Selection and Phenotypic Variation in High-Altitude Mongolians
- Modification of tRNA by Elongator Is Essential for Efficient Translation of Stress mRNAs
- Role of CTCF Protein in Regulating Locus Transcription
- Gene Set Signature of Reversal Reaction Type I in Leprosy Patients
- Mapping of PARK2 and PACRG Overlapping Regulatory Region Reveals LD Structure and Functional Variants in Association with Leprosy in Unrelated Indian Population Groups
- Is Required for Formation of the Genital Ridge in Mice
- Monopolin Subunit Csm1 Associates with MIND Complex to Establish Monopolar Attachment of Sister Kinetochores at Meiosis I
- Recombination Dynamics of a Human Y-Chromosomal Palindrome: Rapid GC-Biased Gene Conversion, Multi-kilobase Conversion Tracts, and Rare Inversions
- Mechanisms of Protein Sequence Divergence and Incompatibility
- Histone Methyltransferase DOT1L Drives Recovery of Gene Expression after a Genotoxic Attack
- Female Behaviour Drives Expression and Evolution of Gustatory Receptors in Butterflies
- Combinatorial Regulation of Meiotic Holliday Junction Resolution in by HIM-6 (BLM) Helicase, SLX-4, and the SLX-1, MUS-81 and XPF-1 Nucleases
- The bHLH Subgroup IIId Factors Negatively Regulate Jasmonate-Mediated Plant Defense and Development
- The Role of Interruptions in polyQ in the Pathology of SCA1
- Dietary Restriction Induced Longevity Is Mediated by Nuclear Receptor NHR-62 in
- Fine Time Course Expression Analysis Identifies Cascades of Activation and Repression and Maps a Putative Regulator of Mammalian Sex Determination
- Genome-scale Co-evolutionary Inference Identifies Functions and Clients of Bacterial Hsp90
- Oxidative Stress and Replication-Independent DNA Breakage Induced by Arsenic in
- A Moonlighting Enzyme Links Cell Size with Central Metabolism
- Budding Yeast Greatwall and Endosulfines Control Activity and Spatial Regulation of PP2A for Timely Mitotic Progression
- The Conserved Intronic Cleavage and Polyadenylation Site of CstF-77 Gene Imparts Control of 3′ End Processing Activity through Feedback Autoregulation and by U1 snRNP
- The BTB-zinc Finger Transcription Factor Abrupt Acts as an Epithelial Oncogene in through Maintaining a Progenitor-like Cell State
- The Cohesion Protein SOLO Associates with SMC1 and Is Required for Synapsis, Recombination, Homolog Bias and Cohesion and Pairing of Centromeres in Drosophila Meiosis
- The RNA-binding Proteins FMR1, Rasputin and Caprin Act Together with the UBA Protein Lingerer to Restrict Tissue Growth in
- Pattern Dynamics in Adaxial-Abaxial Specific Gene Expression Are Modulated by a Plastid Retrograde Signal during Leaf Development
- A Network of HMG-box Transcription Factors Regulates Sexual Cycle in the Fungus
- Bacterial Adaptation through Loss of Function
- ENU-induced Mutation in the DNA-binding Domain of KLF3 Reveals Important Roles for KLF3 in Cardiovascular Development and Function in Mice
- Interplay between Structure-Specific Endonucleases for Crossover Control during Meiosis
- FGF Signalling Regulates Chromatin Organisation during Neural Differentiation via Mechanisms that Can Be Uncoupled from Transcription
- The Arabidopsis RNA Binding Protein with K Homology Motifs, SHINY1, Interacts with the C-terminal Domain Phosphatase-like 1 (CPL1) to Repress Stress-Inducible Gene Expression
- Selective Pressures to Maintain Attachment Site Specificity of Integrative and Conjugative Elements
- The Conserved ADAMTS-like Protein Lonely heart Mediates Matrix Formation and Cardiac Tissue Integrity
- The cGMP-Dependent Protein Kinase EGL-4 Regulates Nociceptive Behavioral Sensitivity
- RBM5 Is a Male Germ Cell Splicing Factor and Is Required for Spermatid Differentiation and Male Fertility
- Disease-Related Growth Factor and Embryonic Signaling Pathways Modulate an Enhancer of Expression at the 6q23.2 Coronary Heart Disease Locus
- Yeast Pol4 Promotes Tel1-Regulated Chromosomal Translocations
- A Dual Role for SOX10 in the Maintenance of the Postnatal Melanocyte Lineage and the Differentiation of Melanocyte Stem Cell Progenitors
- SLC26A4 Targeted to the Endolymphatic Sac Rescues Hearing and Balance in Mutant Mice
- Odoriferous Defensive Stink Gland Transcriptome to Identify Novel Genes Necessary for Quinone Synthesis in the Red Flour Beetle,
- Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor
- Gene × Physical Activity Interactions in Obesity: Combined Analysis of 111,421 Individuals of European Ancestry
- Reassembly of Nucleosomes at the Promoter Initiates Resilencing Following Decitabine Exposure
- Exquisite Light Sensitivity of Cryptochrome
- miR-133a Regulates Adipocyte Browning In Vivo
- Strabismus Promotes Recruitment and Degradation of Farnesylated Prickle in Planar Polarity Specification
- Hepatocyte Growth Factor Signaling in Intrapancreatic Ductal Cells Drives Pancreatic Morphogenesis
- Is a Potential Tumor Suppressor Gene Commonly Inactivated by Epigenetic Mechanisms in Colorectal Cancer
- Joint Molecule Resolution Requires the Redundant Activities of MUS-81 and XPF-1 during Meiosis
- The Mating Competence of Geographically Diverse Strains in Their Natural and Unnatural Sand Fly Vectors
- Defective Repair of Oxidative Base Lesions by the DNA Glycosylase Nth1 Associates with Multiple Telomere Defects
- Effective Blocking of the Enhancer Requires Cooperation between Two Main Mechanisms Suggested for the Insulator Function
- Trans-Ancestral Studies Fine Map the SLE-Susceptibility Locus
- PLOS Genetics
- Archív čísel
- Aktuálne číslo
- Informácie o časopise
Najčítanejšie v tomto čísle- SLC26A4 Targeted to the Endolymphatic Sac Rescues Hearing and Balance in Mutant Mice
- Bacterial Adaptation through Loss of Function
- The Cohesion Protein SOLO Associates with SMC1 and Is Required for Synapsis, Recombination, Homolog Bias and Cohesion and Pairing of Centromeres in Drosophila Meiosis
- Gene × Physical Activity Interactions in Obesity: Combined Analysis of 111,421 Individuals of European Ancestry
Prihlásenie#ADS_BOTTOM_SCRIPTS#Zabudnuté hesloZadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.
- Časopisy