-
Články
- Časopisy
- Kurzy
- Témy
- Kongresy
- Videa
- Podcasty
Relationship Estimation from Whole-Genome Sequence Data
The determination of the relationship between a pair of individuals is a fundamental application of genetics. Previously, we and others have demonstrated that identity-by-descent (IBD) information generated from high-density single-nucleotide polymorphism (SNP) data can greatly improve the power and accuracy of genetic relationship detection. Whole-genome sequencing (WGS) marks the final step in increasing genetic marker density by assaying all single-nucleotide variants (SNVs), and thus has the potential to further improve relationship detection by enabling more accurate detection of IBD segments and more precise resolution of IBD segment boundaries. However, WGS introduces new complexities that must be addressed in order to achieve these improvements in relationship detection. To evaluate these complexities, we estimated genetic relationships from WGS data for 1490 known pairwise relationships among 258 individuals in 30 families along with 46 population samples as controls. We identified several genomic regions with excess pairwise IBD in both the pedigree and control datasets using three established IBD methods: GERMLINE, fastIBD, and ISCA. These spurious IBD segments produced a 10-fold increase in the rate of detected false-positive relationships among controls compared to high-density microarray datasets. To address this issue, we developed a new method to identify and mask genomic regions with excess IBD. This method, implemented in ERSA 2.0, fully resolved the inflated cryptic relationship detection rates while improving relationship estimation accuracy. ERSA 2.0 detected all 1st through 6th degree relationships, and 55% of 9th through 11th degree relationships in the 30 families. We estimate that WGS data provides a 5% to 15% increase in relationship detection power relative to high-density microarray data for distant relationships. Our results identify regions of the genome that are highly problematic for IBD mapping and introduce new software to accurately detect 1st through 9th degree relationships from whole-genome sequence data.
Vyšlo v časopise: Relationship Estimation from Whole-Genome Sequence Data. PLoS Genet 10(1): e32767. doi:10.1371/journal.pgen.1004144
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1004144Souhrn
The determination of the relationship between a pair of individuals is a fundamental application of genetics. Previously, we and others have demonstrated that identity-by-descent (IBD) information generated from high-density single-nucleotide polymorphism (SNP) data can greatly improve the power and accuracy of genetic relationship detection. Whole-genome sequencing (WGS) marks the final step in increasing genetic marker density by assaying all single-nucleotide variants (SNVs), and thus has the potential to further improve relationship detection by enabling more accurate detection of IBD segments and more precise resolution of IBD segment boundaries. However, WGS introduces new complexities that must be addressed in order to achieve these improvements in relationship detection. To evaluate these complexities, we estimated genetic relationships from WGS data for 1490 known pairwise relationships among 258 individuals in 30 families along with 46 population samples as controls. We identified several genomic regions with excess pairwise IBD in both the pedigree and control datasets using three established IBD methods: GERMLINE, fastIBD, and ISCA. These spurious IBD segments produced a 10-fold increase in the rate of detected false-positive relationships among controls compared to high-density microarray datasets. To address this issue, we developed a new method to identify and mask genomic regions with excess IBD. This method, implemented in ERSA 2.0, fully resolved the inflated cryptic relationship detection rates while improving relationship estimation accuracy. ERSA 2.0 detected all 1st through 6th degree relationships, and 55% of 9th through 11th degree relationships in the 30 families. We estimate that WGS data provides a 5% to 15% increase in relationship detection power relative to high-density microarray data for distant relationships. Our results identify regions of the genome that are highly problematic for IBD mapping and introduce new software to accurately detect 1st through 9th degree relationships from whole-genome sequence data.
Zdroje
1. BoehnkeM, CoxNJ (1997) Accurate inference of relationships in sib-pair linkage studies. Am J Hum Genet 61 : 423–429.
2. VoightBF, PritchardJK (2005) Confounding from cryptic relatedness in case-control association studies. PLoS Genet 1: e32.
3. KongA, ThorleifssonG, GudbjartssonDF, MassonG, SigurdssonA, et al. (2010) Fine-scale recombination rate differences between sexes, populations and individuals. Nature 467 : 1099–1103.
4. XingJ, WatkinsWS, ShlienA, WalkerE, HuffCD, et al. (2010) Toward a more uniform sampling of human genetic diversity: A survey of worldwide populations by high-density genotyping. Genomics 96 : 199–210.
5. LinTH, MyersEW, XingEP (2006) Interpreting anonymous DNA samples from mass disasters–probabilistic forensic inference using genetic markers. Bioinformatics 22: e298–306.
6. Alvarez-CuberoMJ, SaizM, Martinez-GonzalezLJ, AlvarezJC, EisenbergAJ, et al. (2012) Genetic identification of missing persons: DNA analysis of human remains and compromised samples. Pathobiology 79 : 228–238.
7. ThompsonEA (1975) The estimation of pairwise relationships. Ann Hum Genet 39 : 173–188.
8. Ehm MGWM (1996) Test statistic to detect errors in sib-pair relationships. Am J Hum Genet Suppl 69: A217.
9. EpsteinMP, DurenWL, BoehnkeM (2000) Improved inference of relationship for pairs of individuals. Am J Hum Genet 67 : 1219–1231.
10. HuffCD, WitherspoonDJ, SimonsonTS, XingJ, WatkinsWS, et al. (2011) Maximum-likelihood estimation of recent shared ancestry (ERSA). Genome Res 21 : 768–774.
11. HennBM, HonL, MacphersonJM, ErikssonN, SaxonovS, et al. (2012) Cryptic distant relatives are common in both isolated and cosmopolitan genetic samples. PLoS One 7: e34267.
12. GusevA, LoweJK, StoffelM, DalyMJ, AltshulerD, et al. (2009) Whole population, genome-wide mapping of hidden relatedness. Genome Res 19 : 318–326.
13. BrowningBL, BrowningSR (2011) A fast, powerful method for detecting identity by descent. Am J Hum Genet 88 : 173–182.
14. RoachJC, GlusmanG, SmitAF, HuffCD, HubleyR, et al. (2010) Analysis of Genetic Inheritance in a Family Quartet by Whole-Genome Sequencing. Science 328 : 636–9.
15. Smit AFA, Hubley R. (2008–2010) RepeatModeler Open-1.0.
16. AlbrechtsenA, MoltkeI, NielsenR (2010) Natural selection and the distribution of identity-by-descent in the human genome. Genetics 186 : 295–308.
17. GusevA, PalamaraPF, AponteG, ZhuangZ, DarvasiA, et al. (2012) The architecture of long-range haplotypes shared within and across populations. Mol Biol Evol 29 : 473–486.
18. PriceAL, WealeME, PattersonN, MyersSR, NeedAC, et al. (2008) Long-range LD can confound genome scans in admixed populations. Am J Hum Genet 83 : 132–135 author reply 135–139.
19. TianC, PlengeRM, RansomM, LeeA, VillosladaP, et al. (2008) Analysis and application of European genetic substructure using 300 K SNP information. PLoS Genet 4: e4.
20. RoachJC, GlusmanG, SmitAF, HuffCD, HubleyR, et al. (2010) Analysis of genetic inheritance in a family quartet by whole-genome sequencing. Science 328 : 636–639.
21. RoachJC, GlusmanG, HubleyR, MontsaroffSZ, HollowayAK, et al. (2011) Chromosomal haplotypes by genetic phasing of human families. Am J Hum Genet 89 : 382–397.
22. ConsortiumTIH (2007) A second generation human haplotype map of over 3.1 million SNPs. Nature 449 : 851–861.
23. McVeanGA, MyersSR, HuntS, DeloukasP, BentleyDR, et al. (2004) The fine-scale structure of recombination rate variation in the human genome. Science 304 : 581–584.
24. BrowningSR, BrowningBL (2007) Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet 81 : 1084–1097.
25. ThomasA, SkolnickMH, LewisCM (1994) Genomic mismatch scanning in pedigrees. IMA J Math Appl Med Biol 11 : 1–16.
26. HillWG, WhiteIM (2013) Identification of pedigree relationship from genome sharing. G3 (Bethesda) 3 : 1553–1571.
27. PurcellS, NealeB, Todd-BrownK, ThomasL, FerreiraMA, et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81 : 559–575.
28. BrowningBL, BrowningSR (2013) Improving the Accuracy and Efficiency of Identity-By-Descent Detection in Population Data. Genetics 194(2): 459–471.
Štítky
Genetika Reprodukčná medicína
Článek Unwrapping BacteriaČlánek A Chaperone-Assisted Degradation Pathway Targets Kinetochore Proteins to Ensure Genome StabilityČlánek The Candidate Splicing Factor Sfswap Regulates Growth and Patterning of Inner Ear Sensory OrgansČlánek The SPF27 Homologue Num1 Connects Splicing and Kinesin 1-Dependent Cytoplasmic Trafficking inČlánek Down-Regulation of eIF4GII by miR-520c-3p Represses Diffuse Large B Cell Lymphoma DevelopmentČlánek Meta-Analysis Identifies Gene-by-Environment Interactions as Demonstrated in a Study of 4,965 MiceČlánek High Risk Population Isolate Reveals Low Frequency Variants Predisposing to Intracranial Aneurysms
Článok vyšiel v časopisePLOS Genetics
Najčítanejšie tento týždeň
2014 Číslo 1- Gynekologové a odborníci na reprodukční medicínu se sejdou na prvním virtuálním summitu
- Je „freeze-all“ pro všechny? Odborníci na fertilitu diskutovali na virtuálním summitu
-
Všetky články tohto čísla
- How Much Is That in Dog Years? The Advent of Canine Population Genomics
- The Sense and Sensibility of Strand Exchange in Recombination Homeostasis
- Unwrapping Bacteria
- DNA Methylation Changes Separate Allergic Patients from Healthy Controls and May Reflect Altered CD4 T-Cell Population Structure
- Evidence for Mito-Nuclear and Sex-Linked Reproductive Barriers between the Hybrid Italian Sparrow and Its Parent Species
- Translation Enhancing ACA Motifs and Their Silencing by a Bacterial Small Regulatory RNA
- Relationship Estimation from Whole-Genome Sequence Data
- Genetic Models of Apoptosis-Induced Proliferation Decipher Activation of JNK and Identify a Requirement of EGFR Signaling for Tissue Regenerative Responses in
- ComEA Is Essential for the Transfer of External DNA into the Periplasm in Naturally Transformable Cells
- Loss and Recovery of Genetic Diversity in Adapting Populations of HIV
- Bioelectric Signaling Regulates Size in Zebrafish Fins
- Defining NELF-E RNA Binding in HIV-1 and Promoter-Proximal Pause Regions
- Loss of Histone H3 Methylation at Lysine 4 Triggers Apoptosis in
- Cell-Cycle Dependent Expression of a Translocation-Mediated Fusion Oncogene Mediates Checkpoint Adaptation in Rhabdomyosarcoma
- How a Retrotransposon Exploits the Plant's Heat Stress Response for Its Activation
- A Nonsense Mutation in Encoding a Nondescript Transmembrane Protein Causes Idiopathic Male Subfertility in Cattle
- Deletion of a Conserved -Element in the Locus Highlights the Role of Acute Histone Acetylation in Modulating Inducible Gene Transcription
- Developmental Link between Sex and Nutrition; Regulates Sex-Specific Mandible Growth via Juvenile Hormone Signaling in Stag Beetles
- PP2A/B55 and Fcp1 Regulate Greatwall and Ensa Dephosphorylation during Mitotic Exit
- Differential Effects of Collagen Prolyl 3-Hydroxylation on Skeletal Tissues
- Comprehensive Functional Annotation of 77 Prostate Cancer Risk Loci
- Evolution of Chloroplast Transcript Processing in and Its Chromerid Algal Relatives
- A Chaperone-Assisted Degradation Pathway Targets Kinetochore Proteins to Ensure Genome Stability
- New MicroRNAs in —Birth, Death and Cycles of Adaptive Evolution
- A Genome-Wide Screen for Bacterial Envelope Biogenesis Mutants Identifies a Novel Factor Involved in Cell Wall Precursor Metabolism
- FGFR1-Frs2/3 Signalling Maintains Sensory Progenitors during Inner Ear Hair Cell Formation
- Regulation of Synaptic /Neuroligin Abundance by the /Nrf Stress Response Pathway Protects against Oxidative Stress
- Intrasubtype Reassortments Cause Adaptive Amino Acid Replacements in H3N2 Influenza Genes
- Molecular Specificity, Convergence and Constraint Shape Adaptive Evolution in Nutrient-Poor Environments
- WNT7B Promotes Bone Formation in part through mTORC1
- Natural Selection Reduced Diversity on Human Y Chromosomes
- In-Vivo Quantitative Proteomics Reveals a Key Contribution of Post-Transcriptional Mechanisms to the Circadian Regulation of Liver Metabolism
- The Candidate Splicing Factor Sfswap Regulates Growth and Patterning of Inner Ear Sensory Organs
- The Acid Phosphatase-Encoding Gene Contributes to Soybean Tolerance to Low-Phosphorus Stress
- p53 and TAp63 Promote Keratinocyte Proliferation and Differentiation in Breeding Tubercles of the Zebrafish
- Affects Plant Architecture by Regulating Local Auxin Biosynthesis
- The SET Domain Proteins SUVH2 and SUVH9 Are Required for Pol V Occupancy at RNA-Directed DNA Methylation Loci
- Down-Regulation of Rad51 Activity during Meiosis in Yeast Prevents Competition with Dmc1 for Repair of Double-Strand Breaks
- Multi-tissue Analysis of Co-expression Networks by Higher-Order Generalized Singular Value Decomposition Identifies Functionally Coherent Transcriptional Modules
- A Neurotoxic Glycerophosphocholine Impacts PtdIns-4, 5-Bisphosphate and TORC2 Signaling by Altering Ceramide Biosynthesis in Yeast
- Subtle Changes in Motif Positioning Cause Tissue-Specific Effects on Robustness of an Enhancer's Activity
- C/EBPα Is Required for Long-Term Self-Renewal and Lineage Priming of Hematopoietic Stem Cells and for the Maintenance of Epigenetic Configurations in Multipotent Progenitors
- The SPF27 Homologue Num1 Connects Splicing and Kinesin 1-Dependent Cytoplasmic Trafficking in
- Down-Regulation of eIF4GII by miR-520c-3p Represses Diffuse Large B Cell Lymphoma Development
- Genome Sequencing Highlights the Dynamic Early History of Dogs
- Re-sequencing Expands Our Understanding of the Phenotypic Impact of Variants at GWAS Loci
- Meta-Analysis Identifies Gene-by-Environment Interactions as Demonstrated in a Study of 4,965 Mice
- , a -Antisense Gene of , Encodes a Evolved Protein That Inhibits GSK3β Resulting in the Stabilization of MYCN in Human Neuroblastomas
- A Transcription Factor Is Wound-Induced at the Planarian Midline and Required for Anterior Pole Regeneration
- A Comprehensive tRNA Deletion Library Unravels the Genetic Architecture of the tRNA Pool
- A PNPase Dependent CRISPR System in
- Genomic Confirmation of Hybridisation and Recent Inbreeding in a Vector-Isolated Population
- Zinc Finger Transcription Factors Displaced SREBP Proteins as the Major Sterol Regulators during Saccharomycotina Evolution
- GATA6 Is a Crucial Regulator of Shh in the Limb Bud
- Tissue Specific Roles for the Ribosome Biogenesis Factor Wdr43 in Zebrafish Development
- A Cell Cycle and Nutritional Checkpoint Controlling Bacterial Surface Adhesion
- High Risk Population Isolate Reveals Low Frequency Variants Predisposing to Intracranial Aneurysms
- E3 Ubiquitin Ligase CHIP and NBR1-Mediated Selective Autophagy Protect Additively against Proteotoxicity in Plant Stress Responses
- Evolutionary Rate Covariation Identifies New Members of a Protein Network Required for Female Post-Mating Responses
- 3′ Untranslated Regions Mediate Transcriptional Interference between Convergent Genes Both Locally and Ectopically in
- Single Nucleus Genome Sequencing Reveals High Similarity among Nuclei of an Endomycorrhizal Fungus
- Metabolic QTL Analysis Links Chloroquine Resistance in to Impaired Hemoglobin Catabolism
- Notch Controls Cell Adhesion in the Drosophila Eye
- AL PHD-PRC1 Complexes Promote Seed Germination through H3K4me3-to-H3K27me3 Chromatin State Switch in Repression of Seed Developmental Genes
- Genomes Reveal Evolution of Microalgal Oleaginous Traits
- Large Inverted Duplications in the Human Genome Form via a Fold-Back Mechanism
- Variation in Genome-Wide Levels of Meiotic Recombination Is Established at the Onset of Prophase in Mammalian Males
- Age, Gender, and Cancer but Not Neurodegenerative and Cardiovascular Diseases Strongly Modulate Systemic Effect of the Apolipoprotein E4 Allele on Lifespan
- Lifespan Extension Conferred by Endoplasmic Reticulum Secretory Pathway Deficiency Requires Induction of the Unfolded Protein Response
- Is Non-Homologous End-Joining Really an Inherently Error-Prone Process?
- Vestigialization of an Allosteric Switch: Genetic and Structural Mechanisms for the Evolution of Constitutive Activity in a Steroid Hormone Receptor
- Functional Divergence and Evolutionary Turnover in Mammalian Phosphoproteomes
- A 660-Kb Deletion with Antagonistic Effects on Fertility and Milk Production Segregates at High Frequency in Nordic Red Cattle: Additional Evidence for the Common Occurrence of Balancing Selection in Livestock
- Comparative Evolutionary and Developmental Dynamics of the Cotton () Fiber Transcriptome
- The Transcription Factor BcLTF1 Regulates Virulence and Light Responses in the Necrotrophic Plant Pathogen
- Crossover Patterning by the Beam-Film Model: Analysis and Implications
- Single Cell Genomics: Advances and Future Perspectives
- PLOS Genetics
- Archív čísel
- Aktuálne číslo
- Informácie o časopise
Najčítanejšie v tomto čísle- GATA6 Is a Crucial Regulator of Shh in the Limb Bud
- Large Inverted Duplications in the Human Genome Form via a Fold-Back Mechanism
- Differential Effects of Collagen Prolyl 3-Hydroxylation on Skeletal Tissues
- Affects Plant Architecture by Regulating Local Auxin Biosynthesis
Prihlásenie#ADS_BOTTOM_SCRIPTS#Zabudnuté hesloZadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.
- Časopisy