-
Články
- Časopisy
- Kurzy
- Témy
- Kongresy
- Videa
- Podcasty
A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations
Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-nucleotide polymorphisms (SNPs). However, this similarity has been evident primarily in a qualitative sense; and, because different multivariate techniques and marker sets have been used in different studies, it has not been possible to formally compare genetic variation datasets in terms of their levels of similarity with geography. In this study, using genome-wide SNP data from 128 populations worldwide, we perform a systematic analysis to quantitatively evaluate the similarity of genes and geography in different geographic regions. For each of a series of regions, we apply a Procrustes analysis approach to find an optimal transformation that maximizes the similarity between PCA maps of genetic variation and geographic maps of population locations. We consider examples in Europe, Sub-Saharan Africa, Asia, East Asia, and Central/South Asia, as well as in a worldwide sample, finding that significant similarity between genes and geography exists in general at different geographic levels. The similarity is highest in our examples for Asia and, once highly distinctive populations have been removed, Sub-Saharan Africa. Our results provide a quantitative assessment of the geographic structure of human genetic variation worldwide, supporting the view that geography plays a strong role in giving rise to human population structure.
Vyšlo v časopise: A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations. PLoS Genet 8(8): e32767. doi:10.1371/journal.pgen.1002886
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1002886Souhrn
Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-nucleotide polymorphisms (SNPs). However, this similarity has been evident primarily in a qualitative sense; and, because different multivariate techniques and marker sets have been used in different studies, it has not been possible to formally compare genetic variation datasets in terms of their levels of similarity with geography. In this study, using genome-wide SNP data from 128 populations worldwide, we perform a systematic analysis to quantitatively evaluate the similarity of genes and geography in different geographic regions. For each of a series of regions, we apply a Procrustes analysis approach to find an optimal transformation that maximizes the similarity between PCA maps of genetic variation and geographic maps of population locations. We consider examples in Europe, Sub-Saharan Africa, Asia, East Asia, and Central/South Asia, as well as in a worldwide sample, finding that significant similarity between genes and geography exists in general at different geographic levels. The similarity is highest in our examples for Asia and, once highly distinctive populations have been removed, Sub-Saharan Africa. Our results provide a quantitative assessment of the geographic structure of human genetic variation worldwide, supporting the view that geography plays a strong role in giving rise to human population structure.
Zdroje
1. SokalRR, OdenNL, WilsonC (1991) Genetic evidence for the spread of agriculture in Europe by demic diffusion. Nature 351 : 143–145.
2. Cavalli-Sforza LL, Menozzi P, Piazza A (1994) The History and Geography of Human Genes. Princeton: Princeton University Press.
3. BarbujaniG (2000) Geographic patterns: how to identify them and why. Hum Biol 72 : 133–153.
4. Cavalli-SforzaLL, FeldmanMW (2003) The application of molecular genetic approaches to the study of human evolution. Nat Genet 33(Suppl):266–275.
5. NovembreJ, RamachandranS (2011) Perspectives on human population structure at the cusp of the sequencing era. Annu Rev Genomics Hum Genet 12 : 245–274.
6. RamachandranS, DeshpandeO, RosemanCC, RosenbergNA, FeldmanMW, et al. (2005) Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. Proc Natl Acad Sci USA 102 : 15942–15947.
7. LiJZ, AbsherDM, TangH, SouthwickAM, CastoAM, et al. (2008) Worldwide human relationships inferred from genome-wide patterns of variation. Science 319 : 1100–1104.
8. JakobssonM, ScholzSW, ScheetP, GibbsJR, VanLiereJM, et al. (2008) Genotype, haplotype and copy-number variation in worldwide human populations. Nature 451 : 998–1003.
9. NovembreJ, JohnsonT, BrycK, KutalikZ, BoykoAR, et al. (2008) Genes mirror geography within Europe. Nature 456 : 98–101.
10. BiswasS, ScheinfeldtLB, AkeyJM (2009) Genome-wide insights into the patterns and determinants of fine-scale population structure in humans. Am J Hum Genet 84 : 641–650.
11. MenozziP, PiazzaA, Cavalli-SforzaL (1978) Synthetic maps of human gene frequencies in Europeans. Science 201 : 786–792.
12. PattersonN, PriceAL, ReichD (2006) Population structure and eigenanalysis. PLoS Genet 2: e190 doi:10.1371/journal.pgen.0020190.
13. Cox TF, Cox MAA (2001) Multidimensional Scaling. Boca Raton: Chapman & Hall, 2nd edition.
14. PaschouP, ZivE, BurchardEG, ChoudhryS, Rodriguez-CintronW, et al. (2007) PCA-correlated SNPs for structure identification in worldwide human populations. PLoS Genet 3: e160 doi:10.1371/journal.pgen.0030160.
15. WangC, SzpiechZA, DegnanJH, JakobssonM, PembertonTJ, et al. (2010) Comparing spatial maps of human population-genetic variation using Procrustes analysis. Stat Appl Genet Mol Biol 9: Article 13.
16. LaoO, LuTT, NothnagelM, JungeO, Freitag-WolfS, et al. (2008) Correlation between genetic and geographic structure in Europe. Curr Biol 18 : 1241–1248.
17. HeathSC, GutIG, BrennanP, McKayJD, BenckoV, et al. (2008) Investigation of the fine structure of European populations with applications to disease association studies. Eur J Hum Genet 16 : 1413–1429.
18. JakkulaE, RehnströmK, VariloT, PietiläinenOPH, PaunioT, et al. (2008) The genome-wide patterns of variation expose significant substructure in a founder population. Am J Hum Genet 83 : 787–794.
19. HoggartCJ, O'ReillyPF, KaakinenM, ZhangW, ChambersJC, et al. (2012) Fine-scale estimation of location of birth from genome-wide single-nucleotide polymorphism data. Genetics 190 : 669–677.
20. PriceAL, HelgasonA, PalssonS, StefanssonH, St ClairD, et al. (2009) The impact of divergence time on the nature of population structure: an example from Iceland. PLoS Genet 5: e1000505 doi:10.1371/journal.pgen.1000505.
21. SalmelaE, LappalainenT, LiuJ, SistonenP, AndersenPM, et al. (2011) Swedish population substructure revealed by genome-wide single nucleotide polymorphism data. PLoS ONE 6: e16747 doi:10.1371/journal.pone.0016747.
22. XingJ, WatkinsWS, WitherspoonDJ, ZhangY, GutherySL, et al. (2009) Fine-scaled human genetic structure revealed by SNP microarrays. Genome Res 19 : 815–825.
23. XingJ, WatkinsWS, ShlienA, WalkerE, HuffCD, et al. (2010) Toward a more uniform sampling of human genetic diversity: a survey of worldwide populations by high-density genotyping. Genomics 96 : 199–210.
24. The HUGO Pan-Asian SNP Consortium (2009) Mapping human genetic diversity in Asia. Science 326 : 1541–1545.
25. TianC, KosoyR, LeeA, RansomM, BelmontJW, et al. (2008) Analysis of East Asia genetic substructure using genome-wide SNP arrays. PLoS ONE 3: e3862 doi:10.1371/journal.pone.0003862.
26. BrycK, AutonA, NelsonMR, OksenbergJR, HauserSL, et al. (2010) Genome-wide patterns of population structure and admixture in West Africans and African Americans. Proc Natl Acad Sci USA 107 : 786–791.
27. SikoraM, LaayouniH, CalafellF, ComasD, BertranpetitJ (2011) A genomic analysis identifies a novel component in the genetic structure of sub-Saharan African populations. Eur J Hum Genet 19 : 84–88.
28. ChenJ, ZhengH, BeiJX, SunL, JiaWH, et al. (2009) Genetic structure of the Han Chinese population revealed by genome-wide SNP variation. Am J Hum Genet 85 : 775–785.
29. XuS, YinX, LiS, JinW, LouH, et al. (2009) Genomic dissection of population substructure of Han Chinese and its implication in association studies. Am J Hum Genet 85 : 762–774.
30. Yamaguchi-KabataY, NakazonoK, TakahashiA, SaitoS, HosonoN, et al. (2008) Japanese population structure, based on SNP genotypes from 7003 individuals compared to other ethnic groups: effects on population-based association studies. Am J Hum Genet 83 : 445–456.
31. PembertonTJ, AbsherD, FeldmanMW, MyersRM, RosenbergNA, et al. Genomic patterns of homozygosity in worldwide human populations. Am J Hum Genet (in press).
32. SimonsonT, YangY, HuffCD, YunH, QinG, et al. (2010) Genetic evidence for high-altitude adaptation in Tibet. Science 329 : 72–75.
33. The International HapMap 3 Consortium (2010) Integrating common and rare genetic variation in diverse human populations. Nature 467 : 52–58.
34. AutonA, BrycK, BoykoAR, LohmuellerKE, NovembreJ, et al. (2009) Global distribution of genomic diversity underscores rich complex history of continental human populations. Genome Res 19 : 795–803.
35. BowcockAM, Ruiz-LinaresA, TomfohrdeJ, MinchE, KiddJR, et al. (1994) High resolution of human evolutionary trees with polymorphic microsatellites. Nature 368 : 455–457.
36. RosenbergNA, PritchardJK, WeberJL, CannHM, KiddKK, et al. (2002) Genetic structure of human populations. Science 298 : 2381–2385.
37. TishkoffSA, ReedFA, FriedlaenderFR, EhretC, RanciaroA, et al. (2009) The genetic structure and history of Africans and African Americans. Science 324 : 1035–1044.
38. HennBM, GignouxCR, JobinM, GrankaJM, MacphersonJM, et al. (2011) Hunter-gatherer genomic diversity suggests a southern African origin for modern humans. Proc Natl Acad Sci USA 108 : 5154–5162.
39. Bregel Y (2003) An Historical Atlas of Central Asia. Boston: Brill.
40. Du R, Yip VF (1993) Ethnic Groups in China. Beijing: Science Press.
41. PowellGT, YangH, Tyler-SmithC, XueY (2007) The population history of the Xibe in northern China: a comparison of autosomal, mtDNA and Y-chromosomal analyses of migration and gene ow. Forensic Sci Int Genet 1 : 115–119.
42. Weir BS (1996) Genetic Data Analysis II. Sunderland, MA: Sinauer.
43. McVeanG (2009) A genealogical interpretation of principal components analysis. PLoS Genet 5: e1000686 doi:10.1371/journal.pgen.1000686.
44. NovembreJ, StephensM (2008) Interpreting principal component analyses of spatial population genetic variation. Nature Genet 40 : 646–649.
45. RosenbergNA (2011) A population-genetic perspective on the similarities and differences among worldwide human populations. Hum Biol 83 : 659–684.
46. EngelhardtBE, StephensM (2010) Analysis of population structure: a unifying framework and novel methods based on sparse factor analysis. PLoS Genet 6: e1001117 doi:10.1371/journal.pgen.1001117.
47. RosenbergNA, MahajanS, RamachandranS, ZhaoC, PritchardJK, et al. (2005) Clines, clusters, and the effect of study design on the inference of human population structure. PLoS Genet 1 doi:10.1371/journal.pgen.0010070.
48. YangWY, NovembreJ, EskinE, HalperinE (2012) A model-based approach for analysis of spatial structure in genetic data. Nat Genet 44 : 725–731.
49. PembertonTJ, WangC, LiJZ, RosenbergNA (2010) Inference of unexpected genetic relatedness among individuals in HapMap Phase III. Am J Hum Genet 87 : 457–464.
50. NelsonMR, BrycK, KingKS, IndapA, BoykoAR, et al. (2008) The Population Reference Sample, POPRES: a resource for population, disease, and pharmacological genetics research. Am J Hum Genet 83 : 347–358.
51. MailmanMD, FeoloM, JinY, KimuraM, TrykaK, et al. (2007) The NCBI dbGaP database of genotypes and phenotypes. Nat Genet 39 : 1181–1186.
52. PriceAL, PattersonNJ, PlengeRM, WeinblattME, ShadickNA, et al. (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nature Genet 38 : 904–909.
53. Hastie T, Tibshirani R, Friedman J (2009) The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York: Springer, 2nd edition.
54. WeirBS, CockerhamCC (1984) Estimating F-statistics for the analysis of population structure. Evolution 38 : 1358–1370.
Štítky
Genetika Reprodukčná medicína
Článek Mutational Signatures of De-Differentiation in Functional Non-Coding Regions of Melanoma GenomesČlánek Rescuing Alu: Recovery of Inserts Shows LINE-1 Preserves Alu Activity through A-Tail ExpansionČlánek Genetics and Regulatory Impact of Alternative Polyadenylation in Human B-Lymphoblastoid CellsČlánek Retrovolution: HIV–Driven Evolution of Cellular Genes and Improvement of Anticancer Drug ActivationČlánek The Mi-2 Chromatin-Remodeling Factor Regulates Higher-Order Chromatin Structure and Cohesin DynamicsČlánek Identification of Human Proteins That Modify Misfolding and Proteotoxicity of Pathogenic Ataxin-1
Článok vyšiel v časopisePLOS Genetics
Najčítanejšie tento týždeň
2012 Číslo 8- Gynekologové a odborníci na reprodukční medicínu se sejdou na prvním virtuálním summitu
- Je „freeze-all“ pro všechny? Odborníci na fertilitu diskutovali na virtuálním summitu
-
Všetky články tohto čísla
- Mutational Signatures of De-Differentiation in Functional Non-Coding Regions of Melanoma Genomes
- Rescuing Alu: Recovery of Inserts Shows LINE-1 Preserves Alu Activity through A-Tail Expansion
- Genetics and Regulatory Impact of Alternative Polyadenylation in Human B-Lymphoblastoid Cells
- Chromosome Territories Meet a Condensin
- It's All in the Timing: Too Much E2F Is a Bad Thing
- Fine-Mapping and Initial Characterization of QT Interval Loci in African Americans
- Genome Patterns of Selection and Introgression of Haplotypes in Natural Populations of the House Mouse ()
- A Combinatorial Amino Acid Code for RNA Recognition by Pentatricopeptide Repeat Proteins
- Advances in Quantitative Trait Analysis in Yeast
- Experimental Evolution of a Novel Sexually Antagonistic Allele
- Variation of Contributes to Dog Breed Skull Diversity
- , a Gene Involved in Axonal Pathfinding, Is Mutated in Patients with Kallmann Syndrome
- A Single Origin for Nymphalid Butterfly Eyespots Followed by Widespread Loss of Associated Gene Expression
- Cryptocephal, the ATF4, Is a Specific Coactivator for Ecdysone Receptor Isoform B2
- Retrovolution: HIV–Driven Evolution of Cellular Genes and Improvement of Anticancer Drug Activation
- The PARN Deadenylase Targets a Discrete Set of mRNAs for Decay and Regulates Cell Motility in Mouse Myoblasts
- A Sexual Ornament in Chickens Is Affected by Pleiotropic Alleles at and , Selected during Domestication
- Use of Allele-Specific FAIRE to Determine Functional Regulatory Polymorphism Using Large-Scale Genotyping Arrays
- Novel Loci for Metabolic Networks and Multi-Tissue Expression Studies Reveal Genes for Atherosclerosis
- The Genetic Basis of Pollinator Adaptation in a Sexually Deceptive Orchid
- Uncovering the Genome-Wide Transcriptional Responses of the Filamentous Fungus to Lignocellulose Using RNA Sequencing
- Inheritance Beyond Plain Heritability: Variance-Controlling Genes in
- The Metabochip, a Custom Genotyping Array for Genetic Studies of Metabolic, Cardiovascular, and Anthropometric Traits
- Reprogramming to Pluripotency Can Conceal Somatic Cell Chromosomal Instability
- Condensin II Promotes the Formation of Chromosome Territories by Inducing Axial Compaction of Polyploid Interphase Chromosomes
- PTEN Negatively Regulates MAPK Signaling during Vulval Development
- A Dynamic Response Regulator Protein Modulates G-Protein–Dependent Polarity in the Bacterium
- Population Genomics of the Facultatively Mutualistic Bacteria and
- Components of a Fanconi-Like Pathway Control Pso2-Independent DNA Interstrand Crosslink Repair in Yeast
- Polysome Profiling in Liver Identifies Dynamic Regulation of Endoplasmic Reticulum Translatome by Obesity and Fasting
- Stromal Liver Kinase B1 [STK11] Signaling Loss Induces Oviductal Adenomas and Endometrial Cancer by Activating Mammalian Target of Rapamycin Complex 1
- Reprogramming of H3K27me3 Is Critical for Acquisition of Pluripotency from Cultured Tissues
- Transgene Induced Co-Suppression during Vegetative Growth in
- Hox and Sex-Determination Genes Control Segment Elimination through EGFR and Activity
- A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations
- Minibrain/Dyrk1a Regulates Food Intake through the Sir2-FOXO-sNPF/NPY Pathway in and Mammals
- Comparative Analysis of Regulatory Elements between and by Genome-Wide Transcription Start Site Profiling
- Simple Methods for Generating and Detecting Locus-Specific Mutations Induced with TALENs in the Zebrafish Genome
- S Phase–Coupled E2f1 Destruction Ensures Homeostasis in Proliferating Tissues
- Cell-Nonautonomous Signaling of FOXO/DAF-16 to the Stem Cells of
- The Mi-2 Chromatin-Remodeling Factor Regulates Higher-Order Chromatin Structure and Cohesin Dynamics
- Comparative Analysis of the Genomes of Two Field Isolates of the Rice Blast Fungus
- Role of Mex67-Mtr2 in the Nuclear Export of 40S Pre-Ribosomes
- Genetic Modulation of Lipid Profiles following Lifestyle Modification or Metformin Treatment: The Diabetes Prevention Program
- HAL-2 Promotes Homologous Pairing during Meiosis by Antagonizing Inhibitory Effects of Synaptonemal Complex Precursors
- SLX-1 Is Required for Maintaining Genomic Integrity and Promoting Meiotic Noncrossovers in the Germline
- Phylogenetic and Transcriptomic Analysis of Chemosensory Receptors in a Pair of Divergent Ant Species Reveals Sex-Specific Signatures of Odor Coding
- Reduced Prostasin (CAP1/PRSS8) Activity Eliminates HAI-1 and HAI-2 Deficiency–Associated Developmental Defects by Preventing Matriptase Activation
- Dissecting the Gene Network of Dietary Restriction to Identify Evolutionarily Conserved Pathways and New Functional Genes
- Identification of Human Proteins That Modify Misfolding and Proteotoxicity of Pathogenic Ataxin-1
- and Link Transcription of Phospholipid Biosynthetic Genes to ER Stress and the UPR
- CDK9 and H2B Monoubiquitination: A Well-Choreographed Dance
- Rare Copy Number Variations in Adults with Tetralogy of Fallot Implicate Novel Risk Gene Pathways
- Ccdc94 Protects Cells from Ionizing Radiation by Inhibiting the Expression of
- NOL11, Implicated in the Pathogenesis of North American Indian Childhood Cirrhosis, Is Required for Pre-rRNA Transcription and Processing
- Human Developmental Enhancers Conserved between Deuterostomes and Protostomes
- A Luminal Glycoprotein Drives Dose-Dependent Diameter Expansion of the Hindgut Tube
- Melanophore Migration and Survival during Zebrafish Adult Pigment Stripe Development Require the Immunoglobulin Superfamily Adhesion Molecule Igsf11
- Dynamic Distribution of Linker Histone H1.5 in Cellular Differentiation
- Combining Comparative Proteomics and Molecular Genetics Uncovers Regulators of Synaptic and Axonal Stability and Degeneration
- Chemical Genetics Reveals a Specific Requirement for Cdk2 Activity in the DNA Damage Response and Identifies Nbs1 as a Cdk2 Substrate in Human Cells
- Experimental Relocation of the Mitochondrial Gene to the Nucleus Reveals Forces Underlying Mitochondrial Genome Evolution
- Rates of Gyrase Supercoiling and Transcription Elongation Control Supercoil Density in a Bacterial Chromosome
- Mutations in a P-Type ATPase Gene Cause Axonal Degeneration
- A General G1/S-Phase Cell-Cycle Control Module in the Flowering Plant
- Multiple Roles and Interactions of and in Development of the Respiratory System
- UNC-40/DCC, SAX-3/Robo, and VAB-1/Eph Polarize F-Actin during Embryonic Morphogenesis by Regulating the WAVE/SCAR Actin Nucleation Complex
- Epigenetic Remodeling of Meiotic Crossover Frequency in DNA Methyltransferase Mutants
- Modulating the Strength and Threshold of NOTCH Oncogenic Signals by
- Loss of Axonal Mitochondria Promotes Tau-Mediated Neurodegeneration and Alzheimer's Disease–Related Tau Phosphorylation Via PAR-1
- Acetyl-CoA-Carboxylase Sustains a Fatty Acid–Dependent Remote Signal to Waterproof the Respiratory System
- ATXN2-CAG42 Sequesters PABPC1 into Insolubility and Induces FBXW8 in Cerebellum of Old Ataxic Knock-In Mice
- Cohesin Rings Devoid of Scc3 and Pds5 Maintain Their Stable Association with the DNA
- The MicroRNA Inhibits Calcium Signaling by Targeting the TIR-1/Sarm1 Adaptor Protein to Control Stochastic L/R Neuronal Asymmetry in
- Rapid-Throughput Skeletal Phenotyping of 100 Knockout Mice Identifies 9 New Genes That Determine Bone Strength
- The Genes Define Unique Classes of Two-Partner Secretion and Contact Dependent Growth Inhibition Systems
- PLOS Genetics
- Archív čísel
- Aktuálne číslo
- Informácie o časopise
Najčítanejšie v tomto čísle- Dissecting the Gene Network of Dietary Restriction to Identify Evolutionarily Conserved Pathways and New Functional Genes
- It's All in the Timing: Too Much E2F Is a Bad Thing
- Variation of Contributes to Dog Breed Skull Diversity
- The PARN Deadenylase Targets a Discrete Set of mRNAs for Decay and Regulates Cell Motility in Mouse Myoblasts
Prihlásenie#ADS_BOTTOM_SCRIPTS#Zabudnuté hesloZadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.
- Časopisy