Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 42
Filter
1.
G3 (Bethesda) ; 14(2)2024 Feb 07.
Article in English | MEDLINE | ID: mdl-38038370

ABSTRACT

Low-pass sequencing with genotype imputation has been adopted as a cost-effective method for genotyping. The most widely used method of short-read sequencing uses sequencing by synthesis (SBS). Here we perform a study of a novel sequencing technology-avidity sequencing. In this short note, we compare the performance of imputation from low-pass libraries sequenced on an Element AVITI system (which utilizes avidity sequencing) to those sequenced on an Illumina NovaSeq 6000 (which utilizes SBS) with an SP flow cell for the same set of biological samples across a range of genetic ancestries. We observed dramatically lower optical duplication rates in the data deriving from the AVITI system compared to the NovaSeq 6000, resulting in higher effective coverage given a fixed number of sequenced bases, and comparable imputation accuracy performance between sequencing chemistries across ancestries. This study demonstrates that avidity sequencing is a viable alternative to the standard SBS chemistries for applications involving low-pass sequencing plus imputation.


Subject(s)
Genome-Wide Association Study , Polymorphism, Single Nucleotide , Genotype , Genome-Wide Association Study/methods
2.
Front Genet ; 14: 1148301, 2023.
Article in English | MEDLINE | ID: mdl-37359370

ABSTRACT

The increasing incidence of bovine congestive heart failure (BCHF) in feedlot cattle poses a significant challenge to the beef industry from economic loss, reduced performance, and reduced animal welfare attributed to cardiac insufficiency. Changes to cardiac morphology as well as abnormal pulmonary arterial pressure (PAP) in cattle of mostly Angus ancestry have been recently characterized. However, congestive heart failure affecting cattle late in the feeding period has been an increasing problem and tools are needed for the industry to address the rate of mortality in the feedlot for multiple breeds. At harvest, a population of 32,763 commercial fed cattle were phenotyped for cardiac morphology with associated production data collected from feedlot processing to harvest at a single feedlot and packing plant in the Pacific Northwest. A sub-population of 5,001 individuals were selected for low-pass genotyping to estimate variance components and genetic correlations between heart score and the production traits observed during the feeding period. At harvest, the incidence of a heart score of 4 or 5 in this population was approximately 4.14%, indicating a significant proportion of feeder cattle are at risk of cardiac mortality before harvest. Heart scores were also significantly and positively correlated with the percentage Angus ancestry observed by genomic breed percentage analysis. The heritability of heart score measured as a binary (scores 1 and 2 = 0, scores 4 and 5 = 1) trait was 0.356 in this population, which indicates development of a selection tool to reduce the risk of congestive heart failure as an EPD (expected progeny difference) is feasible. Genetic correlations of heart score with growth traits and feed intake were moderate and positive (0.289-0.460). Genetic correlations between heart score and backfat and marbling score were -0.120 and -0.108, respectively. Significant genetic correlation to traits of high economic importance in existing selection indexes explain the increased rate of congestive heart failure observed over time. These results indicate potential to implement heart score observed at harvest as a phenotype under selection in genetic evaluation in order to reduce feedlot mortality due to cardiac insufficiency and improve overall cardiopulmonary health in feeder cattle.

3.
BMC Genomics ; 22(1): 197, 2021 Mar 20.
Article in English | MEDLINE | ID: mdl-33743587

ABSTRACT

BACKGROUND: Low pass sequencing has been proposed as a cost-effective alternative to genotyping arrays to identify genetic variants that influence multifactorial traits in humans. For common diseases this typically has required both large sample sizes and comprehensive variant discovery. Genotyping arrays are also routinely used to perform pharmacogenetic (PGx) experiments where sample sizes are likely to be significantly smaller, but clinically relevant effect sizes likely to be larger. RESULTS: To assess how low pass sequencing would compare to array based genotyping for PGx we compared a low-pass assay (in which 1x coverage or less of a target genome is sequenced) along with software for genotype imputation to standard approaches. We sequenced 79 individuals to 1x genome coverage and genotyped the same samples on the Affymetrix Axiom Biobank Precision Medicine Research Array (PMRA). We then down-sampled the sequencing data to 0.8x, 0.6x, and 0.4x coverage, and performed imputation. Both the genotype data and the sequencing data were further used to impute human leukocyte antigen (HLA) genotypes for all samples. We compared the sequencing data and the genotyping array data in terms of four metrics: overall concordance, concordance at single nucleotide polymorphisms in pharmacogenetics-related genes, concordance in imputed HLA genotypes, and imputation r2. Overall concordance between the two assays ranged from 98.2% (for 0.4x coverage sequencing) to 99.2% (for 1x coverage sequencing), with qualitatively similar numbers for the subsets of variants most important in pharmacogenetics. At common single nucleotide polymorphisms (SNPs), the mean imputation r2 from the genotyping array was 0.90, which was comparable to the imputation r2 from 0.4x coverage sequencing, while the mean imputation r2 from 1x sequencing data was 0.96. CONCLUSIONS: These results indicate that low-pass sequencing to a depth above 0.4x coverage attains higher power for association studies when compared to the PMRA and should be considered as a competitive alternative to genotyping arrays for trait mapping in pharmacogenetics.


Subject(s)
Genome-Wide Association Study , Pharmacogenetics , Genotype , Genotyping Techniques , High-Throughput Nucleotide Sequencing , Humans , Polymorphism, Single Nucleotide
4.
Am J Hum Genet ; 108(4): 656-668, 2021 04 01.
Article in English | MEDLINE | ID: mdl-33770507

ABSTRACT

Genetic studies in underrepresented populations identify disproportionate numbers of novel associations. However, most genetic studies use genotyping arrays and sequenced reference panels that best capture variation most common in European ancestry populations. To compare data generation strategies best suited for underrepresented populations, we sequenced the whole genomes of 91 individuals to high coverage as part of the Neuropsychiatric Genetics of African Population-Psychosis (NeuroGAP-Psychosis) study with participants from Ethiopia, Kenya, South Africa, and Uganda. We used a downsampling approach to evaluate the quality of two cost-effective data generation strategies, GWAS arrays versus low-coverage sequencing, by calculating the concordance of imputed variants from these technologies with those from deep whole-genome sequencing data. We show that low-coverage sequencing at a depth of ≥4× captures variants of all frequencies more accurately than all commonly used GWAS arrays investigated and at a comparable cost. Lower depths of sequencing (0.5-1×) performed comparably to commonly used low-density GWAS arrays. Low-coverage sequencing is also sensitive to novel variation; 4× sequencing detects 45% of singletons and 95% of common variants identified in high-coverage African whole genomes. Low-coverage sequencing approaches surmount the problems induced by the ascertainment of common genotyping arrays, effectively identify novel variation particularly in underrepresented populations, and present opportunities to enhance variant discovery at a cost similar to traditional approaches.


Subject(s)
DNA Mutational Analysis/economics , DNA Mutational Analysis/standards , Genetic Variation/genetics , Genetics, Population/economics , Africa , DNA Mutational Analysis/methods , Genetics, Population/methods , Genome, Human/genetics , Genome-Wide Association Study , Health Equity , Humans , Microbiota , Whole Genome Sequencing/economics , Whole Genome Sequencing/standards
5.
Genome Res ; 31(4): 529-537, 2021 04.
Article in English | MEDLINE | ID: mdl-33536225

ABSTRACT

Low-pass sequencing (sequencing a genome to an average depth less than 1× coverage) combined with genotype imputation has been proposed as an alternative to genotyping arrays for trait mapping and calculation of polygenic scores. To empirically assess the relative performance of these technologies for different applications, we performed low-pass sequencing (targeting coverage levels of 0.5× and 1×) and array genotyping (using the Illumina Global Screening Array [GSA]) on 120 DNA samples derived from African- and European-ancestry individuals that are part of the 1000 Genomes Project. We then imputed both the sequencing data and the genotyping array data to the 1000 Genomes Phase 3 haplotype reference panel using a leave-one-out design. We evaluated overall imputation accuracy from these different assays as well as overall power for GWAS from imputed data and computed polygenic risk scores for coronary artery disease and breast cancer using previously derived weights. We conclude that low-pass sequencing plus imputation, in addition to providing a substantial increase in statistical power for genome-wide association studies, provides increased accuracy for polygenic risk prediction at effective coverages of ∼0.5× and higher compared to the Illumina GSA.


Subject(s)
Genome-Wide Association Study , Genotype , High-Throughput Nucleotide Sequencing , Genome, Human , Genome-Wide Association Study/methods , Genome-Wide Association Study/standards , Haplotypes , Humans , Risk Factors
6.
Genes (Basel) ; 11(11)2020 11 05.
Article in English | MEDLINE | ID: mdl-33167493

ABSTRACT

Decreasing costs are making low coverage sequencing with imputation to a comprehensive reference panel an attractive alternative to obtain functional variant genotypes that can increase the accuracy of genomic prediction. To assess the potential of low-pass sequencing, genomic sequence of 77 steers sequenced to >10X coverage was downsampled to 1X and imputed to a reference of 946 cattle representing multiple Bos taurus and Bos indicus-influenced breeds. Genotypes for nearly 60 million variants detected in the reference were imputed from the downsampled sequence. The imputed genotypes strongly agreed with the SNP array genotypes (r¯=0.99) and the genotypes called from the transcript sequence (r¯=0.97). Effects of BovineSNP50 and GGP-F250 variants on birth weight, postweaning gain, and marbling were solved without the steers' phenotypes and genotypes, then applied to their genotypes, to predict the molecular breeding values (MBV). The steers' MBV were similar when using imputed and array genotypes. Replacing array variants with functional sequence variants might allow more robust MBV. Imputation from low coverage sequence offers a viable, low-cost approach to obtain functional variant genotypes that could improve genomic prediction.


Subject(s)
Animal Husbandry/methods , Cattle/genetics , Sequence Analysis, DNA/methods , Animals , Breeding/methods , Genomics/methods , Genotype , Male , Phenotype , Polymorphism, Single Nucleotide/genetics , Red Meat , United States
7.
Genetics ; 208(4): 1565-1584, 2018 04.
Article in English | MEDLINE | ID: mdl-29348143

ABSTRACT

An open question in human evolution is the importance of polygenic adaptation: adaptive changes in the mean of a multifactorial trait due to shifts in allele frequencies across many loci. In recent years, several methods have been developed to detect polygenic adaptation using loci identified in genome-wide association studies (GWAS). Though powerful, these methods suffer from limited interpretability: they can detect which sets of populations have evidence for polygenic adaptation, but are unable to reveal where in the history of multiple populations these processes occurred. To address this, we created a method to detect polygenic adaptation in an admixture graph, which is a representation of the historical divergences and admixture events relating different populations through time. We developed a Markov chain Monte Carlo (MCMC) algorithm to infer branch-specific parameters reflecting the strength of selection in each branch of a graph. Additionally, we developed a set of summary statistics that are fast to compute and can indicate which branches are most likely to have experienced polygenic adaptation. We show via simulations that this method-which we call PolyGraph-has good power to detect polygenic adaptation, and applied it to human population genomic data from around the world. We also provide evidence that variants associated with several traits, including height, educational attainment, and self-reported unibrow, have been influenced by polygenic adaptation in different populations during human evolution.


Subject(s)
Adaptation, Biological/genetics , Models, Genetic , Multifactorial Inheritance , Algorithms , Computer Simulation , Genetics, Population , Genome, Human , Genome-Wide Association Study , Genomics/methods , Humans , Markov Chains , Polymorphism, Single Nucleotide , Selection, Genetic
8.
PLoS Biol ; 15(9): e2002458, 2017 Sep.
Article in English | MEDLINE | ID: mdl-28873088

ABSTRACT

A number of open questions in human evolutionary genetics would become tractable if we were able to directly measure evolutionary fitness. As a step towards this goal, we developed a method to examine whether individual genetic variants, or sets of genetic variants, currently influence viability. The approach consists in testing whether the frequency of an allele varies across ages, accounting for variation in ancestry. We applied it to the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort and to the parents of participants in the UK Biobank. Across the genome, we found only a few common variants with large effects on age-specific mortality: tagging the APOE ε4 allele and near CHRNA3. These results suggest that when large, even late-onset effects are kept at low frequency by purifying selection. Testing viability effects of sets of genetic variants that jointly influence 1 of 42 traits, we detected a number of strong signals. In participants of the UK Biobank of British ancestry, we found that variants that delay puberty timing are associated with a longer parental life span (P~6.2 × 10-6 for fathers and P~2.0 × 10-3 for mothers), consistent with epidemiological studies. Similarly, variants associated with later age at first birth are associated with a longer maternal life span (P~1.4 × 10-3). Signals are also observed for variants influencing cholesterol levels, risk of coronary artery disease (CAD), body mass index, as well as risk of asthma. These signals exhibit consistent effects in the GERA cohort and among participants of the UK Biobank of non-British ancestry. We also found marked differences between males and females, most notably at the CHRNA3 locus, and variants associated with risk of CAD and cholesterol levels. Beyond our findings, the analysis serves as a proof of principle for how upcoming biomedical data sets can be used to learn about selection effects in contemporary humans.


Subject(s)
Evolution, Molecular , Genetic Fitness , Genetics, Population/methods , Models, Genetic , Selection, Genetic , Cohort Studies , Female , Gene Frequency , Genetic Variation , Humans , Male
9.
Nat Commun ; 8(1): 266, 2017 08 16.
Article in English | MEDLINE | ID: mdl-28814792

ABSTRACT

The immune system plays a major role in human health and disease, and understanding genetic causes of interindividual variability of immune responses is vital. Here, we isolate monocytes from 134 genotyped individuals, stimulate these cells with three defined microbe-associated molecular patterns (LPS, MDP, and 5'-ppp-dsRNA), and profile the transcriptomes at three time points. Mapping expression quantitative trait loci (eQTL), we identify 417 response eQTLs (reQTLs) with varying effects between conditions. We characterize the dynamics of genetic regulation on early and late immune response and observe an enrichment of reQTLs in distal cis-regulatory elements. In addition, reQTLs are enriched for recent positive selection with an evolutionary trend towards enhanced immune response. Finally, we uncover reQTL effects in multiple GWAS loci and show a stronger enrichment for response than constant eQTLs in GWAS signals of several autoimmune diseases. This demonstrates the importance of infectious stimuli in modifying genetic predisposition to disease.Insight into the genetic influence on the immune response is important for the understanding of interindividual variability in human pathologies. Here, the authors generate transcriptome data from human blood monocytes stimulated with various immune stimuli and provide a time-resolved response eQTL map.


Subject(s)
Acetylmuramyl-Alanyl-Isoglutamine/pharmacology , Adjuvants, Immunologic/pharmacology , Autoimmune Diseases/genetics , Gene Expression/drug effects , Lipopolysaccharides/pharmacology , Monocytes/drug effects , RNA, Double-Stranded/pharmacology , RNA, Messenger/drug effects , Adolescent , Adult , Gene Expression/genetics , Gene Expression/immunology , Gene Expression Profiling , Gene Expression Regulation , Genetic Predisposition to Disease , Healthy Volunteers , Humans , Indicators and Reagents , Lipids , Male , Monocytes/immunology , Monocytes/metabolism , Quantitative Trait Loci , RNA, Messenger/metabolism , Regulatory Sequences, Nucleic Acid , Young Adult
10.
Nat Genet ; 49(3): 325-331, 2017 Mar.
Article in English | MEDLINE | ID: mdl-28092683

ABSTRACT

Collecting cases for case-control genetic association studies can be time-consuming and expensive. In some situations (such as studies of late-onset or rapidly lethal diseases), it may be more practical to identify family members of cases. In randomly ascertained cohorts, replacing cases with their first-degree relatives enables studies of diseases that are absent (or nearly absent) in the cohort. We refer to this approach as genome-wide association study by proxy (GWAX) and apply it to 12 common diseases in 116,196 individuals from the UK Biobank. Meta-analysis with published genome-wide association study summary statistics replicated established risk loci and yielded four newly associated loci for Alzheimer's disease, eight for coronary artery disease and five for type 2 diabetes. In addition to informing disease biology, our results demonstrate the utility of association mapping without directly observing cases. We anticipate that GWAX will prove useful in future genetic studies of complex traits in large population cohorts.


Subject(s)
Alzheimer Disease/genetics , Coronary Artery Disease/genetics , Diabetes Mellitus, Type 2/genetics , Genetic Predisposition to Disease/genetics , Case-Control Studies , Genome, Human/genetics , Genome-Wide Association Study/methods , Humans , Risk
12.
Nature ; 533(7604): 539-42, 2016 05 26.
Article in English | MEDLINE | ID: mdl-27225129

ABSTRACT

Educational attainment is strongly influenced by social and other environmental factors, but genetic factors are estimated to account for at least 20% of the variation across individuals. Here we report the results of a genome-wide association study (GWAS) for educational attainment that extends our earlier discovery sample of 101,069 individuals to 293,723 individuals, and a replication study in an independent sample of 111,349 individuals from the UK Biobank. We identify 74 genome-wide significant loci associated with the number of years of schooling completed. Single-nucleotide polymorphisms associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioural phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because educational attainment is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric diseases.


Subject(s)
Brain/metabolism , Educational Status , Fetus/metabolism , Gene Expression Regulation/genetics , Genome-Wide Association Study , Polymorphism, Single Nucleotide/genetics , Alzheimer Disease/genetics , Bipolar Disorder/genetics , Cognition , Computational Biology , Gene-Environment Interaction , Humans , Molecular Sequence Annotation , Schizophrenia/genetics , United Kingdom
13.
Nat Genet ; 48(7): 709-17, 2016 07.
Article in English | MEDLINE | ID: mdl-27182965

ABSTRACT

We performed a scan for genetic variants associated with multiple phenotypes by comparing large genome-wide association studies (GWAS) of 42 traits or diseases. We identified 341 loci (at a false discovery rate of 10%) associated with multiple traits. Several loci are associated with multiple phenotypes; for example, a nonsynonymous variant in the zinc transporter SLC39A8 influences seven of the traits, including risk of schizophrenia (rs13107325: log-transformed odds ratio (log OR) = 0.15, P = 2 × 10(-12)) and Parkinson disease (log OR = -0.15, P = 1.6 × 10(-7)), among others. Second, we used these loci to identify traits that have multiple genetic causes in common. For example, variants associated with increased risk of schizophrenia also tended to be associated with increased risk of inflammatory bowel disease. Finally, we developed a method to identify pairs of traits that show evidence of a causal relationship. For example, we show evidence that increased body mass index causally increases triglyceride levels.


Subject(s)
Genetic Pleiotropy/genetics , Genetic Predisposition to Disease , Inflammatory Bowel Diseases/genetics , Multifactorial Inheritance/genetics , Parkinson Disease/genetics , Polymorphism, Single Nucleotide/genetics , Schizophrenia/genetics , Body Mass Index , Genome-Wide Association Study , Humans , Phenotype , Triglycerides/metabolism
14.
Bioinformatics ; 32(2): 283-5, 2016 Jan 15.
Article in English | MEDLINE | ID: mdl-26395773

ABSTRACT

UNLABELLED: We present a method to identify approximately independent blocks of linkage disequilibrium in the human genome. These blocks enable automated analysis of multiple genome-wide association studies. AVAILABILITY AND IMPLEMENTATION: code: http://bitbucket.org/nygcresearch/ldetect; data: http://bitbucket.org/nygcresearch/ldetect-data. CONTACT: tberisa@nygenome.org SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Chromosome Mapping/methods , Genome, Human , Genome-Wide Association Study , Software , Algorithms , Genetic Markers , Humans , Linkage Disequilibrium
15.
Trends Genet ; 30(9): 377-89, 2014 Sep.
Article in English | MEDLINE | ID: mdl-25168683

ABSTRACT

Genetic information contains a record of the history of our species, and technological advances have transformed our ability to access this record. Many studies have used genome-wide data from populations today to learn about the peopling of the globe and subsequent adaptation to local conditions. Implicit in this research is the assumption that the geographic locations of people today are informative about the geographic locations of their ancestors in the distant past. However, it is now clear that long-range migration, admixture, and population replacement subsequent to the initial out-of-Africa expansion have altered the genetic structure of most of the world's human populations. In light of this we argue that it is time to critically reevaluate current models of the peopling of the globe, as well as the importance of natural selection in determining the geographic distribution of phenotypes. We specifically highlight the transformative potential of ancient DNA. By accessing the genetic make-up of populations living at archaeologically known times and places, ancient DNA makes it possible to directly track migrations and responses to natural selection.


Subject(s)
DNA/genetics , DNA/history , Genetics, Population , Genome, Human , Geography , Selection, Genetic/genetics , Africa , Evolution, Molecular , History, Ancient , Humans , Phenotype
16.
Proc Biol Sci ; 281(1789): 20140930, 2014 Aug 22.
Article in English | MEDLINE | ID: mdl-24990677

ABSTRACT

While gene flow between distantly related populations is increasingly recognized as a potentially important source of adaptive genetic variation for humans, fully characterized examples are rare. In addition, the role that natural selection for resistance to vivax malaria may have played in the extreme distribution of the protective Duffy-null allele, which is nearly completely fixed in mainland sub-Saharan Africa and absent elsewhere, is controversial. We address both these issues by investigating the evolution of the Duffy-null allele in the Malagasy, a recently admixed population with major ancestry components from both East Asia and mainland sub-Saharan Africa. We used genome-wide genetic data and extensive computer simulations to show that the high frequency of the Duffy-null allele in Madagascar can only be explained in the absence of positive natural selection under extreme demographic scenarios involving high genetic drift. However, the observed genomic single nucleotide polymorphism diversity in the Malagasy is incompatible with such extreme demographic scenarios, indicating that positive selection for the Duffy-null allele best explains the high frequency of the allele in Madagascar. We estimate the selection coefficient to be 0.066. Because vivax malaria is endemic to Madagascar, this result supports the hypothesis that malaria resistance drove fixation of the Duffy-null allele in mainland sub-Saharan Africa.


Subject(s)
Duffy Blood-Group System/genetics , Gene Frequency , Receptors, Cell Surface/genetics , Selection, Genetic , Africa South of the Sahara , Asian People/genetics , Black People/genetics , Computer Simulation , Genetic Drift , Genetics, Population , Humans , Madagascar , Models, Genetic , Polymorphism, Single Nucleotide
17.
Am J Hum Genet ; 94(4): 559-73, 2014 Apr 03.
Article in English | MEDLINE | ID: mdl-24702953

ABSTRACT

Annotations of gene structures and regulatory elements can inform genome-wide association studies (GWASs). However, choosing the relevant annotations for interpreting an association study of a given trait remains challenging. I describe a statistical model that uses association statistics computed across the genome to identify classes of genomic elements that are enriched with or depleted of loci influencing a trait. The model naturally incorporates multiple types of annotations. I applied the model to GWASs of 18 human traits, including red blood cell traits, platelet traits, glucose levels, lipid levels, height, body mass index, and Crohn disease. For each trait, I used the model to evaluate the relevance of 450 different genomic annotations, including protein-coding genes, enhancers, and DNase-I hypersensitive sites in over 100 tissues and cell lines. The fraction of phenotype-associated SNPs influencing protein sequence ranged from around 2% (for platelet volume) up to around 20% (for low-density lipoprotein cholesterol), repressed chromatin was significantly depleted for SNPs associated with several traits, and cell-type-specific DNase-I hypersensitive sites were enriched with SNPs associated with several traits (for example, the spleen in platelet volume). Finally, reweighting each GWAS by using information from functional genomics increased the number of loci with high-confidence associations by around 5%.


Subject(s)
Genome-Wide Association Study , Models, Genetic , Bayes Theorem , Humans , Phenotype , Polymorphism, Single Nucleotide
18.
Proc Natl Acad Sci U S A ; 111(7): 2632-7, 2014 Feb 18.
Article in English | MEDLINE | ID: mdl-24550290

ABSTRACT

The history of southern Africa involved interactions between indigenous hunter-gatherers and a range of populations that moved into the region. Here we use genome-wide genetic data to show that there are at least two admixture events in the history of Khoisan populations (southern African hunter-gatherers and pastoralists who speak non-Bantu languages with click consonants). One involved populations related to Niger-Congo-speaking African populations, and the other introduced ancestry most closely related to west Eurasian (European or Middle Eastern) populations. We date this latter admixture event to ∼900-1,800 y ago and show that it had the largest demographic impact in Khoisan populations that speak Khoe-Kwadi languages. A similar signal of west Eurasian ancestry is present throughout eastern Africa. In particular, we also find evidence for two admixture events in the history of Kenyan, Tanzanian, and Ethiopian populations, the earlier of which involved populations related to west Eurasians and which we date to ∼2,700-3,300 y ago. We reconstruct the allele frequencies of the putative west Eurasian population in eastern Africa and show that this population is a good proxy for the west Eurasian ancestry in southern Africa. The most parsimonious explanation for these findings is that west Eurasian ancestry entered southern Africa indirectly through eastern Africa.


Subject(s)
Demography , Emigration and Immigration , Ethnicity/genetics , Genetics, Population/methods , White People/genetics , Africa, Eastern , Africa, Southern , Computer Simulation , Europe/ethnology , Gene Flow , Gene Frequency , Genotype , Humans , Linkage Disequilibrium , Models, Genetic
19.
Science ; 342(6155): 257-61, 2013 Oct 11.
Article in English | MEDLINE | ID: mdl-24115443

ABSTRACT

The processes that shaped modern European mitochondrial DNA (mtDNA) variation remain unclear. The initial peopling by Palaeolithic hunter-gatherers ~42,000 years ago and the immigration of Neolithic farmers into Europe ~8000 years ago appear to have played important roles but do not explain present-day mtDNA diversity. We generated mtDNA profiles of 364 individuals from prehistoric cultures in Central Europe to perform a chronological study, spanning the Early Neolithic to the Early Bronze Age (5500 to 1550 calibrated years before the common era). We used this transect through time to identify four marked shifts in genetic composition during the Neolithic period, revealing a key role for Late Neolithic cultures in shaping modern Central European genetic diversity.


Subject(s)
DNA, Mitochondrial/genetics , Genetic Drift , Genetic Variation , Population/genetics , Agriculture/history , Base Sequence , DNA, Mitochondrial/history , Europe , History, Ancient , Humans , Molecular Sequence Data , Transients and Migrants
20.
Genetics ; 193(4): 1233-54, 2013 Apr.
Article in English | MEDLINE | ID: mdl-23410830

ABSTRACT

Long-range migrations and the resulting admixtures between populations have been important forces shaping human genetic diversity. Most existing methods for detecting and reconstructing historical admixture events are based on allele frequency divergences or patterns of ancestry segments in chromosomes of admixed individuals. An emerging new approach harnesses the exponential decay of admixture-induced linkage disequilibrium (LD) as a function of genetic distance. Here, we comprehensively develop LD-based inference into a versatile tool for investigating admixture. We present a new weighted LD statistic that can be used to infer mixture proportions as well as dates with fewer constraints on reference populations than previous methods. We define an LD-based three-population test for admixture and identify scenarios in which it can detect admixture events that previous formal tests cannot. We further show that we can uncover phylogenetic relationships among populations by comparing weighted LD curves obtained using a suite of references. Finally, we describe several improvements to the computation and fitting of weighted LD curves that greatly increase the robustness and speed of the calculations. We implement all of these advances in a software package, ALDER, which we validate in simulations and apply to test for admixture among all populations from the Human Genome Diversity Project (HGDP), highlighting insights into the admixture history of Central African Pygmies, Sardinians, and Japanese.


Subject(s)
Linkage Disequilibrium , Population/genetics , Software , Africa, Central , Human Migration , Humans , Italy , Japan , Models, Genetic , Phylogeny , Population Groups/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...