Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 27
Filter
1.
Proc Biol Sci ; 286(1896): 20182541, 2019 02 13.
Article in English | MEDLINE | ID: mdl-30963935

ABSTRACT

A fundamental question about adaptation in a population is the time of onset of the selective pressure acting on beneficial alleles. Inferring this time, in turn, depends on the selection model. We develop a framework of approximate Bayesian computation (ABC) that enables the use of the full site frequency spectrum and haplotype structure to test the goodness-of-fit of selection models and estimate the timing of selection under varying population size scenarios. We show that our method has sufficient power to distinguish natural selection from neutrality even if relatively old selection increased the frequency of a pre-existing allele from 20% to 50% or from 40% to 80%. Our ABC can accurately estimate the time of onset of selection on a new mutation. However, estimates are prone to bias under the standing variation model, possibly due to the uncertainty in the allele frequency at the onset of selection. We further extend our approach to take advantage of ancient DNA data that provides information on the allele frequency path of the beneficial allele. Applying our ABC, including both modern and ancient human DNA data, to four pigmentation alleles in Europeans, we detected selection on standing variants that occurred after the dispersal from Africa even though models of selection on a new mutation were initially supported for two of these alleles without the ancient data.


Subject(s)
DNA, Ancient/analysis , Gene Frequency , Haplotypes/genetics , Human Migration , Selection, Genetic , Skin Pigmentation/genetics , Bayes Theorem , Europe , Humans , Models, Genetic , Population Density , Time Factors
2.
PLoS Biol ; 16(3): e2002985, 2018 03.
Article in English | MEDLINE | ID: mdl-29547617

ABSTRACT

Human genome-wide association studies (GWASs) are revealing the genetic architecture of anthropomorphic and biomedical traits, i.e., the frequencies and effect sizes of variants that contribute to heritable variation in a trait. To interpret these findings, we need to understand how genetic architecture is shaped by basic population genetics processes-notably, by mutation, natural selection, and genetic drift. Because many quantitative traits are subject to stabilizing selection and because genetic variation that affects one trait often affects many others, we model the genetic architecture of a focal trait that arises under stabilizing selection in a multidimensional trait space. We solve the model for the phenotypic distribution and allelic dynamics at steady state and derive robust, closed-form solutions for summary statistics of the genetic architecture. Our results provide a simple interpretation for missing heritability and why it varies among traits. They predict that the distribution of variances contributed by loci identified in GWASs is well approximated by a simple functional form that depends on a single parameter: the expected contribution to genetic variance of a strongly selected site affecting the trait. We test this prediction against the results of GWASs for height and body mass index (BMI) and find that it fits the data well, allowing us to make inferences about the degree of pleiotropy and mutational target size for these traits. Our findings help to explain why the GWAS for height explains more of the heritable variance than the similarly sized GWAS for BMI and to predict the increase in explained heritability with study sample size. Considering the demographic history of European populations, in which these GWASs were performed, we further find that most of the associations they identified likely involve mutations that arose shortly before or during the Out-of-Africa bottleneck at sites with selection coefficients around s = 10-3.


Subject(s)
Body Height/genetics , Body Mass Index , Genome-Wide Association Study , Models, Genetic , Quantitative Trait Loci , Genetic Drift , Genetic Variation , Genetics, Population , Humans , Phenotype , Selection, Genetic
3.
Mol Biol Evol ; 33(3): 657-69, 2016 Mar.
Article in English | MEDLINE | ID: mdl-26545921

ABSTRACT

Genetic variation harbors signatures of natural selection driven by selective pressures that are often unknown. Estimating the ages of selection signals may allow reconstructing the history of environmental changes that shaped human phenotypes and diseases. We have developed an approximate Bayesian computation (ABC) approach to estimate allele ages under a model of selection on new mutations and under demographic models appropriate for human populations. We have applied it to two resequencing data sets: An ultra-high depth data set from a relatively small sample of unrelated individuals and a lower depth data set in a larger sample with transmission information. In addition to evaluating the accuracy of our method based on simulations, for each SNP, we assessed the consistency between the posterior probabilities estimated by the ABC approach and the ancient DNA record, finding good agreement between the two types of data and methods. Applying this ABC approach to data for eight single nucleotide polymorphisms (SNPs), we were able to rule out an onset of selection prior to the dispersal out-of-Africa for three of them and more recent than the spread of agriculture for an additional three SNPs.


Subject(s)
Genetics, Population , Models, Genetic , Selection, Genetic , Alleles , Bayes Theorem , Computational Biology/methods , Computer Simulation , Evolution, Molecular , Gene Frequency , Genetic Variation , Humans , Polymorphism, Single Nucleotide , Sequence Analysis, DNA
4.
Proc Natl Acad Sci U S A ; 112(47): E6496-505, 2015 Nov 24.
Article in English | MEDLINE | ID: mdl-26561581

ABSTRACT

The prevailing view that the evolution of cells in a tumor is driven by Darwinian selection has never been rigorously tested. Because selection greatly affects the level of intratumor genetic diversity, it is important to assess whether intratumor evolution follows the Darwinian or the non-Darwinian mode of evolution. To provide the statistical power, many regions in a single tumor need to be sampled and analyzed much more extensively than has been attempted in previous intratumor studies. Here, from a hepatocellular carcinoma (HCC) tumor, we evaluated multiregional samples from the tumor, using either whole-exome sequencing (WES) (n = 23 samples) or genotyping (n = 286) under both the infinite-site and infinite-allele models of population genetics. In addition to the many single-nucleotide variations (SNVs) present in all samples, there were 35 "polymorphic" SNVs among samples. High genetic diversity was evident as the 23 WES samples defined 20 unique cell clones. With all 286 samples genotyped, clonal diversity agreed well with the non-Darwinian model with no evidence of positive Darwinian selection. Under the non-Darwinian model, MALL (the number of coding region mutations in the entire tumor) was estimated to be greater than 100 million in this tumor. DNA sequences reveal local diversities in small patches of cells and validate the estimation. In contrast, the genetic diversity under a Darwinian model would generally be orders of magnitude smaller. Because the level of genetic diversity will have implications on therapeutic resistance, non-Darwinian evolution should be heeded in cancer treatments even for microscopic tumors.


Subject(s)
Biological Evolution , Genetic Variation , Neoplasms/genetics , Neoplasms/pathology , Selection, Genetic , Aged , Base Sequence , Cell Count , Cell Line, Tumor , Clone Cells , Computer Simulation , Gene Library , Genes, Neoplasm , Genotype , Humans , Male , Microdissection , Models, Genetic , Molecular Sequence Data , Mutation , Mutation Rate , Polymorphism, Single Nucleotide/genetics , Reproducibility of Results , Sequence Analysis, DNA
5.
PLoS One ; 10(7): e0118087, 2015.
Article in English | MEDLINE | ID: mdl-26197064

ABSTRACT

The sample frequency spectrum is an informative and frequently employed approach for summarizing DNA variation data. Under the standard neutral model the expectation of the sample frequency spectrum has been derived by at least two distinct approaches. One relies on using results from diffusion approximations to the Wright-Fisher Model. The other is based on Pólya urn models that correspond to the standard coalescent model. A new proof of the expected frequency spectrum is presented here. It is a proof by induction and does not require diffusion results and does not require the somewhat complex sums and combinatorics of the derivations based on urn models.


Subject(s)
Genetic Variation , Algorithms , Models, Genetic , Sequence Analysis, DNA/methods
6.
Nature ; 512(7515): 436-440, 2014 Aug 28.
Article in English | MEDLINE | ID: mdl-25043057

ABSTRACT

Plant resistance (R) genes are a crucial component in plant defence against pathogens. Although R genes often fail to provide durable resistance in an agricultural context, they frequently persist as long-lived balanced polymorphisms in nature. Standard theory explains the maintenance of such polymorphisms through a balance of the costs and benefits of resistance and virulence in a tightly coevolving host-pathogen pair. However, many plant-pathogen interactions lack such specificity. Whether, and how, balanced polymorphisms are maintained in diffusely interacting species is unknown. Here we identify a naturally interacting R gene and effector pair in Arabidopsis thaliana and its facultative plant pathogen, Pseudomonas syringae. The protein encoded by the R gene RPS5 recognizes an AvrPphB homologue (AvrPphB2) and exhibits a balanced polymorphism that has been maintained for over 2 million years (ref. 3). Consistent with the presence of an ancient balanced polymorphism, the R gene confers a benefit when plants are infected with P. syringae carrying avrPphB2 but also incurs a large cost in the absence of infection. RPS5 alleles are maintained at intermediate frequencies in populations globally, suggesting ubiquitous selection for resistance. However, the presence of P. syringae carrying avrPphB is probably insufficient to explain the RPS5 polymorphism. First, avrPphB homologues occur at very low frequencies in P. syringae populations on A. thaliana. Second, AvrPphB only rarely confers a virulence benefit to P. syringae on A. thaliana. Instead, we find evidence that selection for RPS5 involves multiple non-homologous effectors and multiple pathogen species. These results and an associated model suggest that the R gene polymorphism in A. thaliana may not be maintained through a tightly coupled interaction involving a single coevolved R gene and effector pair. More likely, the stable polymorphism is maintained through complex and diffuse community-wide interactions.


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis/genetics , Arabidopsis/microbiology , Evolution, Molecular , Host-Pathogen Interactions/genetics , Polymorphism, Genetic , Pseudomonas syringae/genetics , Selection, Genetic/genetics , Alleles , Arabidopsis Proteins/metabolism , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Genes, Plant/genetics , Models, Genetic , Plant Immunity/genetics , Pseudomonas syringae/pathogenicity , Virulence/genetics
7.
BMC Evol Biol ; 13: 9, 2013 Jan 12.
Article in English | MEDLINE | ID: mdl-23311980

ABSTRACT

BACKGROUND: The enzyme phosphoenolpyruvate carboxykinase, PEPCK, occurs in its guanosine-nucleotide-using form in animals and a few prokaryotes. We study its natural genetic variation in Colias (Lepidoptera, Pieridae). PEPCK offers a route, alternative to pyruvate kinase, for carbon skeletons to move between cytosolic glycolysis and mitochondrial Krebs cycle reactions. RESULTS: PEPCK is expressed in both cytosol and mitochondrion, but differently in diverse animal clades. In vertebrates and independently in Drosophila, compartment-specific paralogous genes occur. In a contrasting expression strategy, compartment-specific PEPCKs of Colias and of the silkmoth, Bombyx, differ only in their first, 5', exons; these are alternatively spliced onto a common series of following exons. In two Colias species from distinct clades, PEPCK sequence is highly variable at nonsynonymous and synonymous sites, mainly in its common exons. Three major amino acid polymorphisms, Gly 335 ↔ Ser, Asp 503 ↔ Glu, and Ile 629 ↔ Val occur in both species, and in the first two cases are similar in frequency between species. Homology-based structural modelling shows that the variants can alter hydrogen bonding, salt bridging, or van der Waals interactions of amino acid side chains, locally or at one another's sites which are distant in PEPCK's structure, and thus may affect its enzyme function. We ask, using coalescent simulations, if these polymorphisms' cross-species similarities are compatible with neutral evolution by genetic drift, but find the probability of this null hypothesis is 0.001 ≤ P ≤ 0.006 under differing scenarios. CONCLUSION: Our results make the null hypothesis of neutrality of these PEPCK polymorphisms quite unlikely, but support an alternative hypothesis that they are maintained by natural selection in parallel in the two species. This alternative can now be justifiably tested further via studies of PEPCK genotypes' effects on function, organismal performance, and fitness. This case emphasizes the importance, for evolutionary insight, of studying gene-specific mechanisms affected by natural genetic variation as an essential complement to surveys of such variation.


Subject(s)
Butterflies/genetics , Evolution, Molecular , Phosphoenolpyruvate Carboxykinase (ATP)/genetics , Polymorphism, Genetic , Amino Acid Sequence , Animals , Butterflies/enzymology , Citric Acid Cycle , Cytosol/enzymology , Genes, Insect , Glycolysis , Likelihood Functions , Mitochondria/enzymology , Models, Genetic , Molecular Sequence Data , Sequence Analysis, DNA , Species Specificity
8.
Genetics ; 191(1): 215-32, 2012 May.
Article in English | MEDLINE | ID: mdl-22377632

ABSTRACT

Children of a heterozygous parent are expected to carry either allele with equal probability. Exceptions can occur, however, due to meiotic drive, competition among gametes, or viability selection, which we collectively term "transmission distortion" (TD). Although there are several well-characterized examples of these phenomena, their existence in humans remains unknown. We therefore performed a genome-wide scan for TD by applying the transmission disequilibrium test (TDT) genome-wide to three large sets of human pedigrees of European descent: the Framingham Heart Study (FHS), a founder population of European origin (HUTT), and a subset of the Autism Genetic Resource Exchange (AGRE). Genotyping error is an important confounder in this type of analysis. In FHS and HUTT, despite extensive quality control, we did not find sufficient evidence to exclude genotyping error in the strongest signals. In AGRE, however, many signals extended across multiple SNPs, a pattern highly unlikely to arise from genotyping error. We identified several candidate regions in this data set, notably a locus in 10q26.13 displaying a genome-wide significant TDT in combined female and male transmissions and a signature of recent positive selection, as well as a paternal TD signal in 6p21.1, the same region in which a significant TD signal was previously observed in 30 European males. Neither region replicated in FHS, however, and the paternal signal was not visible in sperm competition assays or as allelic imbalance in sperm. In maternal transmissions, we detected no strong signals near centromeres or telomeres, the regions predicted to be most susceptible to female-specific meiotic drive, but we found a significant enrichment of top signals among genes involved in cell junctions. These results illustrate both the potential benefits and the challenges of using the TDT to study transmission distortion and provide candidates for investigation in future studies.


Subject(s)
Alleles , Genomics , Genotype , Inheritance Patterns/genetics , Pedigree , Autistic Disorder/genetics , Centromere/genetics , Cohort Studies , Fathers , Female , Genotyping Techniques , Humans , Male , Meiosis/genetics , Sperm Motility/genetics , Telomere/genetics , White People/genetics
9.
PLoS Genet ; 7(6): e1002100, 2011 Jun.
Article in English | MEDLINE | ID: mdl-21695282

ABSTRACT

Genealogical patterns in different genomic regions may be different due to the joint influence of gene flow and selection. The existence of two subspecies of cultivated rice provides a unique opportunity for analyzing these effects during domestication. We chose 66 accessions from the three rice taxa (about 22 each from Oryza sativa indica, O. sativa japonica, and O. rufipogon) for whole-genome sequencing. In the search for the signature of selection, we focus on low diversity regions (LDRs) shared by both cultivars. We found that the genealogical histories of these overlapping LDRs are distinct from the genomic background. While indica and japonica genomes generally appear to be of independent origin, many overlapping LDRs may have originated only once, as a result of selection and subsequent introgression. Interestingly, many such LDRs contain only one candidate gene of rice domestication, and several known domestication genes have indeed been "rediscovered" by this approach. In summary, we identified 13 additional candidate genes of domestication.


Subject(s)
Evolution, Molecular , Genome, Plant , Oryza/genetics , Crops, Agricultural/classification , Crops, Agricultural/genetics , Gene Flow/genetics , Genetic Variation , Oryza/classification
10.
Genome Res ; 21(7): 1087-98, 2011 Jul.
Article in English | MEDLINE | ID: mdl-21628451

ABSTRACT

Second-generation sequencing technologies allow surveys of sequence variation on an unprecedented scale. However, despite the rapid decrease in sequencing costs, collecting whole-genome sequence data on a population scale is still prohibitive for many laboratories. We have implemented an inexpensive, reduced representation protocol for preparing resequencing targets, and we have developed the analytical tools necessary for making population genetic inferences. This approach can be applied to any species for which a draft or complete reference genome sequence is available. The new tools we have developed include methods for aligning reads, calling genotypes, and incorporating sample-specific sequencing error rates in the estimate of evolutionary parameters. When applied to 19 individuals from a total of 18 human populations, our approach allowed sampling regions that are largely overlapping across individuals and that are representative of the entire genome. The resequencing data were used to test the serial founder model of human dispersal and to estimate the time of the Out of Africa migration. Our results also represent the first attempt to provide a time frame for the colonization of Australia based on large-scale resequencing data.


Subject(s)
Biological Evolution , Genetics, Population , Genome, Human , Sequence Analysis, DNA/methods , Africa , Australia , Databases, Genetic , Female , Gene Frequency , Genetic Variation , Genotype , Humans , Male , Models, Biological , Polymorphism, Single Nucleotide , Sequence Alignment
11.
PLoS One ; 6(5): e20234, 2011.
Article in English | MEDLINE | ID: mdl-21633511

ABSTRACT

HERV-K113 and HERV-K115 have been considered to be among the youngest HERVs because they are the only known full-length proviruses that are insertionally polymorphic and maintain the open reading frames of their coding genes. However, recent data suggest that HERV-K113 is at least 800,000 years old, and HERV-K115 even older. A systematic study of HERV-K HML2 members to identify HERVs that may have infected the human genome in the more recent evolutionary past is lacking. Therefore, we sought to determine how recently HERVs were exogenous and infectious by examining sequence variation in the long terminal repeat (LTR) regions of all full-length HERV-K loci. We used the traditional method of inter-LTR comparison to analyze all full length HERV-Ks and determined that two insertions, HERV-K106 and HERV-K116 have no differences between their 5' and 3' LTR sequences, suggesting that these insertions were endogenized in the recent evolutionary past. Among these insertions with no sequence differences between their LTR regions, HERV-K106 had the most intact viral sequence structure. Coalescent analysis of HERV-K106 3' LTR sequences representing 51 ethnically diverse individuals suggests that HERV-K106 integrated into the human germ line approximately 150,000 years ago, after the emergence of anatomically modern humans.


Subject(s)
Endogenous Retroviruses/genetics , Genome, Human/genetics , Genome, Viral/genetics , Proviruses/genetics , Base Sequence , Binding Sites/genetics , Endogenous Retroviruses/classification , Evolution, Molecular , Humans , Molecular Sequence Data , Mutagenesis, Insertional , Phylogeny , Proviruses/classification , Terminal Repeat Sequences/genetics , Time Factors
12.
J Mol Evol ; 64(6): 702-5, 2007 Jun.
Article in English | MEDLINE | ID: mdl-17487522

ABSTRACT

A method for estimating the variance of a simple estimate of the time of the most recent common ancestor of a sample of sequences is described.


Subject(s)
Chromosomes, Human, Y/genetics , Evolution, Molecular , Phylogeny , Base Sequence , Humans , Time Factors
13.
Proc Natl Acad Sci U S A ; 104(7): 2271-6, 2007 Feb 13.
Article in English | MEDLINE | ID: mdl-17284599

ABSTRACT

Determining the extent of adaptive evolution at the genomic level is central to our understanding of molecular evolution. A suitable observation for this purpose would consist of polymorphic data on a large and unbiased collection of genes from two closely related species, each having a large and stable population. In this study, we sequenced 419 genes from 24 lines of Drosophila melanogaster and its close relatives. Together with data from Drosophila simulans, these data reveal the following. (i) Approximately 10% of the loci in regions of normal recombination are much less polymorphic at silent sites than expected, hinting at the action of selective sweeps. (ii) The level of polymorphism is negatively correlated with the rate of nonsynonymous divergence across loci. Thus, even under strict neutrality, the ratio of amino acid to silent nucleotide changes (A:S) between Drosophila species is expected to be 25-40% higher than the A:S ratio for polymorphism when data are pooled across the genome. (iii) The observed A/S ratio between species among the 419 loci is 28.9% higher than the (adjusted) neutral expectation. We estimate that nearly 30% of the amino acid substitutions between D. melanogaster and its close relatives were adaptive. (iv) This signature of adaptive evolution is observable only in regions of normal recombination. Hence, the low level of polymorphism observed in regions of reduced recombination may not be driven primarily by positive selection. Finally, we discuss the theories and data pertaining to the interpretation of adaptive evolution in genomic studies.


Subject(s)
Adaptation, Biological/genetics , Drosophila melanogaster/genetics , Evolution, Molecular , Genome, Insect/genetics , Amino Acid Substitution , Animals , Molecular Sequence Data , Polymorphism, Genetic , Recombination, Genetic
14.
Genet Res ; 89(5-6): 427-32, 2007 Dec.
Article in English | MEDLINE | ID: mdl-18976531
15.
Proc Natl Acad Sci U S A ; 103(48): 18178-83, 2006 Nov 28.
Article in English | MEDLINE | ID: mdl-17090677

ABSTRACT

At the center of the debate on the emergence of modern humans and their spread throughout the globe is the question of whether archaic Homo lineages contributed to the modern human gene pool, and more importantly, whether such contributions impacted the evolutionary adaptation of our species. A major obstacle to answering this question is that low levels of admixture with archaic lineages are not expected to leave extensive traces in the modern human gene pool because of genetic drift. Loci that have undergone strong positive selection, however, offer a unique opportunity to identify low-level admixture with archaic lineages, provided that the introgressed archaic allele has risen to high frequency under positive selection. The gene microcephalin (MCPH1) regulates brain size during development and has experienced positive selection in the lineage leading to Homo sapiens. Within modern humans, a group of closely related haplotypes at this locus, known as haplogroup D, rose from a single copy approximately 37,000 years ago and swept to exceptionally high frequency (approximately 70% worldwide today) because of positive selection. Here, we examine the origin of haplogroup D. By using the interhaplogroup divergence test, we show that haplogroup D likely originated from a lineage separated from modern humans approximately 1.1 million years ago and introgressed into humans by approximately 37,000 years ago. This finding supports the possibility of admixture between modern humans and archaic Homo populations (Neanderthals being one possibility). Furthermore, it buttresses the important notion that, through such adminture, our species has benefited evolutionarily by gaining new advantageous alleles. The interhaplogroup divergence test developed here may be broadly applicable to the detection of introgression at other loci in the human genome or in genomes of other species.


Subject(s)
Adaptation, Biological/genetics , Alleles , Biological Evolution , Brain/metabolism , Hominidae/genetics , Nerve Tissue Proteins/genetics , Animals , Humans , Models, Genetic , Reproduction/genetics
16.
Proc Natl Acad Sci U S A ; 102(51): 18508-13, 2005 Dec 20.
Article in English | MEDLINE | ID: mdl-16352722

ABSTRACT

We present an expanded data set of 50 unlinked autosomal noncoding regions, resequenced in samples of Hausa from Cameroon, Italians, and Chinese. We use these data to make inferences about human demographic history by using a technique that combines multiple aspects of genetic data, including levels of polymorphism, the allele frequency spectrum, and linkage disequilibrium. We explore an extensive range of demographic parameters and demonstrate that our method of combining multiple aspects of the data results in a significant reduction of the compatible parameter space. In agreement with previous reports, we find that the Hausa data are compatible with demographic equilibrium as well as a set of recent population expansion models. In contrast to the Hausa, when multiple aspects of the data are considered jointly, the non-Africans depart from an equilibrium model of constant population size and are compatible with a range of simple bottleneck models, including a 50-90% reduction in effective population size occurring some time after the appearance of modern humans in Africa 160,000-120,000 years ago.


Subject(s)
Genetic Variation/genetics , Population Density , Cameroon/ethnology , China/ethnology , Gene Frequency , Humans , Italy/ethnology , Models, Genetic , Time Factors
17.
Science ; 309(5741): 1717-20, 2005 Sep 09.
Article in English | MEDLINE | ID: mdl-16151009

ABSTRACT

The gene Microcephalin (MCPH1) regulates brain size and has evolved under strong positive selection in the human evolutionary lineage. We show that one genetic variant of Microcephalin in modern humans, which arose approximately 37,000 years ago, increased in frequency too rapidly to be compatible with neutral drift. This indicates that it has spread under strong positive selection, although the exact nature of the selection is unknown. The finding that an important brain gene has continued to evolve adaptively in anatomically modern humans suggests the ongoing evolutionary plasticity of the human brain. It also makes Microcephalin an attractive candidate locus for studying the genetics of human variation in brain-related phenotypes.


Subject(s)
Biological Evolution , Brain/anatomy & histology , Nerve Tissue Proteins/genetics , Selection, Genetic , Adaptation, Biological , Alleles , Amino Acid Substitution , Asian People/genetics , Black People/genetics , Brain/physiology , Cell Cycle Proteins , Cytoskeletal Proteins , Exons , Gene Conversion , Gene Frequency , Genetic Variation , Genotype , Haplotypes , Humans , Linkage Disequilibrium , Microcephaly/genetics , Organ Size , Polymorphism, Genetic , Recombination, Genetic , Sequence Analysis, DNA , Time , White People/genetics
18.
Science ; 309(5741): 1720-2, 2005 Sep 09.
Article in English | MEDLINE | ID: mdl-16151010

ABSTRACT

The gene ASPM (abnormal spindle-like microcephaly associated) is a specific regulator of brain size, and its evolution in the lineage leading to Homo sapiens was driven by strong positive selection. Here, we show that one genetic variant of ASPM in humans arose merely about 5800 years ago and has since swept to high frequency under strong positive selection. These findings, especially the remarkably young age of the positively selected variant, suggest that the human brain is still undergoing rapid adaptive evolution.


Subject(s)
Biological Evolution , Brain/anatomy & histology , Nerve Tissue Proteins/genetics , Selection, Genetic , Adaptation, Biological , Animals , Asian People/genetics , Black People , Brain/physiology , Gene Conversion , Gene Frequency , Genotype , Haplotypes , Humans , Linkage Disequilibrium , Models, Genetic , Organ Size , Pan troglodytes/genetics , Phylogeny , Polymorphism, Genetic , Recombination, Genetic , Sequence Analysis, DNA , Time , White People
19.
Trends Genet ; 21(11): 596-601, 2005 Nov.
Article in English | MEDLINE | ID: mdl-16153740

ABSTRACT

Unlike rare mendelian diseases, which are due to new mutations (i.e. derived alleles), several alleles that increase the risk to common diseases are ancestral. Moreover, population genetics studies suggest that some derived alleles that protect against common diseases became advantageous recently. These observations can be explained within an evolutionary framework in which ancestral alleles reflect ancient adaptations to the lifestyle of ancient human populations, whereas the derived alleles were deleterious. However, with the shift in environment and lifestyle, the ancestral alleles now increase the risk of common diseases in modern populations. In this article, we develop an explicit evolutionary model and use population genetics simulations to investigate the expected haplotype structure and type of disease-association signals of ancestral risk alleles.


Subject(s)
Evolution, Molecular , Genetic Predisposition to Disease , Genetic Variation , Genetics, Population , Alleles , Humans , Linkage Disequilibrium , Polymorphism, Single Nucleotide
20.
Genetics ; 168(3): 1699-712, 2004 Nov.
Article in English | MEDLINE | ID: mdl-15579718

ABSTRACT

A maximum-likelihood method for demographic inference is applied to data sets consisting of the frequency spectrum of unlinked single-nucleotide polymorphisms (SNPs). We use simulation analyses to explore the effect of sample size and number of polymorphic sites on both the power to reject the null hypothesis of constant population size and the properties of two- and three-dimensional maximum-likelihood estimators (MLEs). Large amounts of data are required to produce accurate demographic inferences, particularly for scenarios of recent growth. Properties of the MLEs are highly dependent upon the demographic scenario, as estimates improve with a more ancient time of growth onset and smaller degree of growth. Severe episodes of growth lead to an upward bias in the estimates of the current population size, and that bias increases with the magnitude of growth. One data set of African origin supports a model of mild, ancient growth, and another is compatible with both constant population size and a variety of growth scenarios, rejecting greater than fivefold growth beginning >36,000 years ago. Analysis of a data set of European origin indicates a bottlenecked population history, with an 85% population reduction occurring approximately 30,000 years ago.


Subject(s)
Data Interpretation, Statistical , Gene Frequency , Likelihood Functions , Polymorphism, Single Nucleotide , Black or African American , Humans , Models, Genetic
SELECTION OF CITATIONS
SEARCH DETAIL
...