Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 39
Filter
1.
Int J Biol Macromol ; 234: 123041, 2023 Apr 15.
Article in English | MEDLINE | ID: mdl-36581038

ABSTRACT

Olduvai protein domains (formerly DUF1220) show the greatest human-specific increase in copy number of any coding region in the genome and are highly correlated with human brain evolution and cognitive disease. The majority of human copies are found within four NBPF genes organized in a variable number of a tandemly arranged three-domain blocks called Olduvai triplets. Here we show that these human-specific Olduvai domains are posttranslationally processed by the furin protease, with a cleavage site occurring once at each triplet. These findings suggest that all expanded human-specific NBPF genes encode proproteins consisting of many independent Olduvai triplet proteins which are activated by furin processing. The exceptional correlation of Olduvai copy number and brain size taken together with our new furin data, indicates the ultimate target of selection was a rapid increase in dosage of autonomously functioning Olduvai triplet proteins, and that these proteins are the primary active agent underlying Olduvai's role in human brain expansion.


Subject(s)
Furin , Peptide Hydrolases , Protein Domains , Humans , Furin/genetics , Genome , Peptide Hydrolases/genetics , Protein Domains/genetics , Proteins/genetics
2.
Biomol NMR Assign ; 16(1): 113-119, 2022 04.
Article in English | MEDLINE | ID: mdl-35098449

ABSTRACT

Olduvai protein domains, encoded by the NBPF gene family, are responsible for the largest increase in copy number of any protein-coding region in the human genome. This has spawned various genetics studies which have linked these domains to human brain development and divergence from our primate ancestors, as well as currently relevant cognitive diseases such as schizophrenia and autism spectrum disorder (ASD). There are six separate Olduvai domains which together form the majority of the various protein products of the NBPF genes. The six domains include three conserved domains (CON1-3), and three human-lineage-specific domains (HLS1-3) which occur in triplet. Here, we present the solution nuclear magnetic resonance backbone assignments for the CON1 domain, which has been linked to the severity of ASD. The data confirm that CON1 is an intrinsically disordered protein (IDP). Additionally, we use innovative Hα-detected experiments which allow us to not only assign the Hα atoms and N atoms of proline residues, but also to assign residues where HN-experiments suffered from peak overlap or broadening.


Subject(s)
Autism Spectrum Disorder , Animals , Autism Spectrum Disorder/genetics , Genome, Human , Humans , Nuclear Magnetic Resonance, Biomolecular , Protein Domains/genetics , Proteins
3.
Genetics ; 214(1): 179-191, 2020 01.
Article in English | MEDLINE | ID: mdl-31754017

ABSTRACT

Sequences encoding Olduvai protein domains (formerly DUF1220) show the greatest human lineage-specific increase in copy number of any coding region in the genome and have been associated, in a dosage-dependent manner, with brain size, cognitive aptitude, autism, and schizophrenia. Tandem intragenic duplications of a three-domain block, termed the Olduvai triplet, in four NBPF genes in the chromosomal 1q21.1-0.2 region, are primarily responsible for the striking human-specific copy number increase. Interestingly, most of the Olduvai triplets are adjacent to, and transcriptionally coregulated with, three human-specific NOTCH2NL genes that have been shown to promote cortical neurogenesis. Until now, the underlying genomic events that drove the Olduvai hyperamplification in humans have remained unexplained. Here, we show that the presence or absence of an alternative first exon of the Olduvai triplet perfectly discriminates between amplified (58/58) and unamplified (0/12) triplets. We provide sequence and breakpoint analyses that suggest the alternative exon was produced by an nonallelic homologous recombination-based mechanism involving the duplicative transposition of an existing Olduvai exon found in the CON3 domain, which typically occurs at the C-terminal end of NBPF genes. We also provide suggestive in vitro evidence that the alternative exon may promote instability through a putative G-quadraplex (pG4)-based mechanism. Lastly, we use single-molecule optical mapping to characterize the intragenic structural variation observed in NBPF genes in 154 unrelated individuals and 52 related individuals from 16 families and show that the presence of pG4-containing Olduvai triplets is strongly correlated with high levels of Olduvai copy number variation. These results suggest that the same driver of genomic instability that allowed the evolutionarily recent, rapid, and extreme human-specific Olduvai expansion remains highly active in the human genome.


Subject(s)
Carrier Proteins/genetics , Genome, Human , Trinucleotide Repeat Expansion , Animals , Base Sequence , DNA Copy Number Variations , Evolution, Molecular , G-Quadruplexes , Gene Amplification , Gene Dosage , Genomic Instability , Homologous Recombination , Humans , Primates , Protein Domains , Sequence Homology
4.
Hum Genet ; 138(7): 715-721, 2019 Jul.
Article in English | MEDLINE | ID: mdl-31087184

ABSTRACT

Sequences encoding Olduvai (DUF1220) protein domains show the largest human-specific increase in copy number of any coding region in the genome and have been linked to human brain evolution. Most human-specific copies of Olduvai (119/165) are encoded by three NBPF genes that are adjacent to three human-specific NOTCH2NL genes that have been shown to promote cortical neurogenesis. Here, employing genomic, phylogenetic, and transcriptomic evidence, we show that these NOTCH2NL/NBPF gene pairs evolved jointly, as two-gene units, very recently in human evolution, and are likely co-regulated. Remarkably, while three NOTCH2NL paralogs were added, adjacent Olduvai sequences hyper-amplified, adding 119 human-specific copies. The data suggest that human-specific Olduvai domains and adjacent NOTCH2NL genes may function in a coordinated, complementary fashion to promote neurogenesis and human brain expansion in a dosage-related manner.


Subject(s)
Biological Evolution , Brain/growth & development , Brain/metabolism , Carrier Proteins/genetics , Genome, Human , Receptor, Notch2/genetics , Genomics , Humans , Phylogeny , Protein Domains
5.
Am J Psychiatry ; 176(8): 643-650, 2019 08 01.
Article in English | MEDLINE | ID: mdl-30764650

ABSTRACT

OBJECTIVE: The authors previously reported that the copy number of sequences encoding an Olduvai protein domain subtype (CON1) shows a linear association with the severity of social deficits and communication impairment in individuals with autism. In this study, using an improved measurement method, the authors replicated this association in an independent population. METHOD: The authors obtained whole genome sequence (WGS) data and phenotype data on 215 individuals from the Autism Speaks MSSNG project. They derived copy number from WGS data using a modified sequence read-depth technique. A linear mixed-effects model was used to test the association between Olduvai CON1 copy number and symptom severity as measured by the Autism Diagnostic Interview-Revised. The authors then combined data from previous studies (N=524) for final analyses. RESULTS: A significant linear association was observed between CON1 copy number and social diagnostic score (SDS) (ß=0.24) and communicative diagnostic score (CDS) (ß=0.23). Using the combined data, the authors present strong significant associations of CON1 dosage with SDS (ß=0.18) and CDS (ß=0.13). The authors also implicate Olduvai subtypes found in two genes, NBPF1 and NBPF14 (R2=6.2%). Associations were preferentially found in multiplex versus simplex families. CONCLUSIONS: The finding of a third dose-dependent association between Olduvai sequences and autism severity, preferentially in multiplex families, provides strong evidence that this highly duplicated and underexamined protein domain family plays an important role in inherited autism.


Subject(s)
Autistic Disorder/genetics , DNA Copy Number Variations/genetics , Protein Domains/genetics , Autistic Disorder/psychology , Carrier Proteins/genetics , Child , Female , Genetic Association Studies , Genome, Human , Humans , Male
6.
BMC Genomics ; 18(1): 614, 2017 Aug 14.
Article in English | MEDLINE | ID: mdl-28807002

ABSTRACT

BACKGROUND: DUF1220 protein domains found primarily in Neuroblastoma BreakPoint Family (NBPF) genes show the greatest human lineage-specific increase in copy number of any coding region in the genome. There are 302 haploid copies of DUF1220 in hg38 (~160 of which are human-specific) and the majority of these can be divided into 6 different subtypes (referred to as clades). Copy number changes of specific DUF1220 clades have been associated in a dose-dependent manner with brain size variation (both evolutionarily and within the human population), cognitive aptitude, autism severity, and schizophrenia severity. However, no published methods can directly measure copies of DUF1220 with high accuracy and no method can distinguish between domains within a clade. RESULTS: Here we describe a novel method for measuring copies of DUF1220 domains and the NBPF genes in which they are found from whole genome sequence data. We have characterized the effect that various sequencing and alignment parameters and strategies have on the accuracy and precision of the method and defined the parameters that lead to optimal DUF1220 copy number measurement and resolution. We show that copy number estimates obtained using our read depth approach are highly correlated with those generated by ddPCR for three representative DUF1220 clades. By simulation, we demonstrate that our method provides sufficient resolution to analyze DUF1220 copy number variation at three levels: (1) DUF1220 clade copy number within individual genes and groups of genes (gene-specific clade groups) (2) genome wide DUF1220 clade copies and (3) gene copy number for DUF1220-encoding genes. CONCLUSIONS: To our knowledge, this is the first method to accurately measure copies of all six DUF1220 clades and the first method to provide gene specific resolution of these clades. This allows one to discriminate among the ~300 haploid human DUF1220 copies to an extent not possible with any other method. The result is a greatly enhanced capability to analyze the role that these sequences play in human variation and disease.


Subject(s)
Gene Dosage/genetics , Genomics , Neoplasm Proteins/genetics , Evolution, Molecular , Genome, Human/genetics , Human Genetics , Humans , Neoplasm Proteins/chemistry , Protein Domains , Sequence Alignment
7.
F1000Res ; 6: 2185, 2017.
Article in English | MEDLINE | ID: mdl-29399325

ABSTRACT

We are jointly proposing a new name for a protein domain of approximately 65 amino acids that has been previously termed NBPF or DUF1220. Our two labs independently reported the initial studies of this domain, which is encoded almost entirely within a single gene family. The name Neuroblastoma Breakpoint Family ( NBPF) was applied to this gene family when the first identified member of the family was found to be interrupted in an individual with neuroblastoma. Prior to this discovery, the Pfam database had termed the domain DUF1220, denoting it as one of many protein domains of unknown function. It has been Pfam's intention to use "DUF" nomenclature to serve only as a temporary placeholder until more appropriate names are proposed based on research findings. We believe that additional studies of this domain, primarily from our laboratories over the past 10 years, have resulted in furthering our understanding of these sequences to the point where proposing a new name for this domain is warranted. Because of considerable data linking the domain to human-specific evolution, brain expansion and cognition, we believe a name reflecting these findings would be appropriate. With this in mind, we have chosen to name the domain (and the repeat that encodes it) Olduvai. The gene family will remain as NBPF for now. The primary domain subtypes will retain their previously assigned names (e.g. CON1-3; HLS1-3), and the three-domain block that expanded dramatically in the human lineage will be termed the Olduvai triplet. The new name refers to Olduvai Gorge, which is a site in East Africa that has been the source of major anthropological discoveries in the early-mid 1900's. We also chose the name as a tribute to the scientists who made important contributions to the early studies of human origins and our African genesis.

9.
Hum Genet ; 134(1): 67-75, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25287832

ABSTRACT

DUF1220 protein domains exhibit the greatest human lineage-specific copy number expansion of any protein-coding sequence in the genome, and variation in DUF1220 copy number has been linked to both brain size in humans and brain evolution among primates. Given these findings, we examined associations between DUF1220 subtypes CON1 and CON2 and cognitive aptitude. We identified a linear association between CON2 copy number and cognitive function in two independent populations of European descent. In North American males, an increase in CON2 copy number corresponded with an increase in WISC IQ (R (2) = 0.13, p = 0.02), which may be driven by males aged 6-11 (R (2) = 0.42, p = 0.003). We utilized ddPCR in a subset as a confirmatory measurement. This group had 26-33 copies of CON2 with a mean of 29, and each copy increase of CON2 was associated with a 3.3-point increase in WISC IQ (R (2) = 0.22, p = 0.045). In individuals from New Zealand, an increase in CON2 copy number was associated with an increase in math aptitude ability (R (2) = 0.10 p = 0.018). These were not confounded by brain size. To our knowledge, this is the first study to report a replicated association between copy number of a gene coding sequence and cognitive aptitude. Remarkably, dosage variations involving DUF1220 sequences have now been linked to human brain expansion, autism severity and cognitive aptitude, suggesting that such processes may be genetically and mechanistically inter-related. The findings presented here warrant expanded investigations in larger, well-characterized cohorts.


Subject(s)
Aptitude/physiology , Brain/metabolism , Carrier Proteins/genetics , Chromosomes, Human, Pair 1/genetics , Cognition/physiology , DNA Copy Number Variations/genetics , Intelligence/physiology , Adolescent , Adult , Brain/pathology , Child , Comparative Genomic Hybridization/methods , Female , Follow-Up Studies , Humans , Male , Mathematics , Organ Size , Polymerase Chain Reaction/methods , Protein Structure, Tertiary , Young Adult
10.
Genetics ; 197(4): 1063-7, 2014 Aug.
Article in English | MEDLINE | ID: mdl-25104804

ABSTRACT

This article recounts some of the early days of the Human Genome Project, covering the important and sometimes controversial role that complementary DNA-based approaches played in the discovery and mapping of the majority of human genes. It also describes my involvement in this effort and my lab's development of methods for rapid sequence identification and mapping of human genes.


Subject(s)
Chromosome Mapping , DNA, Complementary/genetics , Human Genome Project/history , 3' Untranslated Regions/genetics , Cloning, Molecular , Genome, Human , History, 20th Century , Humans , Presenilin-2/genetics , Presenilin-2/isolation & purification , Sequence Analysis, DNA
11.
Front Hum Neurosci ; 8: 427, 2014.
Article in English | MEDLINE | ID: mdl-25009482

ABSTRACT

Here we present the hypothesis that increasing copy number (dosage) of sequences encoding DUF1220 protein domains is a major contributor to the evolutionary increase in brain size, neuron number, and cognitive capacity that is associated with the primate order. We further propose that this relationship is restricted to the anthropoid sub-order of primates, with DUF1220 copy number markedly increasing in monkeys, further in apes, and most extremely in humans where the greatest number of copies (~272 haploid copies) is found. We show that this increase closely parallels the increase in brain size and neuron number that has occurred among anthropoid primate species. We also provide evidence linking DUF1220 copy number to brain size within the human species, both in normal populations and in individuals associated with brain size pathologies (1q21-associated microcephaly and macrocephaly). While we believe these and other findings presented here strongly suggest increase in DUF1220 copy number is a key contributor to anthropoid brain expansion, the data currently available rely largely on correlative measures that, though considerable, do not yet provide direct evidence for a causal connection. Nevertheless, we believe the evidence presented is sufficient to provide the basis for a testable model which proposes that DUF1220 protein domain dosage increase is a main contributor to the increase in brain size and neuron number found among the anthropoid primate species and that is at its most extreme in human.

12.
BMC Genomics ; 15: 387, 2014 May 20.
Article in English | MEDLINE | ID: mdl-24885025

ABSTRACT

BACKGROUND: Although the reference human genome sequence was declared finished in 2003, some regions of the genome remain incomplete due to their complex architecture. One such region, 1q21.1-q21.2, is of increasing interest due to its relevance to human disease and evolution. Elucidation of the exact variants behind these associations has been hampered by the repetitive nature of the region and its incomplete assembly. This region also contains 238 of the 270 human DUF1220 protein domains, which are implicated in human brain evolution and neurodevelopment. Additionally, examinations of this protein domain have been challenging due to the incomplete 1q21 build. To address these problems, a single-haplotype hydatidiform mole BAC library (CHORI-17) was used to produce the first complete sequence of the 1q21.1-q21.2 region. RESULTS: We found and addressed several inaccuracies in the GRCh37sequence of the 1q21 region on large and small scales, including genomic rearrangements and inversions, and incorrect gene copy number estimates and assemblies. The DUF1220-encoding NBPF genes required the most corrections, with 3 genes removed, 2 genes reassigned to the 1p11.2 region, 8 genes requiring assembly corrections for DUF1220 domains (~91 DUF1220 domains were misassigned), and multiple instances of nucleotide changes that reassigned the domain to a different DUF1220 subtype. These corrections resulted in an overall increase in DUF1220 copy number, yielding a haploid total of 289 copies. Approximately 20 of these new DUF1220 copies were the result of a segmental duplication from 1q21.2 to 1p11.2 that included two NBPF genes. Interestingly, this duplication may have been the catalyst for the evolutionarily important human lineage-specific chromosome 1 pericentric inversion. CONCLUSIONS: Through the hydatidiform mole genome sequencing effort, the 1q21.1-q21.2 region is complete and misassemblies involving inter- and intra-region duplications have been resolved. The availability of this single haploid sequence path will aid in the investigation of many genetic diseases linked to 1q21, including several associated with DUF1220 copy number variations. Finally, the corrected sequence identified a recent segmental duplication that added 20 additional DUF1220 copies to the human genome, and may have facilitated the chromosome 1 pericentric inversion that is among the most notable human-specific genomic landmarks.


Subject(s)
Chromosomes, Human, Pair 1 , Genome, Human , Biological Evolution , Carrier Proteins/genetics , Comparative Genomic Hybridization , DNA Copy Number Variations , Genetic Linkage , Haploidy , Humans , Protein Structure, Tertiary/genetics , Segmental Duplications, Genomic
13.
PLoS Genet ; 10(3): e1004241, 2014 Mar.
Article in English | MEDLINE | ID: mdl-24651471

ABSTRACT

One of the three most frequently documented copy number variations associated with autism spectrum disorder (ASD) is a 1q21.1 duplication that encompasses sequences encoding DUF1220 protein domains, the dosage of which we previously implicated in increased human brain size. Further, individuals with ASD frequently display accelerated brain growth and a larger brain size that is also associated with increased symptom severity. Given these findings, we investigated the relationship between DUF1220 copy number and ASD severity, and here show that in individuals with ASD (n = 170), the copy number (dosage) of DUF1220 subtype CON1 is highly variable, ranging from 56 to 88 copies following a Gaussian distribution. More remarkably, in individuals with ASD CON1 copy number is also linearly associated, in a dose-response manner, with increased severity of each of the three primary symptoms of ASD: social deficits (p = 0.021), communicative impairments (p = 0.030), and repetitive behaviors (p = 0.047). These data indicate that DUF1220 protein domain (CON1) dosage has an ASD-wide effect and, as such, is likely to be a key component of a major pathway underlying ASD severity. Finally, these findings, by implicating the dosage of a previously unexamined, copy number polymorphic and brain evolution-related gene coding sequence in ASD severity, provide an important new direction for further research into the genetic factors underlying ASD.


Subject(s)
Autistic Disorder/genetics , DNA Copy Number Variations/genetics , Gene Dosage , Adolescent , Adult , Autistic Disorder/pathology , Brain , Child , Child, Preschool , Chromosomes, Human, Pair 1/genetics , Female , Gene Duplication , Genetic Predisposition to Disease , Humans , Infant , Male , Protein Structure, Tertiary
14.
Mamm Genome ; 25(5-6): 235-43, 2014 Jun.
Article in English | MEDLINE | ID: mdl-24553828

ABSTRACT

It has been well documented that genetic factors can influence predisposition to develop alcoholism. While the underlying genomic changes may be of several types, two of the most common and disease associated are copy number variations (CNVs) and sequence alterations of protein coding regions. The goal of this study was to identify CNVs and single-nucleotide polymorphisms that occur in gene coding regions that may play a role in influencing the risk of an individual developing alcoholism. Toward this end, two mouse strains were used that have been selectively bred based on their differential sensitivity to alcohol: the Inbred long sleep (ILS) and Inbred short sleep (ISS) mouse strains. Differences in initial response to alcohol have been linked to risk for alcoholism, and the ILS/ISS strains are used to investigate the genetics of initial sensitivity to alcohol. Array comparative genomic hybridization (arrayCGH) and exome sequencing were conducted to identify CNVs and gene coding sequence differences, respectively, between ILS and ISS mice. Mouse arrayCGH was performed using catalog Agilent 1 × 244 k mouse arrays. Subsequently, exome sequencing was carried out using an Illumina HiSeq 2000 instrument. ArrayCGH detected 74 CNVs that were strain-specific (38 ILS/36 ISS), including several ISS-specific deletions that contained genes implicated in brain function and neurotransmitter release. Among several interesting coding variations detected by exome sequencing was the gain of a premature stop codon in the alpha-amylase 2B (AMY2B) gene specifically in the ILS strain. In total, exome sequencing detected 2,597 and 1,768 strain-specific exonic gene variants in the ILS and ISS mice, respectively. This study represents the most comprehensive and detailed genomic comparison of ILS and ISS mouse strains to date. The two complementary genome-wide approaches identified strain-specific CNVs and gene coding sequence variations that should provide strong candidates to contribute to the alcohol-related phenotypic differences associated with these strains.


Subject(s)
Alcoholism/genetics , DNA Copy Number Variations , Exome , Mice/genetics , Animals , Base Sequence , Comparative Genomic Hybridization , Humans , Mice, Inbred C57BL , Molecular Sequence Data , Polymorphism, Single Nucleotide
15.
PLoS One ; 8(9): e74940, 2013.
Article in English | MEDLINE | ID: mdl-24058641

ABSTRACT

BACKGROUND: Frequently individuals with autism spectrum disorder (ASD) have been noted with a larger head circumference (HC) than their typical developing peers. Biologic hypotheses suggest that an overly rapid brain growth leads to the core symptoms of ASD by impairing connectivity. Literature is divided however where deleterious, protective and null associations of HC with ASD symptoms in individuals with ASD have been found. METHOD: Individuals (n = 1,416) from the Autism Genetic Resource Exchange with ASD were examined for associations of HC with ASD like symptoms. Mixed models controlling for sex, age, race/ethnicity, simplex/multiplex status and accounting for correlations between siblings were used. Interactions by simplex/multiplex were explored. Adjustments for height in a sub-population with available data were explored as well. RESULTS: A Significant interaction term (p = 0.03) suggested that the effect of HC was dependent on whether the individual was simplex or multiplex. In simplex individuals at mean age (8.9 years) 1 cm increase in head circumference was associated with a 24% increase in the odds of a high social diagnostic score from the Autism Diagnostic Interview-Revised (odds ratio  = 1.24, p = 0.01). There was no association in multiplex individuals. Additionally, individuals classified with a non-verbal IQ <70 were 90% simplex and had a significantly increased head circumference (0.7 cm p = 0.03) relative to a mid-range non-verbal IQ group. Interestingly, children classified with a >110 non-verbal IQ also had an increased HC (0.4 cm p = 0.04), relative to a mid-range non-verbal IQ group, and were 90% multiplex. HC effects do not appear to be confounded by height, however, larger samples with height information are needed. CONCLUSION: The potential link between brain growth and autism like symptoms is complex and could depend on specific etiologies. Further investigations accounting for a likely mode of inheritance will help identify an ASD subtype related to HC.


Subject(s)
Autistic Disorder/genetics , Autistic Disorder/pathology , Head/pathology , Models, Genetic , Adolescent , Adult , Child , Child, Preschool , Cross-Sectional Studies , Female , Humans , Infant , Male , Organ Size/genetics
16.
Nat Rev Genet ; 13(12): 853-66, 2012 Dec.
Article in English | MEDLINE | ID: mdl-23154808

ABSTRACT

Given the unprecedented tools that are now available for rapidly comparing genomes, the identification and study of genetic and genomic changes that are unique to our species have accelerated, and we are entering a golden age of human evolutionary genomics. Here we provide an overview of these efforts, highlighting important recent discoveries, examples of the different types of human-specific genomic and genetic changes identified, and salient trends, such as the localization of evolutionary adaptive changes to complex loci that are highly enriched for disease associations. Finally, we discuss the remaining challenges, such as the incomplete nature of current genome sequence assemblies and difficulties in linking human-specific genomic changes to human-specific phenotypic traits.


Subject(s)
Evolution, Molecular , Genome, Human , Animals , Chromosomes, Human/genetics , Comparative Genomic Hybridization , Genetic Speciation , Genomics/trends , Hominidae/genetics , Humans , Karyotyping , Phenotype , Primates/genetics , Species Specificity
17.
G3 (Bethesda) ; 2(9): 977-86, 2012 Sep.
Article in English | MEDLINE | ID: mdl-22973535

ABSTRACT

DUF1220 protein domains exhibit the most extreme human lineage-specific (HLS) copy number increase of any protein coding region in the human genome and have recently been linked to evolutionary and pathological changes in brain size (e.g., 1q21-associated microcephaly). These findings lend support to the view that DUF1220 domain dosage is a key factor in the determination of primate (and human) brain size. Here we analyze 41 animal genomes and present the most complete account to date of the evolutionary history and genome organization of DUF1220 domains and the gene family that encodes them (NBPF). Included among the novel features identified by this analysis is a DUF1220 domain precursor in nonmammalian vertebrates, a unique predicted promoter common to all mammalian NBPF genes, six distinct clades into which DUF1220 sequences can be subdivided, and a previously unknown member of the NBPF gene family (NBPF25). Most importantly, we show that the exceptional HLS increase in DUF1220 copy number (from 102 in our last common ancestor with chimp to 272 in human; an average HLS increase of ~28 copies every million years since the Homo/Pan split) was driven by intragenic domain hyperamplification. This increase primarily involved a 4.7 kb, tandemly repeated three DUF1220 domain unit we have named the HLS DUF1220 triplet, a motif that is a likely candidate to underlie key properties unique to the Homo sapiens brain. Interestingly, all copies of the HLS DUF1220 triplet lie within a human-specific pericentric inversion that also includes the 1q12 C-band, a polymorphic heterochromatin expansion that is unique to the human genome. Both cytogenetic features likely played key roles in the rapid HLS DUF1220 triplet hyperamplification, which is among the most striking genomic changes specific to the human lineage.


Subject(s)
Evolution, Molecular , Genome , Protein Structure, Tertiary/genetics , Animals , Gene Dosage , Gene Order , Humans , Molecular Sequence Annotation , Multigene Family , Phylogeny , Physical Chromosome Mapping , Proteins/chemistry , Proteins/genetics
18.
Am J Hum Genet ; 91(3): 444-54, 2012 Sep 07.
Article in English | MEDLINE | ID: mdl-22901949

ABSTRACT

DUF1220 domains show the largest human-lineage-specific increase in copy number of any protein-coding region in the human genome and map primarily to 1q21, where deletions and reciprocal duplications have been associated with microcephaly and macrocephaly, respectively. Given these findings and the high correlation between DUF1220 copy number and brain size across primate lineages (R(2) = 0.98; p = 1.8 × 10(-6)), DUF1220 sequences represent plausible candidates for underlying 1q21-associated brain-size pathologies. To investigate this possibility, we used specialized bioinformatics tools developed for scoring highly duplicated DUF1220 sequences to implement targeted 1q21 array comparative genomic hybridization on individuals (n = 42) with 1q21-associated microcephaly and macrocephaly. We show that of all the 1q21 genes examined (n = 53), DUF1220 copy number shows the strongest association with brain size among individuals with 1q21-associated microcephaly, particularly with respect to the three evolutionarily conserved DUF1220 clades CON1(p = 0.0079), CON2 (p = 0.0134), and CON3 (p = 0.0116). Interestingly, all 1q21 DUF1220-encoding genes belonging to the NBPF family show significant correlations with frontal-occipital-circumference Z scores in the deletion group. In a similar survey of a nondisease population, we show that DUF1220 copy number exhibits the strongest correlation with brain gray-matter volume (CON1, p = 0.0246; and CON2, p = 0.0334). Notably, only DUF1220 sequences are consistently significant in both disease and nondisease populations. Taken together, these data strongly implicate the loss of DUF1220 copy number in the etiology of 1q21-associated microcephaly and support the view that DUF1220 domains function as general effectors of evolutionary, pathological, and normal variation in brain size.


Subject(s)
Brain/pathology , DNA Copy Number Variations , Organ Size , Animals , Base Sequence , Biological Evolution , Chromosomes, Human, Pair 1 , Comparative Genomic Hybridization , Gene Duplication , Humans , Megalencephaly/genetics , Microcephaly/genetics
19.
Hum Genomics ; 4(6): 421-7, 2010 Aug.
Article in English | MEDLINE | ID: mdl-20846932

ABSTRACT

Copy number variants (CNVs) create a major source of variation among individuals and populations. Array-based comparative genomic hybridisation (aCGH) is a powerful method used to detect and compare the copy numbers of DNA sequences at high resolution along the genome. In recent years, several informatics tools for accurate and efficient CNV detection and assessment have been developed. In this paper, most of the well known algorithms, analysis software and the limitations of that software will be briefly reviewed.


Subject(s)
Comparative Genomic Hybridization/methods , Computational Biology/methods , DNA Copy Number Variations/genetics , Data Collection , Software/statistics & numerical data , Humans , Markov Chains
20.
Nat Rev Genet ; 11(9): 658-62, 2010 09.
Article in English | MEDLINE | ID: mdl-20717156

ABSTRACT

A flood of comparative genomic data is resulting in the identification of human lineage-specific (HLS) sequences. As apes are our closest evolutionary relatives, transgenic introduction of HLS sequences into these species has the greatest potential to produce 'humanized' phenotypes and also to illuminate the functions of these sequences. We argue that such transgenic apes would also be more likely than other species to experience harm from such research, which renders such studies ethically unacceptable in apes and justifies regulatory barriers between these species and other non-human primates for HLS transgenic research.


Subject(s)
Biomedical Research/ethics , Genetic Techniques/ethics , Primates/genetics , Animals , Animals, Genetically Modified , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...