Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Language
Publication year range
1.
Genome Biol ; 8(10): R230, 2007.
Article in English | MEDLINE | ID: mdl-17971225

ABSTRACT

BACKGROUND: The role that chromosomal rearrangements might have played in the speciation processes that have separated the lineages of humans and chimpanzees has recently come into the spotlight. To date, however, results are contradictory. Here we revisit this issue by making use of the available human and chimpanzee genome sequence to study the relationship between chromosomal rearrangements and rates of DNA sequence evolution. RESULTS: Contrary to previous findings for this pair of species, we show that genes located in the rearranged chromosomes that differentiate the genomes of humans and chimpanzees, especially genes within rearrangements themselves, present lower divergence than genes elsewhere in the genome. Still, there are considerable differences between individual chromosomes. Chromosome 4, in particular, presents higher divergence in genes located within its rearrangement. CONCLUSION: A first conclusion of our analysis is that divergence is lower for genes located in rearranged chromosomes than for those in colinear chromosomes. We also report that non-coding regions within rearranged regions tend to have lower divergence than non-coding regions outside them. These results suggest an association between chromosomal rearrangements and lower non-coding divergence that has not been reported before, even if some chromosomes do not follow this trend and could be potentially associated with a speciation episode. In summary, without excluding it, our results suggest that chromosomal speciation has not been common along the human and chimpanzee lineage.


Subject(s)
Chromosome Aberrations , Evolution, Molecular , Genetic Speciation , Genetic Variation , Pan troglodytes/genetics , Animals , Computational Biology , Genes/genetics , Humans , Species Specificity
2.
Nat Genet ; 38(12): 1413-8, 2006 Dec.
Article in English | MEDLINE | ID: mdl-17115057

ABSTRACT

Numerous types of DNA variation exist, ranging from SNPs to larger structural alterations such as copy number variants (CNVs) and inversions. Alignment of DNA sequence from different sources has been used to identify SNPs and intermediate-sized variants (ISVs). However, only a small proportion of total heterogeneity is characterized, and little is known of the characteristics of most smaller-sized (<50 kb) variants. Here we show that genome assembly comparison is a robust approach for identification of all classes of genetic variation. Through comparison of two human assemblies (Celera's R27c compilation and the Build 35 reference sequence), we identified megabases of sequence (in the form of 13,534 putative non-SNP events) that were absent, inverted or polymorphic in one assembly. Database comparison and laboratory experimentation further demonstrated overlap or validation for 240 variable regions and confirmed >1.5 million SNPs. Some differences were simple insertions and deletions, but in regions containing CNVs, segmental duplication and repetitive DNA, they were more complex. Our results uncover substantial undescribed variation in humans, highlighting the need for comprehensive annotation strategies to fully interpret genome scanning and personalized sequencing projects.


Subject(s)
Genetic Variation , Genome, Human , Base Sequence , DNA/genetics , Genomics , Humans , In Situ Hybridization, Fluorescence , Polymerase Chain Reaction , Sequence Alignment
3.
Methods Mol Biol ; 338: 9-20, 2006.
Article in English | MEDLINE | ID: mdl-16888347

ABSTRACT

The aim of this chapter is to provide instruction for analyzing and mapping recent segmental and gene duplications in eukaryotic genomes. We describe a bioinformatics-based approach utilizing computational tools to manage eukaryotic genome sequences to characterize and understand the evolutionary fates and trajectories of duplicated genes. An introduction to bioinformatics tools and programs such as BLAST, Perl, BioPerl, and the GFF specification provides the necessary background to complete this analysis for any eukaryotic genome of interest.


Subject(s)
Chromosome Mapping/methods , Gene Duplication , Animals , Computational Biology , Databases, Genetic , Genomics , Humans , Mice , Sequence Alignment , Software
4.
Genomics ; 86(6): 692-700, 2005 Dec.
Article in English | MEDLINE | ID: mdl-16256303

ABSTRACT

Mouse and rat genomic sequences permit us to obtain a global view of evolutionary rearrangements that have occurred between the two species and to define hallmarks that might underlie these events. We present a comparative study of the sequence assemblies of mouse and rat genomes and report an enrichment of rodent-specific segmental duplications in regions where synteny is not preserved. We show that segmental duplications present higher rates of molecular evolution and that genes in rearranged regions have evolved faster than those located elsewhere. Previous studies have shown that synteny breakpoints between the mouse and the human genomes are enriched in human segmental duplications, suggesting a causative connection between such structures and evolutionary rearrangements. Our work provides further evidence to support the role of segmental duplications in chromosomal rearrangements in the evolution of the architecture of mammalian chromosomes and in the speciation processes that separate the mouse and the rat.


Subject(s)
Chromosomes, Mammalian/genetics , Evolution, Molecular , Gene Duplication , Gene Rearrangement/genetics , Mice/genetics , Rats/genetics , Animals , Computational Biology , Genomics , Synteny/genetics
5.
PLoS Genet ; 1(4): e56, 2005 Oct.
Article in English | MEDLINE | ID: mdl-16254605

ABSTRACT

With a draft genome-sequence assembly for the chimpanzee available, it is now possible to perform genome-wide analyses to identify, at a submicroscopic level, structural rearrangements that have occurred between chimpanzees and humans. The goal of this study was to investigate chromosomal regions that are inverted between the chimpanzee and human genomes. Using the net alignments for the builds of the human and chimpanzee genome assemblies, we identified a total of 1,576 putative regions of inverted orientation, covering more than 154 mega-bases of DNA. The DNA segments are distributed throughout the genome and range from 23 base pairs to 62 mega-bases in length. For the 66 inversions more than 25 kilobases (kb) in length, 75% were flanked on one or both sides by (often unrelated) segmental duplications. Using PCR and fluorescence in situ hybridization we experimentally validated 23 of 27 (85%) semi-randomly chosen regions; the largest novel inversion confirmed was 4.3 mega-bases at human Chromosome 7p14. Gorilla was used as an out-group to assign ancestral status to the variants. All experimentally validated inversion regions were then assayed against a panel of human samples and three of the 23 (13%) regions were found to be polymorphic in the human genome. These polymorphic inversions include 730 kb (at 7p22), 13 kb (at 7q11), and 1 kb (at 16q24) fragments with a 5%, 30%, and 48% minor allele frequency, respectively. Our results suggest that inversions are an important source of variation in primate genome evolution. The finding of at least three novel inversion polymorphisms in humans indicates this type of structural variation may be a more common feature of our genome than previously realized.


Subject(s)
Polymorphism, Genetic , Animals , Biological Evolution , Chromosome Inversion , Computational Biology/methods , Evolution, Molecular , Humans , In Situ Hybridization, Fluorescence , Pan troglodytes , Polymerase Chain Reaction , Species Specificity
6.
Genome Biol ; 4(8): R47, 2003.
Article in English | MEDLINE | ID: mdl-12914656

ABSTRACT

BACKGROUND: The high quality of the mouse genome draft sequence and its associated annotations are an invaluable biological resource. Identifying recent duplications in the mouse genome, especially in regions containing genes, may highlight important events in recent murine evolution. In addition, detecting recent sequence duplications can reveal potentially problematic regions of the genome assembly. We use BLAST-based computational heuristics to identify large (>/= 5 kb) and recent (>/= 90% sequence identity) segmental duplications in the mouse genome sequence. Here we present a database of recently duplicated regions of the mouse genome found in the mouse genome sequencing consortium (MGSC) February 2002 and February 2003 assemblies. RESULTS: We determined that 33.6 Mb of 2,695 Mb (1.2%) of sequence from the February 2003 mouse genome sequence assembly is involved in recent segmental duplications, which is less than that observed in the human genome (around 3.5-5%). From this dataset, 8.9 Mb (26%) of the duplication content consisted of 'unmapped' chromosome sequence. Moreover, we suspect that an additional 18.5 Mb of sequence is involved in duplication artifacts arising from sequence misassignment errors in this genome assembly. By searching for genes that are located within these regions, we identified 675 genes that mapped to duplicated regions of the mouse genome. Sixteen of these genes appear to have been duplicated independently in the human genome. From our dataset we further characterized a 42 kb recent segmental duplication of Mater, a maternal-effect gene essential for embryogenesis in mice. CONCLUSION: Our results provide an initial analysis of the recently duplicated sequence and gene content of the mouse genome. Many of these duplicated loci, as well as regions identified to be involved in potential sequence misassignment errors, will require further mapping and sequencing to achieve accuracy. A Genome Browser database was set up to display the identified duplication content presented in this work. This data will also be relevant to the growing number of investigators who use the draft genome sequence for experimental design and analysis.


Subject(s)
Computational Biology , Gene Duplication , Genome , Animals , Computational Biology/methods , Contig Mapping/methods , Databases, Genetic , Evolution, Molecular , Genes/genetics , Genes, Duplicate/genetics , Genome, Human , Humans , In Situ Hybridization, Fluorescence , Mice
7.
Science ; 300(5620): 767-72, 2003 May 02.
Article in English | MEDLINE | ID: mdl-12690205

ABSTRACT

DNA sequence and annotation of the entire human chromosome 7, encompassing nearly 158 million nucleotides of DNA and 1917 gene structures, are presented. To generate a higher order description, additional structural features such as imprinted genes, fragile sites, and segmental duplications were integrated at the level of the DNA sequence with medical genetic data, including 440 chromosome rearrangement breakpoints associated with disease. This approach enabled the discovery of candidate genes for developmental diseases including autism.


Subject(s)
Chromosomes, Human, Pair 7/genetics , Sequence Analysis, DNA , Animals , Autistic Disorder/genetics , Chromosome Aberrations , Chromosome Fragile Sites , Chromosome Fragility , Chromosome Mapping , Computational Biology , Congenital Abnormalities/genetics , CpG Islands , DNA, Complementary , Databases, Genetic , Euchromatin/genetics , Expressed Sequence Tags , Gene Duplication , Genes, Overlapping , Genetic Diseases, Inborn/genetics , Genomic Imprinting , Humans , In Situ Hybridization, Fluorescence , Limb Deformities, Congenital/genetics , Mice , Molecular Sequence Data , Mutation , Neoplasms/genetics , Pseudogenes , RNA/genetics , Retroelements , Williams Syndrome/genetics
8.
Genome Biol ; 4(4): R25, 2003.
Article in English | MEDLINE | ID: mdl-12702206

ABSTRACT

BACKGROUND: Previous studies have suggested that recent segmental duplications, which are often involved in chromosome rearrangements underlying genomic disease, account for some 5% of the human genome. We have developed rapid computational heuristics based on BLAST analysis to detect segmental duplications, as well as regions containing potential sequence misassignments in the human genome assemblies. RESULTS: Our analysis of the June 2002 public human genome assembly revealed that 107.4 of 3,043.1 megabases (Mb) (3.53%) of sequence contained segmental duplications, each with size equal or more than 5 kb and 90% identity. We have also detected that 38.9 Mb (1.28%) of sequence within this assembly is likely to be involved in sequence misassignment errors. Furthermore, we have identified a significant subset (199,965 of 2,327,473 or 8.6%) of single-nucleotide polymorphisms (SNPs) in the public databases that are not true SNPs but are potential paralogous sequence variants. CONCLUSION: Using two distinct computational approaches, we have identified most of the sequences in the human genome that have undergone recent segmental duplications. Near-identical segmental duplications present a major challenge to the completion of the human genome sequence. Potential sequence misassignments detected in this study would require additional efforts to resolve.


Subject(s)
Gene Duplication , Genome, Human , Sequence Analysis, DNA , Artifacts , Base Sequence , Chromosomes, Human , Computational Biology , Genetic Diseases, Inborn/genetics , Genetic Variation , Humans , Polymorphism, Single Nucleotide
SELECTION OF CITATIONS
SEARCH DETAIL
...