Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 17 de 17
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Sci Transl Med ; 6(254): 254ra126, 2014 Sep 17.
Artigo em Inglês | MEDLINE | ID: mdl-25232178

RESUMO

Public health officials have raised concerns that plasmid transfer between Enterobacteriaceae species may spread resistance to carbapenems, an antibiotic class of last resort, thereby rendering common health care-associated infections nearly impossible to treat. To determine the diversity of carbapenemase-encoding plasmids and assess their mobility among bacterial species, we performed comprehensive surveillance and genomic sequencing of carbapenem-resistant Enterobacteriaceae in the National Institutes of Health (NIH) Clinical Center patient population and hospital environment. We isolated a repertoire of carbapenemase-encoding Enterobacteriaceae, including multiple strains of Klebsiella pneumoniae, Klebsiella oxytoca, Escherichia coli, Enterobacter cloacae, Citrobacter freundii, and Pantoea species. Long-read genome sequencing with full end-to-end assembly revealed that these organisms carry the carbapenem resistance genes on a wide array of plasmids. K. pneumoniae and E. cloacae isolated simultaneously from a single patient harbored two different carbapenemase-encoding plasmids, indicating that plasmid transfer between organisms was unlikely within this patient. We did, however, find evidence of horizontal transfer of carbapenemase-encoding plasmids between K. pneumoniae, E. cloacae, and C. freundii in the hospital environment. Our data, including full plasmid identification, challenge assumptions about horizontal gene transfer events within patients and identify possible connections between patients and the hospital environment. In addition, we identified a new carbapenemase-encoding plasmid of potentially high clinical impact carried by K. pneumoniae, E. coli, E. cloacae, and Pantoea species, in unrelated patients and in the hospital environment.


Assuntos
Proteínas de Bactérias/biossíntese , Infecção Hospitalar , Enterobacteriaceae/enzimologia , Plasmídeos , beta-Lactamases/biossíntese , Enterobacteriaceae/classificação , Enterobacteriaceae/genética , Hospitais Públicos , Humanos , National Institutes of Health (U.S.) , Vigilância da População , Reação em Cadeia da Polimerase em Tempo Real , Estados Unidos
2.
PLoS Genet ; 10(3): e1004190, 2014 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-24603370

RESUMO

Although a considerable proportion of serum lipids loci identified in European ancestry individuals (EA) replicate in African Americans (AA), interethnic differences in the distribution of serum lipids suggest that some genetic determinants differ by ethnicity. We conducted a comprehensive evaluation of five lipid candidate genes to identify variants with ethnicity-specific effects. We sequenced ABCA1, LCAT, LPL, PON1, and SERPINE1 in 48 AA individuals with extreme serum lipid concentrations (high HDLC/low TG or low HDLC/high TG). Identified variants were genotyped in the full population-based sample of AA (n = 1694) and tested for an association with serum lipids. rs328 (LPL) and correlated variants were associated with higher HDLC and lower TG. Interestingly, a stronger effect was observed on a "European" vs. "African" genetic background at this locus. To investigate this effect, we evaluated the region among West Africans (WA). For TG, the effect size among WA was the same in AA with only African local ancestry (2-3% lower TG), while the larger association among AA with local European ancestry matched previous reports in EA (10%). For HDLC, there was no association with rs328 in AA with only African local ancestry or in WA, while the association among AA with European local ancestry was much greater than what has been observed for EA (15 vs. ∼ 5 mg/dl), suggesting an interaction with an environmental or genetic factor that differs by ethnicity. Beyond this ancestry effect, the importance of African ancestry-focused, sequence-based work was also highlighted by serum lipid associations of variants that were in higher frequency (or present only) among those of African ancestry. By beginning our study with the sequence variation present in AA individuals, investigating local ancestry effects, and seeking replication in WA, we were able to comprehensively evaluate the role of a set of candidate genes in serum lipids in AA.


Assuntos
Negro ou Afro-Americano/genética , Etnicidade/genética , Estudo de Associação Genômica Ampla , Lipídeos/genética , Variação Genética , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Desequilíbrio de Ligação , Lipídeos/sangue , Polimorfismo de Nucleotídeo Único , População Branca/genética
3.
BMC Biol ; 10: 107, 2012 Dec 21.
Artigo em Inglês | MEDLINE | ID: mdl-23259493

RESUMO

BACKGROUND: Calcium-activated photoproteins are luciferase variants found in photocyte cells of bioluminescent jellyfish (Phylum Cnidaria) and comb jellies (Phylum Ctenophora). The complete genomic sequence from the ctenophore Mnemiopsis leidyi, a representative of the earliest branch of animals that emit light, provided an opportunity to examine the genome of an organism that uses this class of luciferase for bioluminescence and to look for genes involved in light reception. To determine when photoprotein genes first arose, we examined the genomic sequence from other early-branching taxa. We combined our genomic survey with gene trees, developmental expression patterns, and functional protein assays of photoproteins and opsins to provide a comprehensive view of light production and light reception in Mnemiopsis. RESULTS: The Mnemiopsis genome has 10 full-length photoprotein genes situated within two genomic clusters with high sequence conservation that are maintained due to strong purifying selection and concerted evolution. Photoprotein-like genes were also identified in the genomes of the non-luminescent sponge Amphimedon queenslandica and the non-luminescent cnidarian Nematostella vectensis, and phylogenomic analysis demonstrated that photoprotein genes arose at the base of all animals. Photoprotein gene expression in Mnemiopsis embryos begins during gastrulation in migrating precursors to photocytes and persists throughout development in the canals where photocytes reside. We identified three putative opsin genes in the Mnemiopsis genome and show that they do not group with well-known bilaterian opsin subfamilies. Interestingly, photoprotein transcripts are co-expressed with two of the putative opsins in developing photocytes. Opsin expression is also seen in the apical sensory organ. We present evidence that one opsin functions as a photopigment in vitro, absorbing light at wavelengths that overlap with peak photoprotein light emission, raising the hypothesis that light production and light reception may be functionally connected in ctenophore photocytes. We also present genomic evidence of a complete ciliary phototransduction cascade in Mnemiopsis. CONCLUSIONS: This study elucidates the genomic organization, evolutionary history, and developmental expression of photoprotein and opsin genes in the ctenophore Mnemiopsis leidyi, introduces a novel dual role for ctenophore photocytes in both bioluminescence and phototransduction, and raises the possibility that light production and light reception are linked in this early-branching non-bilaterian animal.


Assuntos
Ctenóforos/citologia , Ctenóforos/genética , Evolução Molecular , Regulação da Expressão Gênica , Genoma/genética , Proteínas Luminescentes/genética , Opsinas/genética , Sequência de Aminoácidos , Animais , Análise por Conglomerados , Ctenóforos/efeitos da radiação , Regulação da Expressão Gênica/efeitos da radiação , Proteínas de Fluorescência Verde/metabolismo , Luz , Transdução de Sinal Luminoso/efeitos da radiação , Proteínas Luminescentes/química , Proteínas Luminescentes/metabolismo , Dados de Sequência Molecular , Opsinas/química , Opsinas/metabolismo , Filogenia , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Reprodutibilidade dos Testes , Seleção Genética , Alinhamento de Sequência , Análise de Sequência de Proteína , Análise Espectral
4.
Genome Biol ; 13(7): R64, 2012 Jul 25.
Artigo em Inglês | MEDLINE | ID: mdl-22830599

RESUMO

BACKGROUND: While Staphylococcus epidermidis is commonly isolated from healthy human skin, it is also the most frequent cause of nosocomial infections on indwelling medical devices. Despite its importance, few genome sequences existed and the most frequent hospital-associated lineage, ST2, had not been fully sequenced. RESULTS: We cultivated 71 commensal S. epidermidis isolates from 15 skin sites and compared them with 28 nosocomial isolates from venous catheters and blood cultures. We produced 21 commensal and 9 nosocomial draft genomes, and annotated and compared their gene content, phylogenetic relatedness and biochemical functions. The commensal strains had an open pan-genome with 80% core genes and 20% variable genes. The variable genome was characterized by an overabundance of transposable elements, transcription factors and transporters. Biochemical diversity, as assayed by antibiotic resistance and in vitro biofilm formation, demonstrated the varied phenotypic consequences of this genomic diversity. The nosocomial isolates exhibited both large-scale rearrangements and single-nucleotide variation. We showed that S. epidermidis genomes separate into two phylogenetic groups, one consisting only of commensals. The formate dehydrogenase gene, present only in commensals, is a discriminatory marker between the two groups. CONCLUSIONS: Commensal skin S. epidermidis have an open pan-genome and show considerable diversity between isolates, even when derived from a single individual or body site. For ST2, the most common nosocomial lineage, we detect variation between three independent isolates sequenced. Finally, phylogenetic analyses revealed a previously unrecognized group of S. epidermidis strains characterized by reduced virulence and formate dehydrogenase, which we propose as a clinical molecular marker.


Assuntos
Infecções Relacionadas a Cateter/microbiologia , Infecção Hospitalar/microbiologia , Análise de Sequência de DNA/métodos , Pele/microbiologia , Staphylococcus epidermidis/classificação , Staphylococcus epidermidis/genética , Farmacorresistência Bacteriana , Evolução Molecular , Variação Genética , Genoma Bacteriano , Humanos , Dados de Sequência Molecular , Tipagem Molecular , Filogenia , Staphylococcus epidermidis/isolamento & purificação
5.
PLoS Genet ; 7(10): e1002325, 2011 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-22022284

RESUMO

We report an early onset spastic ataxia-neuropathy syndrome in two brothers of a consanguineous family characterized clinically by lower extremity spasticity, peripheral neuropathy, ptosis, oculomotor apraxia, dystonia, cerebellar atrophy, and progressive myoclonic epilepsy. Whole-exome sequencing identified a homozygous missense mutation (c.1847G>A; p.Y616C) in AFG3L2, encoding a subunit of an m-AAA protease. m-AAA proteases reside in the mitochondrial inner membrane and are responsible for removal of damaged or misfolded proteins and proteolytic activation of essential mitochondrial proteins. AFG3L2 forms either a homo-oligomeric isoenzyme or a hetero-oligomeric complex with paraplegin, a homologous protein mutated in hereditary spastic paraplegia type 7 (SPG7). Heterozygous loss-of-function mutations in AFG3L2 cause autosomal-dominant spinocerebellar ataxia type 28 (SCA28), a disorder whose phenotype is strikingly different from that of our patients. As defined in yeast complementation assays, the AFG3L2(Y616C) gene product is a hypomorphic variant that exhibited oligomerization defects in yeast as well as in patient fibroblasts. Specifically, the formation of AFG3L2(Y616C) complexes was impaired, both with itself and to a greater extent with paraplegin. This produced an early-onset clinical syndrome that combines the severe phenotypes of SPG7 and SCA28, in additional to other "mitochondrial" features such as oculomotor apraxia, extrapyramidal dysfunction, and myoclonic epilepsy. These findings expand the phenotype associated with AFG3L2 mutations and suggest that AFG3L2-related disease should be considered in the differential diagnosis of spastic ataxias.


Assuntos
Proteases Dependentes de ATP/genética , Encéfalo/anormalidades , Metaloendopeptidases/genética , Mitocôndrias/enzimologia , Paraplegia Espástica Hereditária/genética , Degenerações Espinocerebelares/genética , ATPases Associadas a Diversas Atividades Celulares , Adolescente , Sequência de Aminoácidos , Animais , Encéfalo/patologia , Criança , Diagnóstico Diferencial , Exoma/genética , Genótipo , Células HeLa , Homozigoto , Humanos , Masculino , Metaloendopeptidases/metabolismo , Camundongos , Dados de Sequência Molecular , Mutação de Sentido Incorreto , Paraplegia , Dobramento de Proteína , Irmãos , Paraplegia Espástica Hereditária/patologia , Ataxias Espinocerebelares/congênito , Degenerações Espinocerebelares/patologia , Leveduras/genética
6.
Mol Genet Metab ; 104(1-2): 189-91, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-21767969

RESUMO

While genomic sequencing methods are powerful tools in the discovery of the genetic underpinnings of human disease, incidentally-revealed novel genomic risk factors may be equally important, both scientifically, and as relates to direct patient care. We performed whole-exome sequencing on a child with VACTERL association who suffered severe post-surgical neonatal pulmonary hypertension, and identified a potential novel genetic risk factor for this complication: a heterozygous mutation in CPSI. Newborn screening results from this patient's monozygotic twin provided evidence that this mutation, in combination with an environmental trigger (in this case, surgery), may have resulted in pulmonary artery hypertension due to inadequate nitric oxide production. Identification of this genetic risk factor allows for targeted medical preventative measures in this patient as well as relatives with the same mutation, and illustrates the power of incidental medical information unearthed by whole-exome sequencing.


Assuntos
Exoma/genética , Genômica , Medicina de Precisão , Hipertensão Pulmonar Primária Familiar , Humanos , Hipertensão Pulmonar/genética , Lactente , Recém-Nascido , Masculino , Reprodutibilidade dos Testes , Análise de Sequência de DNA
7.
Nat Genet ; 43(3): 189-96, 2011 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-21258341

RESUMO

Ciliary dysfunction leads to a broad range of overlapping phenotypes, collectively termed ciliopathies. This grouping is underscored by genetic overlap, where causal genes can also contribute modifier alleles to clinically distinct disorders. Here we show that mutations in TTC21B, which encodes the retrograde intraflagellar transport protein IFT139, cause both isolated nephronophthisis and syndromic Jeune asphyxiating thoracic dystrophy. Moreover, although resequencing of TTC21B in a large, clinically diverse ciliopathy cohort and matched controls showed a similar frequency of rare changes, in vivo and in vitro evaluations showed a significant enrichment of pathogenic alleles in cases (P < 0.003), suggesting that TTC21B contributes pathogenic alleles to ∼5% of ciliopathy cases. Our data illustrate how genetic lesions can be both causally associated with diverse ciliopathies and interact in trans with other disease-causing genes and highlight how saturated resequencing followed by functional analysis of all variants informs the genetic architecture of inherited disorders.


Assuntos
Proteínas Adaptadoras de Transdução de Sinal/genética , Alelos , Transtornos da Motilidade Ciliar/genética , Animais , Variação Genética , Humanos , Camundongos , Mutação , Linhagem , Células Fotorreceptoras/fisiologia , Peixe-Zebra/genética
8.
BMC Genomics ; 11: 21, 2010 Jan 11.
Artigo em Inglês | MEDLINE | ID: mdl-20064230

RESUMO

BACKGROUND: The approaches for shotgun-based sequencing of vertebrate genomes are now well-established, and have resulted in the generation of numerous draft whole-genome sequence assemblies. In contrast, the process of refining those assemblies to improve contiguity and increase accuracy (known as 'sequence finishing') remains tedious, labor-intensive, and expensive. As a result, the vast majority of vertebrate genome sequences generated to date remain at a draft stage. RESULTS: To date, our genome sequencing efforts have focused on comparative studies of targeted genomic regions, requiring sequence finishing of large blocks of orthologous sequence (average size 0.5-2 Mb) from various subsets of 75 vertebrates. This experience has provided a unique opportunity to compare the relative effort required to finish shotgun-generated genome sequence assemblies from different species, which we report here. Importantly, we found that the sequence assemblies generated for the same orthologous regions from various vertebrates show substantial variation with respect to misassemblies and, in particular, the frequency and characteristics of sequence gaps. As a consequence, the work required to finish different species' sequences varied greatly. Application of the same standardized methods for finishing provided a novel opportunity to "assay" characteristics of genome sequences among many vertebrate species. It is important to note that many of the problems we have encountered during sequence finishing reflect unique architectural features of a particular vertebrate's genome, which in some cases may have important functional and/or evolutionary implications. Finally, based on our analyses, we have been able to improve our procedures to overcome some of these problems and to increase the overall efficiency of the sequence-finishing process, although significant challenges still remain. CONCLUSION: Our findings have important implications for the eventual finishing of the draft whole-genome sequences that have now been generated for a large number of vertebrates.


Assuntos
Genômica/métodos , Análise de Sequência de DNA/métodos , Vertebrados/genética , Animais , Mapeamento Cromossômico , Cromossomos Artificiais Bacterianos , Genoma
9.
Genome Res ; 19(9): 1665-74, 2009 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-19602640

RESUMO

ClinSeq is a pilot project to investigate the use of whole-genome sequencing as a tool for clinical research. By piloting the acquisition of large amounts of DNA sequence data from individual human subjects, we are fostering the development of hypothesis-generating approaches for performing research in genomic medicine, including the exploration of issues related to the genetic architecture of disease, implementation of genomic technology, informed consent, disclosure of genetic information, and archiving, analyzing, and displaying sequence data. In the initial phase of ClinSeq, we are enrolling roughly 1000 participants; the evaluation of each includes obtaining a detailed family and medical history, as well as a clinical evaluation. The participants are being consented broadly for research on many traits and for whole-genome sequencing. Initially, Sanger-based sequencing of 300-400 genes thought to be relevant to atherosclerosis is being performed, with the resulting data analyzed for rare, high-penetrance variants associated with specific clinical traits. The participants are also being consented to allow the contact of family members for additional studies of sequence variants to explore their potential association with specific phenotypes. Here, we present the general considerations in designing ClinSeq, preliminary results based on the generation of an initial 826 Mb of sequence data, the findings for several genes that serve as positive controls for the project, and our views about the potential implications of ClinSeq. The early experiences with ClinSeq illustrate how large-scale medical sequencing can be a practical, productive, and critical component of research in genomic medicine.


Assuntos
Aterosclerose/genética , Pesquisa Biomédica , Doenças Cardiovasculares/genética , Genoma Humano , Genômica , Projetos Piloto , Análise de Sequência de DNA/métodos , Idoso , Estudos de Coortes , Feminino , Humanos , Masculino , Linhagem , Fenótipo
10.
Science ; 324(5931): 1190-2, 2009 May 29.
Artigo em Inglês | MEDLINE | ID: mdl-19478181

RESUMO

Human skin is a large, heterogeneous organ that protects the body from pathogens while sustaining microorganisms that influence human health and disease. Our analysis of 16S ribosomal RNA gene sequences obtained from 20 distinct skin sites of healthy humans revealed that physiologically comparable sites harbor similar bacterial communities. The complexity and stability of the microbial community are dependent on the specific characteristics of the skin site. This topographical and temporal survey provides a baseline for studies that examine the role of bacterial communities in disease states and the microbial interdependencies required to maintain healthy skin.


Assuntos
Bactérias/isolamento & purificação , Metagenoma , Pele/microbiologia , Actinobacteria/classificação , Actinobacteria/genética , Actinobacteria/isolamento & purificação , Adulto , Bactérias/classificação , Bactérias/genética , Bacteroidetes/classificação , Bacteroidetes/genética , Bacteroidetes/isolamento & purificação , Biodiversidade , Feminino , Genes de RNAr , Humanos , Masculino , Dados de Sequência Molecular , Filogenia , Proteobactérias/classificação , Proteobactérias/genética , Proteobactérias/isolamento & purificação , RNA Ribossômico 16S , Fatores de Tempo , Adulto Jovem
11.
Genome Res ; 18(7): 1043-50, 2008 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-18502944

RESUMO

The many layers and structures of the skin serve as elaborate hosts to microbes, including a diversity of commensal and pathogenic bacteria that contribute to both human health and disease. To determine the complexity and identity of the microbes inhabiting the skin, we sequenced bacterial 16S small-subunit ribosomal RNA genes isolated from the inner elbow of five healthy human subjects. This analysis revealed 113 operational taxonomic units (OTUs; "phylotypes") at the level of 97% similarity that belong to six bacterial divisions. To survey all depths of the skin, we sampled using three methods: swab, scrape, and punch biopsy. Proteobacteria dominated the skin microbiota at all depths of sampling. Interpersonal variation is approximately equal to intrapersonal variation when considering bacterial community membership and structure. Finally, we report strong similarities in the complexity and identity of mouse and human skin microbiota. This study of healthy human skin microbiota will serve to direct future research addressing the role of skin microbiota in health and disease, and metagenomic projects addressing the complex physiological interactions between the skin and the microbes that inhabit this environment.


Assuntos
Bactérias/genética , Variação Genética , Pele/microbiologia , Adulto , Idoso , Animais , DNA Bacteriano/análise , DNA Bacteriano/genética , DNA Ribossômico/genética , Feminino , Humanos , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Pessoa de Meia-Idade , RNA Ribossômico 16S/genética
12.
Genome Res ; 17(6): 760-74, 2007 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-17567995

RESUMO

A key component of the ongoing ENCODE project involves rigorous comparative sequence analyses for the initially targeted 1% of the human genome. Here, we present orthologous sequence generation, alignment, and evolutionary constraint analyses of 23 mammalian species for all ENCODE targets. Alignments were generated using four different methods; comparisons of these methods reveal large-scale consistency but substantial differences in terms of small genomic rearrangements, sensitivity (sequence coverage), and specificity (alignment accuracy). We describe the quantitative and qualitative trade-offs concomitant with alignment method choice and the levels of technical error that need to be accounted for in applications that require multisequence alignments. Using the generated alignments, we identified constrained regions using three different methods. While the different constraint-detecting methods are in general agreement, there are important discrepancies relating to both the underlying alignments and the specific algorithms. However, by integrating the results across the alignments and constraint-detecting methods, we produced constraint annotations that were found to be robust based on multiple independent measures. Analyses of these annotations illustrate that most classes of experimentally annotated functional elements are enriched for constrained sequences; however, large portions of each class (with the exception of protein-coding sequences) do not overlap constrained regions. The latter elements might not be under primary sequence constraint, might not be constrained across all mammals, or might have expendable molecular functions. Conversely, 40% of the constrained sequences do not overlap any of the functional elements that have been experimentally identified. Together, these findings demonstrate and quantify how many genomic functional elements await basic molecular characterization.


Assuntos
Evolução Molecular , Genoma Humano , Mamíferos/genética , Fases de Leitura Aberta , Filogenia , Alinhamento de Sequência , Animais , Projeto Genoma Humano , Humanos
13.
Genome Res ; 16(6): 796-803, 2006 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-16672307

RESUMO

Sequencing of full-insert clones from full-length cDNA libraries from both Xenopus laevis and Xenopus tropicalis has been ongoing as part of the Xenopus Gene Collection Initiative. Here we present 10,967 full ORF verified cDNA clones (8049 from X. laevis and 2918 from X. tropicalis) as a community resource. Because the genome of X. laevis, but not X. tropicalis, has undergone allotetraploidization, comparison of coding sequences from these two clawed (pipid) frogs provides a unique angle for exploring the molecular evolution of duplicate genes. Within our clone set, we have identified 445 gene trios, each comprised of an allotetraploidization-derived X. laevis gene pair and their shared X. tropicalis ortholog. Pairwise dN/dS, comparisons within trios show strong evidence for purifying selection acting on all three members. However, dN/dS ratios between X. laevis gene pairs are elevated relative to their X. tropicalis ortholog. This difference is highly significant and indicates an overall relaxation of selective pressures on duplicated gene pairs. We have found that the paralogs that have been lost since the tetraploidization event are enriched for several molecular functions, but have found no such enrichment in the extant paralogs. Approximately 14% of the paralogous pairs analyzed here also show differential expression indicative of subfunctionalization.


Assuntos
Sequência de Bases , Biblioteca Gênica , Poliploidia , Xenopus laevis/genética , Xenopus/genética , Animais , Evolução Molecular , Expressão Gênica , Genes Duplicados , Genoma , Dados de Sequência Molecular , Fases de Leitura Aberta/genética , Filogenia , Homologia de Sequência do Ácido Nucleico
14.
Genome Res ; 14(11): 2235-44, 2004 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-15479945

RESUMO

Although the cost of generating draft-quality genomic sequence continues to decline, refining that sequence by the process of "sequence finishing" remains expensive. Near-perfect finished sequence is an appropriate goal for the human genome and a small set of reference genomes; however, such a high-quality product cannot be cost-justified for large numbers of additional genomes, at least for the foreseeable future. Here we describe the generation and quality of an intermediate grade of finished genomic sequence (termed comparative-grade finished sequence), which is tailored for use in multispecies sequence comparisons. Our analyses indicate that this sequence is very high quality (with the residual gaps and errors mostly falling within repetitive elements) and reflects 99% of the total sequence. Importantly, comparative-grade sequence finishing requires approximately 40-fold less reagents and approximately 10-fold less personnel effort compared to the generation of near-perfect finished sequence, such as that produced for the human genome. Although applied here to finishing sequence derived from individual bacterial artificial chromosome (BAC) clones, one could envision establishing routines for refining sequences emanating from whole-genome shotgun sequencing projects to a similar quality level. Our experience to date demonstrates that comparative-grade sequence finishing represents a practical and affordable option for sequence refinement en route to comparative analyses.


Assuntos
Cromossomos Artificiais Bacterianos/genética , Mapeamento de Sequências Contíguas/economia , Éxons/genética , Genoma , Análise de Sequência de DNA/economia , Software , Animais , Sequência de Bases , Clonagem Molecular , Biologia Computacional , Mapeamento de Sequências Contíguas/métodos , Custos e Análise de Custo , Bases de Dados Genéticas , Lemur/genética , Dados de Sequência Molecular , Papio/genética , Ratos , Sequências Repetitivas de Ácido Nucleico , Homologia de Sequência do Ácido Nucleico , Software/economia
15.
Genome Res ; 13(1): 55-63, 2003 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-12529306

RESUMO

Duplications have long been postulated to be an important mechanism by which genomes evolve. Interspecies genomic comparisons are one method by which the origin and molecular mechanism of duplications can be inferred. By comparative mapping in human, mouse, and rat, we previously found evidence for a recent chromosome-fission event that occurred in the mouse lineage. Cytogenetic mapping revealed that the genomic segments flanking the fission site appeared to be duplicated, with copies residing near the centromere of multiple mouse chromosomes. Here we report the mapping and sequencing of the regions of mouse chromosomes 5 and 6 involved in this chromosome-fission event as well as the results of comparative sequence analysis with the orthologous human and rat genomic regions. Our data indicate that the duplications associated with mouse chromosomes 5 and 6 are recent and that the resulting duplicated segments share significant sequence similarity with a series of regions near the centromeres of the mouse chromosomes previously identified by cytogenetic mapping. We also identified pericentromeric duplicated segments shared between mouse chromosomes 5 and 1. Finally, novel mouse satellite sequences as well as putative chimeric transcripts were found to be associated with the duplicated segments. Together, these findings demonstrate that pericentromeric duplications are not restricted to primates and may be a common mechanism for genome evolution in mammals.


Assuntos
Centrômero/genética , Duplicação Gênica , Animais , Quimera/genética , Cromossomos/genética , Cromossomos Humanos/genética , Sequência Conservada/genética , DNA Satélite/genética , Evolução Molecular , Marcadores Genéticos/genética , Humanos , Camundongos , Mapeamento Físico do Cromossomo/métodos , Ratos
16.
Nucleic Acids Res ; 30(11): 2469-77, 2002 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-12034835

RESUMO

In parallel with the production of genomic sequence data, attention is being focused on the generation of comprehensive cDNA-sequence resources. Such efforts are increasingly emphasizing the production of high-accuracy sequence corresponding to the entire insert of cDNA clones, especially those presumed to reflect the full-length mRNA. The complete sequencing of cDNA clones on a large scale presents unique challenges because of the generally small, yet heterogeneous, sizes of the cloned inserts. We have developed a strategy for high-throughput sequencing of cDNA clones using the transposon Tn5. This approach has been tailored for implementation within an existing large-scale 'shotgun-style' sequencing program, although it could be readily adapted for use in virtually any sequencing environment. In addition, we have developed a modified version of our strategy that can be applied to cDNA clones with large cloning vectors, thereby overcoming a potential limitation of transposon-based approaches. Here we describe the details of our cDNA-sequencing pipeline, including a summary of the experience in sequencing more than 4200 cDNA clones to produce more than 8 million base pairs of high-accuracy cDNA sequence. These data provide both convincing evidence that the insertion of Tn5 into cDNA clones is sufficiently random for its effective use in large-scale cDNA sequencing as well as interesting insight about the sequence context preferred for insertion by Tn5.


Assuntos
Elementos de DNA Transponíveis/genética , DNA Complementar/genética , Análise de Sequência de DNA/métodos , Composição de Bases , Distribuição Binomial , Clonagem Molecular , Vetores Genéticos/genética , Mutagênese Insercional/genética , Mapeamento Físico do Cromossomo/métodos , Recombinação Genética/genética , Sensibilidade e Especificidade
17.
Genome Res ; 12(1): 3-15, 2002 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-11779826

RESUMO

Williams syndrome is a complex developmental disorder that results from the heterozygous deletion of a approximately 1.6-Mb segment of human chromosome 7q11.23. These deletions are mediated by large (approximately 300 kb) duplicated blocks of DNA of near-identical sequence. Previously, we showed that the orthologous region of the mouse genome is devoid of such duplicated segments. Here, we extend our studies to include the generation of approximately 3.3 Mb of genomic sequence from the mouse Williams syndrome region, of which just over 1.4 Mb is finished to high accuracy. Comparative analyses of the mouse and human sequences within and immediately flanking the interval commonly deleted in Williams syndrome have facilitated the identification of nine previously unreported genes, provided detailed sequence-based information regarding 30 genes residing in the region, and revealed a number of potentially interesting conserved noncoding sequences. Finally, to facilitate comparative sequence analysis, we implemented several enhancements to the program, including the addition of links from annotated features within a generated percent-identity plot to specific records in public databases. Taken together, the results reported here provide an important comparative sequence resource that should catalyze additional studies of Williams syndrome, including those that aim to characterize genes within the commonly deleted interval and to develop mouse models of the disorder.


Assuntos
Cromossomos Humanos Par 7/genética , Análise de Sequência de DNA/métodos , Homologia de Sequência do Ácido Nucleico , Síndrome de Williams/genética , Animais , Composição de Bases , Sequência Conservada/genética , Humanos , Camundongos , Dados de Sequência Molecular , Mapeamento Físico do Cromossomo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...