Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 43
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 45(D1): D507-D516, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27738135

RESUMO

The Integrated Microbial Genomes with Microbiome Samples (IMG/M: https://img.jgi.doe.gov/m/) system contains annotated DNA and RNA sequence data of (i) archaeal, bacterial, eukaryotic and viral genomes from cultured organisms, (ii) single cell genomes (SCG) and genomes from metagenomes (GFM) from uncultured archaea, bacteria and viruses and (iii) metagenomes from environmental, host associated and engineered microbiome samples. Sequence data are generated by DOE's Joint Genome Institute (JGI), submitted by individual scientists, or collected from public sequence data archives. Structural and functional annotation is carried out by JGI's genome and metagenome annotation pipelines. A variety of analytical and visualization tools provide support for examining and comparing IMG/M's datasets. IMG/M allows open access interactive analysis of publicly available datasets, while manual curation, submission and access to private datasets and computationally intensive workspace-based analysis require login/password access to its expert review (ER) companion system (IMG/M ER: https://img.jgi.doe.gov/mer/). Since the last report published in the 2014 NAR Database Issue, IMG/M's dataset content has tripled in terms of number of datasets and overall protein coding genes, while its analysis tools have been extended to cope with the rapid growth in the number and size of datasets handled by the system.


Assuntos
Biologia Computacional/métodos , Metagenoma , Metagenômica/métodos , Microbiota/genética , Software , Navegador
2.
Nucleic Acids Res ; 45(D1): D457-D465, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27799466

RESUMO

Viruses represent the most abundant life forms on the planet. Recent experimental and computational improvements have led to a dramatic increase in the number of viral genome sequences identified primarily from metagenomic samples. As a result of the expanding catalog of metagenomic viral sequences, there exists a need for a comprehensive computational platform integrating all these sequences with associated metadata and analytical tools. Here we present IMG/VR (https://img.jgi.doe.gov/vr/), the largest publicly available database of 3908 isolate reference DNA viruses with 264 413 computationally identified viral contigs from >6000 ecologically diverse metagenomic samples. Approximately half of the viral contigs are grouped into genetically distinct quasi-species clusters. Microbial hosts are predicted for 20 000 viral sequences, revealing nine microbial phyla previously unreported to be infected by viruses. Viral sequences can be queried using a variety of associated metadata, including habitat type and geographic location of the samples, or taxonomic classification according to hallmark viral genes. IMG/VR has a user-friendly interface that allows users to interrogate all integrated data and interact by comparing with external sequences, thus serving as an essential resource in the viral genomics community.


Assuntos
Vírus de DNA/genética , Bases de Dados Genéticas , Genoma Viral , Genômica/métodos , Metagenômica/métodos , Retroviridae/genética , Software , Microbiologia Ambiental , Interações Hospedeiro-Patógeno , Metagenoma , Análise de Sequência de DNA
3.
BMC Genomics ; 17: 307, 2016 Apr 26.
Artigo em Inglês | MEDLINE | ID: mdl-27118214

RESUMO

BACKGROUND: The exponential growth of genomic data from next generation technologies renders traditional manual expert curation effort unsustainable. Many genomic systems have included community annotation tools to address the problem. Most of these systems adopted a "Wiki-based" approach to take advantage of existing wiki technologies, but encountered obstacles in issues such as usability, authorship recognition, information reliability and incentive for community participation. RESULTS: Here, we present a different approach, relying on tightly integrated method rather than "Wiki-based" method, to support community annotation and user collaboration in the Integrated Microbial Genomes (IMG) system. The IMG approach allows users to use existing IMG data warehouse and analysis tools to add gene, pathway and biosynthetic cluster annotations, to analyze/reorganize contigs, genes and functions using workspace datasets, and to share private user annotations and workspace datasets with collaborators. We show that the annotation effort using IMG can be part of the research process to overcome the user incentive and authorship recognition problems thus fostering collaboration among domain experts. The usability and reliability issues are addressed by the integration of curated information and analysis tools in IMG, together with DOE Joint Genome Institute (JGI) expert review. CONCLUSION: By incorporating annotation operations into IMG, we provide an integrated environment for users to perform deeper and extended data analysis and annotation in a single system that can lead to publications and community knowledge sharing as shown in the case studies.


Assuntos
Biologia Computacional/métodos , Genoma Microbiano , Genômica/métodos , Anotação de Sequência Molecular/métodos , Software , Comportamento Cooperativo , Confiabilidade dos Dados , Disseminação de Informação , Internet , Interface Usuário-Computador
5.
Stand Genomic Sci ; 11: 17, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-26918089

RESUMO

The DOE-JGI Metagenome Annotation Pipeline (MAP v.4) performs structural and functional annotation for metagenomic sequences that are submitted to the Integrated Microbial Genomes with Microbiomes (IMG/M) system for comparative analysis. The pipeline runs on nucleotide sequences provided via the IMG submission site. Users must first define their analysis projects in GOLD and then submit the associated sequence datasets consisting of scaffolds/contigs with optional coverage information and/or unassembled reads in fasta and fastq file formats. The MAP processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNAs, as well as CRISPR elements. Structural annotation is followed by functional annotation including assignment of protein product names and connection to various protein family databases.

6.
Stand Genomic Sci ; 10: 86, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26512311

RESUMO

The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. Structural annotation is followed by assignment of protein product names and functions.

7.
Genome Announc ; 3(4)2015 Aug 06.
Artigo em Inglês | MEDLINE | ID: mdl-26251504

RESUMO

Frankia sp. strain DC12, isolated from root nodules of Datisca cannabina, is a member of the fourth lineage of Frankia, which is unable to reinfect actinorhizal plants. Here, we report its 6.88-Mbp high-quality draft genome sequence, with a G+C content of 71.92% and 5,858 candidate protein-coding genes.

8.
mBio ; 6(4): e00932, 2015 Jul 14.
Artigo em Inglês | MEDLINE | ID: mdl-26173699

RESUMO

UNLABELLED: In the discovery of secondary metabolites, analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of computational platforms that enable such a systematic approach on a large scale. In this work, we present IMG-ABC (https://img.jgi.doe.gov/abc), an atlas of biosynthetic gene clusters within the Integrated Microbial Genomes (IMG) system, which is aimed at harnessing the power of "big" genomic data for discovering small molecules. IMG-ABC relies on IMG's comprehensive integrated structural and functional genomic data for the analysis of biosynthetic gene clusters (BCs) and associated secondary metabolites (SMs). SMs and BCs serve as the two main classes of objects in IMG-ABC, each with a rich collection of attributes. A unique feature of IMG-ABC is the incorporation of both experimentally validated and computationally predicted BCs in genomes as well as metagenomes, thus identifying BCs in uncultured populations and rare taxa. We demonstrate the strength of IMG-ABC's focused integrated analysis tools in enabling the exploration of microbial secondary metabolism on a global scale, through the discovery of phenazine-producing clusters for the first time in Alphaproteobacteria. IMG-ABC strives to fill the long-existent void of resources for computational exploration of the secondary metabolism universe; its underlying scalable framework enables traversal of uncovered phylogenetic and chemical structure space, serving as a doorway to a new era in the discovery of novel molecules. IMPORTANCE: IMG-ABC is the largest publicly available database of predicted and experimental biosynthetic gene clusters and the secondary metabolites they produce. The system also includes powerful search and analysis tools that are integrated with IMG's extensive genomic/metagenomic data and analysis tool kits. As new research on biosynthetic gene clusters and secondary metabolites is published and more genomes are sequenced, IMG-ABC will continue to expand, with the goal of becoming an essential component of any bioinformatic exploration of the secondary metabolism world.


Assuntos
Vias Biossintéticas/genética , Biologia Computacional/métodos , Bases de Conhecimento , Família Multigênica , Metabolismo Secundário/genética
9.
Stand Genomic Sci ; 9(3): 540-50, 2014 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-25197439

RESUMO

Microvirga lotononidis is a recently described species of root-nodule bacteria that is an effective nitrogen- (N2) fixing microsymbiont of the symbiotically specific African legume Listia angolensis (Welw. ex Bak.) B.-E. van Wyk & Boatwr. M. lotononidis possesses several properties that are unusual in root-nodule bacteria, including pigmentation and the ability to grow at temperatures of up to 45°C. Strain WSM3557(T) is an aerobic, motile, Gram-negative, non-spore-forming rod isolated from a L. angolensis root nodule collected in Chipata, Zambia in 1963. This is the first report of a complete genome sequence for the genus Microvirga. Here we describe the features of Microvirga lotononidis strain WSM3557(T), together with genome sequence information and annotation. The 7,082,538 high-quality-draft genome is arranged in 18 scaffolds of 104 contigs, contains 6,956 protein-coding genes and 84 RNA-only encoding genes, and is one of 20 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Community Sequencing Program.

10.
Nucleic Acids Res ; 42(Database issue): D560-7, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24165883

RESUMO

The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG's data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG's annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Different IMG datamarts provide support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu).


Assuntos
Bases de Dados Genéticas , Genoma Microbiano , Vias Biossintéticas/genética , Perfilação da Expressão Gênica , Genoma Arqueal , Genoma Bacteriano , Genoma Viral , Genômica , Internet , Anotação de Sequência Molecular , Plasmídeos/genética , Proteômica , Software , Integração de Sistemas
11.
Nucleic Acids Res ; 42(Database issue): D568-73, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24136997

RESUMO

IMG/M (http://img.jgi.doe.gov/m) provides support for comparative analysis of microbial community aggregate genomes (metagenomes) in the context of a comprehensive set of reference genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG/M's data content and analytical tools have expanded continuously since its first version was released in 2007. Since the last report published in the 2012 NAR Database Issue, IMG/M's database architecture, annotation and data integration pipelines and analysis tools have been extended to copewith the rapid growth in the number and size of metagenome data sets handled by the system. IMG/M data marts provide support for the analysis of publicly available genomes, expert review of metagenome annotations (IMG/M ER: http://img.jgi.doe.gov/mer) and Human Microbiome Project (HMP)-specific metagenome samples (IMG/M HMP: http://img.jgi.doe.gov/imgm_hmp).


Assuntos
Bases de Dados Genéticas , Metagenoma , Perfilação da Expressão Gênica , Genoma Arqueal , Genoma Bacteriano , Genoma Viral , Internet , Metagenômica/normas , Plasmídeos/genética , Padrões de Referência , Análise de Sequência de Proteína , Software , Integração de Sistemas
12.
Stand Genomic Sci ; 7(3): 449-68, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24019992

RESUMO

The complete genomes of Thermus oshimai JL-2 and T. thermophilus JL-18 each consist of a circular chromosome, 2.07 Mb and 1.9 Mb, respectively, and two plasmids ranging from 0.27 Mb to 57.2 kb. Comparison of the T. thermophilus JL-18 chromosome with those from other strains of T. thermophilus revealed a high degree of synteny, whereas the megaplasmids from the same strains were highly plastic. The T. oshimai JL-2 chromosome and megaplasmids shared little or no synteny with other sequenced Thermus strains. Phylogenomic analyses using a concatenated set of conserved proteins confirmed the phylogenetic and taxonomic assignments based on 16S rRNA phylogenetics. Both chromosomes encode a complete glycolysis, tricarboxylic acid (TCA) cycle, and pentose phosphate pathway plus glucosidases, glycosidases, proteases, and peptidases, highlighting highly versatile heterotrophic capabilities. Megaplasmids of both strains contained a gene cluster encoding enzymes predicted to catalyze the sequential reduction of nitrate to nitrous oxide; however, the nitrous oxide reductase required for the terminal step in denitrification was absent, consistent with their incomplete denitrification phenotypes. A sox gene cluster was identified in both chromosomes, suggesting a mode of chemolithotrophy. In addition, nrf and psr gene clusters in T. oshmai JL-2 suggest respiratory nitrite ammonification and polysulfide reduction as possible modes of anaerobic respiration.

13.
Stand Genomic Sci ; 7(3): 469-82, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24019993

RESUMO

Nitrosomonas sp. Is79 is a chemolithoautotrophic ammonia-oxidizing bacterium that belongs to the family Nitrosomonadaceae within the phylum Proteobacteria. Ammonia oxidation is the first step of nitrification, an important process in the global nitrogen cycle ultimately resulting in the production of nitrate. Nitrosomonas sp. Is79 is an ammonia oxidizer of high interest because it is adapted to low ammonium and can be found in freshwater environments around the world. The 3,783,444-bp chromosome with a total of 3,553 protein coding genes and 44 RNA genes was sequenced by the DOE-Joint Genome Institute Program CSP 2006.

14.
Genome Announc ; 1(4)2013 Jul 05.
Artigo em Inglês | MEDLINE | ID: mdl-23833133

RESUMO

We announce the availability of the genome sequence of Streptomyces viridosporus strain T7A ATCC 39115, a plant biomass-degrading actinomycete. This bacterium is of special interest because of its capacity to degrade lignin, an underutilized component of plants in the context of bioenergy. It has a full complement of genes for plant biomass catabolism.

15.
Genome Announc ; 1(4)2013 Jul 11.
Artigo em Inglês | MEDLINE | ID: mdl-23846272

RESUMO

Members of the actinomycete genus Frankia form a nitrogen-fixing symbiosis with 8 different families of actinorhizal plants. We report a draft genome sequence for Frankia sp. strain BMG5.12, a nitrogen-fixing actinobacterium isolated from Tunisian soils with the ability to infect Elaeagnus angustifolia and Myrica gale.

17.
Genome Announc ; 1(2): e0008513, 2013 Mar 14.
Artigo em Inglês | MEDLINE | ID: mdl-23516212

RESUMO

We report here the genome sequence of Frankia sp. strain CN3, which was isolated from Coriaria nepalensis. This genome sequence is the first from the fourth lineage of Frankia, strains of which are unable to reinfect actinorhizal plants. At 10 Mb, it represents the largest Frankia genome sequenced to date.

18.
19.
Genome Announc ; 1(1)2013 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-23405355

RESUMO

The strains Thermus oshimai JL-2 and Thermus thermophilus JL-18 each have a circular chromosome, 2.07 Mb and 1.9 Mb in size, respectively, and each has two plasmids ranging from 0.27 Mb to 57.2 kb. The megaplasmid of each strain contains a gene cluster for the reduction of nitrate to nitrous oxide, consistent with their incomplete denitrification phenotypes.

20.
Stand Genomic Sci ; 8(3): 375-88, 2013 Jul 30.
Artigo em Inglês | MEDLINE | ID: mdl-24501624

RESUMO

Dehalobacter restrictus strain PER-K23 (DSM 9455) is the type strain of the species Dehalobacter restrictus. D. restrictus strain PER-K23 grows by organohalide respiration, coupling the oxidation of H2 to the reductive dechlorination of tetra- or trichloroethene. Growth has not been observed with any other electron donor or acceptor, nor has fermentative growth been shown. Here we introduce the first full genome of a pure culture within the genus Dehalobacter. The 2,943,336 bp long genome contains 2,826 protein coding and 82 RNA genes, including 5 16S rRNA genes. Interestingly, the genome contains 25 predicted reductive dehalogenase genes, the majority of which appear to be full length. The reductive dehalogenase genes are mainly located in two clusters, suggesting a much larger potential for organohalide respiration than previously anticipated.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...