Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Microb Genom ; 8(6)2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35675101

RESUMO

Antimicrobial resistance (AMR) is a significant public health threat. Low-cost whole-genome sequencing, which is often used in surveillance programmes, provides an opportunity to assess AMR gene content in these genomes using in silico approaches. A variety of bioinformatic tools have been developed to identify these genomic elements. Most of those tools rely on reference databases of nucleotide or protein sequences and collections of models and rules for analysis. While the tools are critical for the identification of AMR genes, the databases themselves also provide significant utility for researchers, for applications ranging from sequence analysis to information about AMR phenotypes. Additionally, these databases can be evaluated by domain experts and others to ensure their accuracy. Here we describe how we curate the genes, point mutations and blast rules, and hidden Markov models used in NCBI's AMRFinderPlus, along with the quality-control steps we take to ensure database quality. We also describe the web interfaces that display the full structure of the database and their newly developed cross-browser relationships. Then, using the Reference Gene Catalog as an example, we detail how the databases, rules and models are made publicly available, as well as how to access the software. In addition, as part of the Pathogen Detection system, we have analysed over 1 million publicly available genomes using AMRFinderPlus and its databases. We discuss how the computed analyses generated by those tools can be accessed through a web interface. Finally, we conclude with NCBI's plans to make these databases accessible over the long-term.


Assuntos
Biologia Computacional , Software , Sequência de Aminoácidos , Sequenciamento Completo do Genoma
2.
Antioxidants (Basel) ; 10(9)2021 Sep 13.
Artigo em Inglês | MEDLINE | ID: mdl-34573083

RESUMO

In the case of various pathologies, an imbalance between ROS generation and the endogenous AOS can be observed, which leads to excessive ROS accumulation, intensification of LPO processes, and oxidative stress. For the prevention of diseases associated with oxidative stress, drugs with antioxidant activity can be used. The cytotoxic, antioxidant, and NO-donor properties of the new hybrid compound B6NO (di(3-hydroxy-4,5-bis(hydroxymethyl)-2-methylpyridinium) salt of 2-(nitrooxy)butanedioic acid) were studied. It was determined that B6NO chelates iron ions by 94%, which indicates B6NO's ability to block the Fenton reaction. The hybrid compound B6NO inhibits the process of initiated lipid peroxidation more effectively than pyridoxine. It was shown that B6NO exhibits antioxidant properties by decreasing ROS concentration in normal cells during the oxidative stress induction by tert-Butyl peroxide. At the same time, the B6NO antioxidant activity on tumor cells was significantly lower. B6NO significantly increases the intracellular nitrogen monoxide accumulation and showed low cytotoxicity for normal cells (IC50 > 4 mM). Thus, the results indicate a high potential of the B6NO as an antioxidant compound.

3.
BMC Bioinformatics ; 17 Suppl 8: 276, 2016 Aug 31.
Artigo em Inglês | MEDLINE | ID: mdl-27586436

RESUMO

BACKGROUND: Microbial genomes at the National Center for Biotechnology Information (NCBI) represent a large collection of more than 35,000 assemblies. There are several complexities associated with the data: a great variation in sampling density since human pathogens are densely sampled while other bacteria are less represented; different protein families occur in annotations with different frequencies; and the quality of genome annotation varies greatly. In order to extract useful information from these sophisticated data, the analysis needs to be performed at multiple levels of phylogenomic resolution and protein similarity, with an adequate sampling strategy. RESULTS: Protein clustering is used to construct meaningful and stable groups of similar proteins to be used for analysis and functional annotation. Our approach is to create protein clusters at three levels. First, tight clusters in groups of closely-related genomes (species-level clades) are constructed using a combined approach that takes into account both sequence similarity and genome context. Second, clustroids of conservative in-clade clusters are organized into seed global clusters. Finally, global protein clusters are built around the the seed clusters. We propose filtering strategies that allow limiting the protein set included in global clustering. The in-clade clustering procedure, subsequent selection of clustroids and organization into seed global clusters provides a robust representation and high rate of compression. Seed protein clusters are further extended by adding related proteins. Extended seed clusters include a significant part of the data and represent all major known cell machinery. The remaining part, coming from either non-conservative (unique) or rapidly evolving proteins, from rare genomes, or resulting from low-quality annotation, does not group together well. Processing these proteins requires significant computational resources and results in a large number of questionable clusters. CONCLUSION: The developed filtering strategies allow to identify and exclude such peripheral proteins limiting the protein dataset in global clustering. Overall, the proposed methodology allows the relevant data at different levels of details to be obtained and data redundancy eliminated while keeping biologically interesting variations.


Assuntos
Proteínas de Bactérias/metabolismo , Genoma Microbiano , Algoritmos , Análise por Conglomerados , Guanosina Trifosfato/metabolismo , Humanos , Filogenia , Estatística como Assunto
4.
Nucleic Acids Res ; 43(Database issue): D599-605, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25510495

RESUMO

NCBI RefSeq genome collection http://www.ncbi.nlm.nih.gov/genome represents all three major domains of life: Eukarya, Bacteria and Archaea as well as Viruses. Prokaryotic genome sequences are the most rapidly growing part of the collection. During the year of 2014 more than 10,000 microbial genome assemblies have been publicly released bringing the total number of prokaryotic genomes close to 30,000. We continue to improve the quality and usability of the microbial genome resources by providing easy access to the data and the results of the pre-computed analysis, and improving analysis and visualization tools. A number of improvements have been incorporated into the Prokaryotic Genome Annotation Pipeline. Several new features have been added to RefSeq prokaryotic genomes data processing pipeline including the calculation of genome groups (clades) and the optimization of protein clusters generation using pan-genome approach.


Assuntos
Bases de Dados de Ácidos Nucleicos , Genoma Arqueal , Genoma Bacteriano , Internet , Anotação de Sequência Molecular
5.
Nucleic Acids Res ; 42(Database issue): D553-9, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24316578

RESUMO

The source of the microbial genomic sequences in the RefSeq collection is the set of primary sequence records submitted to the International Nucleotide Sequence Database public archives. These can be accessed through the Entrez search and retrieval system at http://www.ncbi.nlm.nih.gov/genome. Next-generation sequencing has enabled researchers to perform genomic sequencing at rates that were unimaginable in the past. Microbial genomes can now be sequenced in a matter of hours, which has led to a significant increase in the number of assembled genomes deposited in the public archives. This huge increase in DNA sequence data presents new challenges for the annotation, analysis and visualization bioinformatics tools. New strategies have been developed for the annotation and representation of reference genomes and sequence variations derived from population studies and clinical outbreaks.


Assuntos
Bases de Dados Genéticas , Genoma Microbiano , Anotação de Sequência Molecular , Proteínas de Bactérias/genética , Genoma Bacteriano , Genômica/normas , Internet , Padrões de Referência
6.
Stand Genomic Sci ; 5(1): 168-93, 2011 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-22180819

RESUMO

The promise of genome sequencing was that the vast undiscovered country would be mapped out by comparison of the multitude of sequences available and would aid researchers in deciphering the role of each gene in every organism. Researchers recognize that there is a need for high quality data. However, different annotation procedures, numerous databases, and a diminishing percentage of experimentally determined gene functions have resulted in a spectrum of annotation quality. NCBI in collaboration with sequencing centers, archival databases, and researchers, has developed the first international annotation standards, a fundamental step in ensuring that high quality complete prokaryotic genomes are available as gold standard references. Highlights include the development of annotation assessment tools, community acceptance of protein naming standards, comparison of annotation resources to provide consistent annotation, and improved tracking of the evidence used to generate a particular annotation. The development of a set of minimal standards, including the requirement for annotated complete prokaryotic genomes to contain a full set of ribosomal RNAs, transfer RNAs, and proteins encoding core conserved functions, is an historic milestone. The use of these standards in existing genomes and future submissions will increase the quality of databases, enabling researchers to make accurate biological discoveries.

7.
Nucleic Acids Res ; 37(Database issue): D216-23, 2009 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18940865

RESUMO

Rapid increases in DNA sequencing capabilities have led to a vast increase in the data generated from prokaryotic genomic studies, which has been a boon to scientists studying micro-organism evolution and to those who wish to understand the biological underpinnings of microbial systems. The NCBI Protein Clusters Database (ProtClustDB) has been created to efficiently maintain and keep the deluge of data up to date. ProtClustDB contains both curated and uncurated clusters of proteins grouped by sequence similarity. The May 2008 release contains a total of 285 386 clusters derived from over 1.7 million proteins encoded by 3806 nt sequences from the RefSeq collection of complete chromosomes and plasmids from four major groups: prokaryotes, bacteriophages and the mitochondrial and chloroplast organelles. There are 7180 clusters containing 376 513 proteins with curated gene and protein functional annotation. PubMed identifiers and external cross references are collected for all clusters and provide additional information resources. A suite of web tools is available to explore more detailed information, such as multiple alignments, phylogenetic trees and genomic neighborhoods. ProtClustDB provides an efficient method to aggregate gene and protein annotation for researchers and is available at http://www.ncbi.nlm.nih.gov/sites/entrez?db=proteinclusters.


Assuntos
Bases de Dados de Proteínas , Proteínas/classificação , Análise por Conglomerados , Genômica , Proteínas/química , Proteínas/genética , Homologia de Sequência de Aminoácidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...