1.
BMC Genomics ; 25(1): 365, 2024 Apr 15.
Article in English | MEDLINE | ID: mdl-38622536

ABSTRACT

BACKGROUND: Microbial genomes are composed largely of protein-coding sequences, yet some genomes contain many pseudogenes caused by frameshifts or internal stop codons. These pseudogenes are believed to result from gene degradation during evolution but could also be technical artifacts of genome sequencing or assembly.

RESULTS: Using a combination of observational and experimental data, we show that many putative pseudogenes are attributable to errors that are incorporated into genomes during assembly. Within 126,564 publicly available genomes, we observed that nearly identical genomes often differed substantially in pseudogene counts. Causal inference implicated assembler, sequencing platform, and coverage as likely causative factors. Reassembly of genomes from raw reads confirmed that each variable affects the number of putative pseudogenes in an assembly. Furthermore, simulated sequencing reads corroborated our observations that the quality and quantity of raw data can significantly impact the number of pseudogenes in an assembler-dependent fashion. The number of unexpected pseudogenes due to internal stops was highly correlated (R² = 0.96) with average nucleotide identity to the ground truth genome, implying that relative pseudogene counts can be used as a proxy for overall assembly correctness. Applying our method to assemblies in RefSeq resulted in rejection of 3.6% of assemblies due to significantly elevated pseudogene counts. Reassembly from real reads obtained from high-coverage genomes showed considerable variability in spurious pseudogenes beyond that observed with simulated reads, reinforcing the finding that high coverage is necessary to mitigate assembly errors.

CONCLUSIONS: Collectively, these results demonstrate that many pseudogenes in microbial genome assemblies are actually genes. Our results suggest that high read coverage is required for correct assembly and indicate that an inflated number of pseudogenes due to internal stops is indicative of poor overall assembly quality.
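
The abstract above treats a coding sequence as a putative pseudogene when it carries a frameshift or an internal stop codon. As a rough illustration of that definition only (not the authors' pipeline), the following minimal Python sketch scans a coding sequence in frame. The function names, the use of the standard stop codons TAA/TAG/TGA, and the length-based frameshift check are all assumptions made for this example.

```python
# Illustrative sketch only; not the method used in the paper.
# Assumption: standard genetic code stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def internal_stop_positions(cds: str) -> list[int]:
    """Return 0-based codon indices of in-frame stops before the final codon."""
    seq = cds.upper()
    n_codons = len(seq) // 3
    # The last complete codon is excluded because a stop there is expected.
    return [
        i for i in range(n_codons - 1)
        if seq[3 * i:3 * i + 3] in STOP_CODONS
    ]


def looks_like_pseudogene(cds: str) -> bool:
    """Flag a sequence whose length breaks the reading frame or that has an internal stop.

    Assumption: a length that is not a multiple of 3 is used here as a crude
    stand-in for a frameshift; real annotation tools compare against homologs.
    """
    return len(cds) % 3 != 0 or bool(internal_stop_positions(cds))


if __name__ == "__main__":
    intact = "ATGAAATTTTAA"   # ATG AAA TTT TAA
    broken = "ATGTAGTTTTAA"   # ATG TAG TTT TAA (premature stop at codon 1)
    print(internal_stop_positions(intact), looks_like_pseudogene(intact))
    print(internal_stop_positions(broken), looks_like_pseudogene(broken))
```

A single miscalled base during assembly is enough to turn the first sequence into the second, which is why counts of such calls can reflect assembly errors rather than biology.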


Subjects
Bacterial Genome, Pseudogenes, Pseudogenes/genetics, Chromosome Mapping, Base Sequence, Microbial Genome, DNA Sequence Analysis/methods, High-Throughput Nucleotide Sequencing/methods
2.
NAR Genom Bioinform ; 3(3): lqab080, 2021 Sep.
Article in English | MEDLINE | ID: mdl-34541527

ABSTRACT

The observed diversity of protein-coding sequences continues to increase far more rapidly than knowledge of their functions, making classification algorithms essential for assigning a function to proteins using only their sequence. Most pipelines for annotating proteins rely on searches for homologous sequences in databases of previously annotated proteins using BLAST or HMMER. Here, we develop a new approach for classifying proteins into a taxonomy of functions and demonstrate its utility for genome annotation. Our algorithm, IDTAXA, was more accurate than BLAST or HMMER at assigning sequences to KEGG ortholog groups. Moreover, IDTAXA correctly avoided classifying sequences with novel functions to existing groups, which is a common error mode for classification approaches that rely on E-values as a proxy for confidence. We demonstrate IDTAXA's utility for annotating eukaryotic and prokaryotic genomes by assigning functions to proteins within a multi-level ontology, and we apply IDTAXA to detect genome contamination in eukaryotic genomes. Finally, we re-annotated 8604 microbial genomes with known antibiotic resistance phenotypes to discover two novel associations between proteins and antibiotic resistance. IDTAXA is available as a web tool (http://DECIPHER.codes/Classification.html) or as part of the open-source DECIPHER R package from Bioconductor.

3.
Viruses ; 12(12), 2020 Dec 10.
Article in English | MEDLINE | ID: mdl-33322070

ABSTRACT

Viruses represent important test cases for data federation due to their genome size and the rapid increase in sequence data in publicly available databases. However, previously decentralized (unfederated) data have left feature annotations without consensus or comparison. Unifying or displaying alternative annotations should be a priority both for communities with robust entry representation and for nascent communities with burgeoning data sources. To this end, during this three-day continuation of the Virus Hunting Toolkit codeathon series (VHT-2), a new integrated and federated viral index was developed. This Federated Index of Viral Experiments (FIVE) integrates pre-existing and novel functional and taxonomy annotations and virus-host pairings. Variability in the context of viral genomic diversity is often overlooked in virus databases. As a proof of concept, FIVE was the first attempt to include viral genome variation for HIV, the most well-studied human pathogen, through viral genome diversity graphs. As of the publication of this manuscript, FIVE is the first implementation of a virus-specific federated index of such scope. FIVE is coded in BigQuery for optimal access to large quantities of data and is publicly accessible. Many database or index federation projects fail to provide easier ways to access or query information. To address this, a Python API query system was developed to enhance the accessibility of FIVE.


Subjects
Computational Biology, Genetic Databases, Metagenomics/methods, Viruses/genetics, Computational Biology/methods, Genetic Variation, Viral Genome, Host-Pathogen Interactions, Humans, User-Computer Interface, Viral Proteins/genetics, Viral Proteins/metabolism, Viruses/metabolism, Web Browser
4.
ACS Chem Neurosci ; 8(6): 1159-1162, 2017 Jun 21.
Article in English | MEDLINE | ID: mdl-28257176

ABSTRACT

The direct visualization of neurotransmitters is a continuing problem in neuroscience; however, functional fluorescent sensors for organic analytes are still rare. Herein, we describe a fluorescent sensor for glutamate and zinc ions. The sensor acts as a fluorescent logic gate, giving a turn-off response to glutamate or zinc ion alone. The combination of analytes produces a large increase in fluorescence. This type of sensor will aid in the study of neurotransmission, in this case, for neurons that copackage high concentrations of zinc and glutamate.
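
The logic-gate behaviour described above can be written out as a small truth table. The Python sketch below is purely qualitative and encodes only what the abstract states: either analyte alone turns fluorescence off, and both together produce a large increase. The function name and the three response labels are assumptions chosen for illustration; no intensities are implied.

```python
def sensor_response(glutamate: bool, zinc: bool) -> str:
    """Qualitative fluorescence change relative to the analyte-free sensor.

    Hypothetical helper: the labels are illustrative, not measured values.
    """
    if glutamate and zinc:
        return "increase"   # both analytes present: large turn-on response
    if glutamate or zinc:
        return "decrease"   # exactly one analyte present: turn-off response
    return "baseline"       # no analyte: reference fluorescence


if __name__ == "__main__":
    # Enumerate all four input combinations of the gate.
    for glutamate in (False, True):
        for zinc in (False, True):
            print(f"glutamate={glutamate!s:5} zinc={zinc!s:5} -> "
                  f"{sensor_response(glutamate, zinc)}")
```

Read as a gate, the output is high only when both inputs are present, which is what makes the sensor selective for neurons that copackage zinc and glutamate.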


Subjects
Fluorescent Dyes/chemical synthesis, Glutamic Acid/analysis, Neurotransmitter Agents/analysis, Optical Imaging/methods, Zinc/analysis, Fluorescent Dyes/chemistry
5.
Thorax ; 67(1): 90-2, 2012 Jan.
Article in English | MEDLINE | ID: mdl-21846790

ABSTRACT

National surveys have revealed significant differences in patient outcomes following admission to hospital with acute exacerbation of COPD, which are likely to be due to variations in care. We developed a care bundle, comprising a short list of evidence-based practices to be implemented prior to discharge for all patients admitted with this condition, based on a review of national guidelines and other relevant literature, expert opinion and patient consultation. Implementation was then piloted using action research methodologies with patient input. Actively involving staff was vital to ensure that the changes introduced were understood and the process was followed. Implementation of a care bundle has the potential to produce a dramatic improvement in compliance with optimum health care practice.


Subjects
Evidence-Based Medicine, Patient Care Management, Chronic Obstructive Pulmonary Disease/therapy, Aged, Evidence-Based Medicine/standards, Female, Health Services Research, Humans, Male, Patient Discharge/statistics & numerical data