Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Front Microbiol ; 13: 797997, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35875579

RESUMO

Whole-genome sequence databases continue to grow. Collection times between samples are also growing, providing both a challenge for comparing recently collected sequence data to historical samples and an opportunity for evolutionary analyses that can be used to refine match criteria. We measured evolutionary rates for 22 Salmonella enterica serotypes. Based upon these measurements, we propose using an evolutionary rate of 1.97 single-nucleotide polymorphisms (SNPs) per year when determining whether genome sequences match.

2.
Microbiology (Reading) ; 166(5): 453-459, 2020 05.
Artigo em Inglês | MEDLINE | ID: mdl-32100709

RESUMO

In 2017, the US Food and Drug Administration investigated the sources of multiple outbreaks of salmonellosis. Epidemiologic and traceback investigations identified Maradol papayas as the suspect vehicles. During the investigations, the genomes of 55 Salmonella enterica that were isolated from papaya samples were sequenced. Serovar assignments and phylogenetic analysis placed the 55 isolates into ten distinct groups, each representing a different serovar. Within-serovar SNP differences are generally between 0 and 20 SNPs, while the median between-serovar distance is 51 812 SNPs. We observed two groups with SNP distances between 21 and 100 SNPs. These relatively large within-serovar SNP distances may indicate that the isolates represent either diverse populations or multiple, genetically distinct subpopulations. Further inspection of these cases with traceback evidence allowed us to identify an 11th population. We observed that high levels of genomic diversity from individual firms is possible, with one firm yielding five of the ten serovars. Also, high levels of diversity are possible within small geographic regions, as five of the serovars were isolated from papayas that originated from farms located in Armería and Tecomán, Colima. In addition, we identified AMR genes that are present in three of the serovars studied here (aph(3')-lb, aph(6)-ld, tet(C), fosA7, and qnrB19) and we detected the presence of the plasmid IncHI2A among S. Urbana isolates.


Assuntos
Carica/microbiologia , Variação Genética , Infecções por Salmonella/microbiologia , Salmonella enterica/classificação , Salmonella enterica/genética , Surtos de Doenças , Contaminação de Alimentos , Genoma Bacteriano , Genótipo , Humanos , Filogenia , Polimorfismo de Nucleotídeo Único , Infecções por Salmonella/epidemiologia , Salmonella enterica/isolamento & purificação , Sorogrupo , Estados Unidos/epidemiologia , Sequenciamento Completo do Genoma
3.
Genome Biol ; 20(1): 286, 2019 12 18.
Artigo em Inglês | MEDLINE | ID: mdl-31849328

RESUMO

Although it is assumed that contamination in bacterial whole-genome sequencing causes errors, the influences of contamination on clustering analyses, such as single-nucleotide polymorphism discovery, phylogenetics, and multi-locus sequencing typing, have not been quantified. By developing and analyzing 720 Listeria monocytogenes, Salmonella enterica, and Escherichia coli short-read datasets, we demonstrate that within-species contamination causes errors that confound clustering analyses, while between-species contamination generally does not. Contaminant reads mapping to references or becoming incorporated into chimeric sequences during assembly are the sources of those errors. Contamination sufficient to influence clustering analyses is present in public sequence databases.


Assuntos
Contaminação por DNA , Genoma Bacteriano , Sequenciamento Completo do Genoma , Análise por Conglomerados , Escherichia coli/genética , Listeria monocytogenes/genética , Salmonella enterica/genética
4.
Front Microbiol ; 9: 1482, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30042741

RESUMO

Whole-genome sequence (WGS) analysis has revolutionized the food safety industry by enabling high-resolution typing of foodborne bacteria. Higher resolving power allows investigators to identify origins of contamination during illness outbreaks and regulatory activities quickly and accurately. Government agencies and industry stakeholders worldwide are now analyzing WGS data routinely. Although researchers have published many studies that assess the efficacy of WGS data analysis for source attribution, guidance for interpreting WGS analyses is lacking. Here, we provide the framework for interpreting WGS analyses used by the Food and Drug Administration's Center for Food Safety and Applied Nutrition (CFSAN). We based this framework on the experiences of CFSAN investigators, collaborations and interactions with government and industry partners, and evaluation of the published literature. A fundamental question for investigators is whether two or more bacteria arose from the same source of contamination. Analysts often count the numbers of nucleotide differences [single-nucleotide polymorphisms (SNPs)] between two or more genome sequences to measure genetic distances. However, using SNP thresholds alone to assess whether bacteria originated from the same source can be misleading. Bacteria that are isolated from food, environmental, or clinical samples are representatives of bacterial populations. These populations are subject to evolutionary forces that can change genome sequences. Therefore, interpreting WGS analyses of foodborne bacteria requires a more sophisticated approach. Here, we present a framework for interpreting WGS analyses that combines SNP counts with phylogenetic tree topologies and bootstrap support. We also clarify the roles of WGS, epidemiological, traceback, and other evidence in forming the conclusions of investigations. Finally, we present examples that illustrate the application of this framework to real-world situations.

5.
PLoS Biol ; 15(9): e2003769, 2017 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-28892507

RESUMO

Blastocystis is the most prevalent eukaryotic microbe colonizing the human gut, infecting approximately 1 billion individuals worldwide. Although Blastocystis has been linked to intestinal disorders, its pathogenicity remains controversial because most carriers are asymptomatic. Here, the genome sequence of Blastocystis subtype (ST) 1 is presented and compared to previously published sequences for ST4 and ST7. Despite a conserved core of genes, there is unexpected diversity between these STs in terms of their genome sizes, guanine-cytosine (GC) content, intron numbers, and gene content. ST1 has 6,544 protein-coding genes, which is several hundred more than reported for ST4 and ST7. The percentage of proteins unique to each ST ranges from 6.2% to 20.5%, greatly exceeding the differences observed within parasite genera. Orthologous proteins also display extreme divergence in amino acid sequence identity between STs (i.e., 59%-61% median identity), on par with observations of the most distantly related species pairs of parasite genera. The STs also display substantial variation in gene family distributions and sizes, especially for protein kinase and protease gene families, which could reflect differences in virulence. It remains to be seen to what extent these inter-ST differences persist at the intra-ST level. A full 26% of genes in ST1 have stop codons that are created on the mRNA level by a novel polyadenylation mechanism found only in Blastocystis. Reconstructions of pathways and organellar systems revealed that ST1 has a relatively complete membrane-trafficking system and a near-complete meiotic toolkit, possibly indicating a sexual cycle. Unlike some intestinal protistan parasites, Blastocystis ST1 has near-complete de novo pyrimidine, purine, and thiamine biosynthesis pathways and is unique amongst studied stramenopiles in being able to metabolize α-glucans rather than ß-glucans. It lacks all genes encoding heme-containing cytochrome P450 proteins. Predictions of the mitochondrion-related organelle (MRO) proteome reveal an expanded repertoire of functions, including lipid, cofactor, and vitamin biosynthesis, as well as proteins that may be involved in regulating mitochondrial morphology and MRO/endoplasmic reticulum (ER) interactions. In sharp contrast, genes for peroxisome-associated functions are absent, suggesting Blastocystis STs lack this organelle. Overall, this study provides an important window into the biology of Blastocystis, showcasing significant differences between STs that can guide future experimental investigations into differences in their virulence and clarifying the roles of these organisms in gut health and disease.


Assuntos
Blastocystis/genética , Genoma de Protozoário , Blastocystis/metabolismo , Metabolismo dos Carboidratos , Códon de Terminação , Microbioma Gastrointestinal , Humanos , Íntrons , Especificidade da Espécie
6.
PLoS One ; 11(11): e0166162, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27832109

RESUMO

The adoption of whole-genome sequencing within the public health realm for molecular characterization of bacterial pathogens has been followed by an increased emphasis on real-time detection of emerging outbreaks (e.g., food-borne Salmonellosis). In turn, large databases of whole-genome sequence data are being populated. These databases currently contain tens of thousands of samples and are expected to grow to hundreds of thousands within a few years. For these databases to be of optimal use one must be able to quickly interrogate them to accurately determine the genetic distances among a set of samples. Being able to do so is challenging due to both biological (evolutionary diverse samples) and computational (petabytes of sequence data) issues. We evaluated seven measures of genetic distance, which were estimated from either k-mer profiles (Jaccard, Euclidean, Manhattan, Mash Jaccard, and Mash distances) or nucleotide sites (NUCmer and an extended multi-locus sequence typing (MLST) scheme). When analyzing empirical data (whole-genome sequence data from 18,997 Salmonella isolates) there are features (e.g., genomic, assembly, and contamination) that cause distances inferred from k-mer profiles, which treat absent data as informative, to fail to accurately capture the distance between samples when compared to distances inferred from differences in nucleotide sites. Thus, site-based distances, like NUCmer and extended MLST, are superior in performance, but accessing the computing resources necessary to perform them may be challenging when analyzing large databases.


Assuntos
Genoma Bacteriano/genética , Tipagem de Sequências Multilocus/métodos , Salmonella/genética , Análise de Sequência de DNA/métodos , Animais , Biologia Computacional/métodos , Humanos , Filogenia , Reprodutibilidade dos Testes , Salmonella/classificação , Salmonella/fisiologia , Infecções por Salmonella/microbiologia , Especificidade da Espécie , Fatores de Tempo
7.
Genome Announc ; 4(5)2016 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-27634991

RESUMO

Listeria monocytogenes is a pathogenic bacterium of importance to public health and food safety agencies. We present the genome sequence of the serotype 1/2a L. monocytogenes food isolate HPB913, which was collected in Canada in 1993 as part of an investigation into a sporadic case of foodborne illness.

8.
Genome Announc ; 4(4)2016 Aug 04.
Artigo em Inglês | MEDLINE | ID: mdl-27491990

RESUMO

Listeria monocytogenes is a foodborne pathogen that causes severe illness. Thus, ongoing efforts at real-time whole-genome sequencing are of utmost importance. However, it is also important that retrospective analyses that place these data into context be performed. Here, we present the genome sequence of strain HPB2088, which was collected in 1994.

9.
BMC Res Notes ; 8: 748, 2015 Dec 08.
Artigo em Inglês | MEDLINE | ID: mdl-26643440

RESUMO

BACKGROUND: The influences that different programs and conditions have on error rates of single-nucleotide polymorphism (SNP) analyses are poorly understood. Using Illumina short-read sequence data generated from Listeria monocytogenes strain HPB5622, we assessed the performance of four SNP callers (BCFtools, FreeBayes, UnifiedGenotyper, VarScan) under a variety of conditions, including: (1) a range of sequencing coverages; (2) use of four popular reference-guided assemblers (Burrows-Wheeler Aligner, Novoalign, MOSAIK, SMALT); (3) with and without read quality trimming and filtering; and (4) use of different reference sequences. RESULTS: At 8-fold coverage the proportions of true positive calls ranged from 0.22 to 25.00 % when reads were aligned to a nearly identical reference (0.000096 % distant). Calls made when reads were aligned to a non-identical reference (0.85 % distant) were from 92.54 to 98.88 % accurate. At 79-fold coverage accuracies ranged from 3.95 to 20.00 % with the nearly identical reference and 93.80-98.75 % with the non-identical reference. Read preprocessing significantly changed the numbers of false positive calls made, from a 65.24 % decrease to a 54.55 % increase. CONCLUSIONS: The combinations of reference-guided sequence assemblers and SNP callers greatly influenced not only the numbers of true and false positive sites but also the proportions of true positive calls relative to the total numbers of calls made. Furthermore, the efficacy of different assembler and caller combinations changed dramatically with the different conditions tested. Researchers should consider whether identifying the greatest numbers of true positive sites, reducing the numbers of false positive calls, or achieving the highest accuracies are desired.


Assuntos
Listeria monocytogenes/genética , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA/normas , Genes Bacterianos
10.
BMC Microbiol ; 15: 224, 2015 Oct 22.
Artigo em Inglês | MEDLINE | ID: mdl-26490433

RESUMO

BACKGROUND: Next-generation sequencing provides a powerful means of molecular characterization. However, methods such as single-nucleotide polymorphism detection or whole-chromosome sequence analysis are computationally expensive, prone to errors, and are still less accessible than traditional typing methods. Here, we present the Listeria monocytogenes core-genome sequence typing method for molecular characterization. This method uses a high-confidence core (HCC) genome, calculated to ensure accurate identification of orthologs. We also developed an evolutionarily relevant nomenclature based upon phylogenetic analysis of HCC genomes. Finally, we created a pipeline (LmCGST; https://sourceforge.net/projects/lmcgst/files/) that takes in raw next-generation sequencing reads, calculates a subject HCC profile, compares it to an expandable database, assigns a sequence type, and performs a phylogenetic analysis. RESULTS: We analyzed 29 high-quality, closed Listeria monocytogenes chromosome sequences and identified loci that are reliable targets for automated molecular characterization methods. We identified 1013 open-reading frames that comprise our high-confidence core (HCC) genome. We then populated a database with HCC profiles from 114 taxa. We sequenced 84 randomly selected isolates from the Listeriosis Reference Service for Canada's collection and analysed them with the LmCGST pipeline. In addition, we generated pulsed-field gel electrophoresis, ribotyping, and in silico multi-locus sequence typing (MLST) data for the 84 isolates and compared the results to those obtained using the CGST method. We found that all of the methods yielded results that are generally congruent. However, due to the increased numbers of categories, the CGST method provides much greater discriminatory power than the other methods tested here. CONCLUSIONS: We show that the CGST method provides increased discriminatory power relative to typing methods such as pulsed-field gel electrophoresis, ribotyping, and multi-locus sequence typing while it addresses several shortcomings of other methods of molecular characterization with next-generation sequence data. It uses discrete, well-defined groupings (types) of organisms that are phylogenetically relevant and easily interpreted. In addition, the CGST scheme can be expanded to include additional loci and HCC profiles in the future. In total, the CGST method provides an approach to the molecular characterization of Listeria monocytogenes with next-generation sequence data that is highly reproducible, easily standardized, portable, and accessible.


Assuntos
Biologia Computacional/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Listeria monocytogenes/classificação , Listeria monocytogenes/genética , Tipagem Molecular/métodos , Análise de Sequência de DNA , Filogenia
11.
Genome Announc ; 3(3)2015 Jun 11.
Artigo em Inglês | MEDLINE | ID: mdl-26067972

RESUMO

Listeria monocytogenes strain HPB5415-isolated from deli meat-was found in 2008 to have the same pulsed-field gel electrophoresis patterns as a clinical strain (08-5923). However, whether nucleotide differences (single nucleotide polymorphisms [SNPs]) exist between their genomes was not determined. We sequenced the L. monocytogenes strain HPB5415 genome and identified 52 SNPs relative to strain 08-5923.

12.
Genome Announc ; 2(6)2014 Nov 06.
Artigo em Inglês | MEDLINE | ID: mdl-25377702

RESUMO

Clostridium botulinum is important for food safety and studies of neurotoxins associated with human botulism. We present the draft genome sequences of two strains belonging to group II type B: one collected from Pacific Ocean sediments (DB-2) and another obtained during a botulism outbreak (KAPB-3).

13.
Genome Announc ; 2(4)2014 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-25103765

RESUMO

Cronobacter sakazakii is a food-borne pathogenic bacterium that may cause severe illness in neonates and the elderly. We present the genome sequence of a rare strain (ST40, CC45), commonly found in multiple food processing facilities and in powdered infant formula and only indicted in a single clinical case.

14.
PLoS One ; 9(8): e104579, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25144537

RESUMO

The wide availability of whole-genome sequencing (WGS) and an abundance of open-source software have made detection of single-nucleotide polymorphisms (SNPs) in bacterial genomes an increasingly accessible and effective tool for comparative analyses. Thus, ensuring that real nucleotide differences between genomes (i.e., true SNPs) are detected at high rates and that the influences of errors (such as false positive SNPs, ambiguously called sites, and gaps) are mitigated is of utmost importance. The choices researchers make regarding the generation and analysis of WGS data can greatly influence the accuracy of short-read sequence alignments and, therefore, the efficacy of such experiments. We studied the effects of some of these choices, including: i) depth of sequencing coverage, ii) choice of reference-guided short-read sequence assembler, iii) choice of reference genome, and iv) whether to perform read-quality filtering and trimming, on our ability to detect true SNPs and on the frequencies of errors. We performed benchmarking experiments, during which we assembled simulated and real Listeria monocytogenes strain 08-5578 short-read sequence datasets of varying quality with four commonly used assemblers (BWA, MOSAIK, Novoalign, and SMALT), using reference genomes of varying genetic distances, and with or without read pre-processing (i.e., quality filtering and trimming). We found that assemblies of at least 50-fold coverage provided the most accurate results. In addition, MOSAIK yielded the fewest errors when reads were aligned to a nearly identical reference genome, while using SMALT to align reads against a reference sequence that is ∼0.82% distant from 08-5578 at the nucleotide level resulted in the detection of the greatest numbers of true SNPs and the fewest errors. Finally, we show that whether read pre-processing improves SNP detection depends upon the choice of reference sequence and assembler. In total, this study demonstrates that researchers should test a variety of conditions to achieve optimal results.


Assuntos
Listeria monocytogenes/genética , Análise de Sequência/normas , Genoma Bacteriano/genética , Polimorfismo de Nucleotídeo Único/genética , Software
15.
Genome Announc ; 2(4)2014 Jul 24.
Artigo em Inglês | MEDLINE | ID: mdl-25059873

RESUMO

Listeria monocytogenes, a pathogenic food-borne bacterium, is the causative agent of both sporadic and outbreak cases of human listeriosis. Here, we present the genome sequence of L. monocytogenes reference strain LI0521, isolated during an outbreak involving contaminated cheese, which has been used as the model during several proteomic studies.

16.
PLoS One ; 3(8): e2879, 2007 Aug 06.
Artigo em Inglês | MEDLINE | ID: mdl-18663385

RESUMO

Meiosis is a defining feature of eukaryotes but its phylogenetic distribution has not been broadly determined, especially among eukaryotic microorganisms (i.e. protists)-which represent the majority of eukaryotic 'supergroups'. We surveyed genomes of animals, fungi, plants and protists for meiotic genes, focusing on the evolutionarily divergent parasitic protist Trichomonas vaginalis. We identified homologs of 29 components of the meiotic recombination machinery, as well as the synaptonemal and meiotic sister chromatid cohesion complexes. T. vaginalis has orthologs of 27 of 29 meiotic genes, including eight of nine genes that encode meiosis-specific proteins in model organisms. Although meiosis has not been observed in T. vaginalis, our findings suggest it is either currently sexual or a recent asexual, consistent with observed, albeit unusual, sexual cycles in their distant parabasalid relatives, the hypermastigotes. T. vaginalis may use meiotic gene homologs to mediate homologous recombination and genetic exchange. Overall, this expanded inventory of meiotic genes forms a useful "meiosis detection toolkit". Our analyses indicate that these meiotic genes arose, or were already present, early in eukaryotic evolution; thus, the eukaryotic cenancestor contained most or all components of this set and was likely capable of performing meiotic recombination using near-universal meiotic machinery.


Assuntos
Meiose/genética , Proteínas de Protozoários/genética , Trichomonas vaginalis/genética , Animais , Células Eucarióticas/citologia , Células Eucarióticas/metabolismo , Evolução Molecular , Genoma de Protozoário , Modelos Genéticos , Proteínas de Protozoários/classificação , Proteínas de Protozoários/fisiologia , Recombinação Genética , Trichomonas vaginalis/citologia , Trichomonas vaginalis/fisiologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...