Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
Add more filters










Database
Language
Publication year range
1.
Bioinformatics ; 37(7): 1024-1025, 2021 05 17.
Article in English | MEDLINE | ID: mdl-32777813

ABSTRACT

SUMMARY: Polymerase chain reaction-based assays are the current gold standard for detecting and diagnosing SARS-CoV-2. However, as SARS-CoV-2 mutates, we need to constantly assess whether existing PCR-based assays will continue to detect all known viral strains. To enable the continuous monitoring of SARS-CoV-2 assays, we have developed a web-based assay validation algorithm that checks existing PCR-based assays against the ever-expanding genome databases for SARS-CoV-2 using both thermodynamic and edit-distance metrics. The assay-screening results are displayed as a heatmap, showing the number of mismatches between each detection and each SARS-CoV-2 genome sequence. Using a mismatch threshold to define detection failure, assay performance is summarized with the true-positive rate (recall) to simplify assay comparisons. AVAILABILITY AND IMPLEMENTATION: The assay evaluation website and supporting software are Open Source and freely available at https://covid19.edgebioinformatics.org/#/assayValidation, https://github.com/jgans/thermonucleotide BLAST and https://github.com/LANL-Bioinformatics/assay_validation. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
COVID-19 , SARS-CoV-2 , COVID-19 Testing , Humans , Polymerase Chain Reaction , Sensitivity and Specificity
3.
Article in English | MEDLINE | ID: mdl-31024904

ABSTRACT

Progress in modern biology is being driven, in part, by the large amounts of freely available data in public resources such as the International Nucleotide Sequence Database Collaboration (INSDC), the world's primary database of biological sequence (and related) information. INSDC and similar databases have dramatically increased the pace of fundamental biological discovery and enabled a host of innovative therapeutic, diagnostic, and forensic applications. However, as high-value, openly shared resources with a high degree of assumed trust, these repositories share compelling similarities to the early days of the Internet. Consequently, as public biological databases continue to increase in size and importance, we expect that they will face the same threats as undefended cyberspace. There is a unique opportunity, before a significant breach and loss of trust occurs, to ensure they evolve with quality and security as a design philosophy rather than costly "retrofitted" mitigations. This Perspective surveys some potential quality assurance and security weaknesses in existing open genomic and proteomic repositories, describes methods to mitigate the likelihood of both intentional and unintentional errors, and offers recommendations for risk mitigation based on lessons learned from cybersecurity.

4.
Nucleic Acids Res ; 40(12): e96, 2012 Jul.
Article in English | MEDLINE | ID: mdl-22434885

ABSTRACT

Environmental biosurveillance and microbial ecology studies use PCR-based assays to detect and quantify microbial taxa and gene sequences within a complex background of microorganisms. However, the fragmentary nature and growing quantity of DNA-sequence data make group-specific assay design challenging. We solved this problem by developing a software platform that enables PCR-assay design at an unprecedented scale. As a demonstration, we developed quantitative PCR assays for a globally widespread, ecologically important bacterial group in soil, Acidobacteria Group 1. A total of 33,684 Acidobacteria 16S rRNA gene sequences were used for assay design. Following 1 week of computation on a 376-core cluster, 83 assays were obtained. We validated the specificity of the top three assays, collectively predicted to detect 42% of the Acidobacteria Group 1 sequences, by PCR amplification and sequencing of DNA from soil. Based on previous analyses of 16S rRNA gene sequencing, Acidobacteria Group 1 species were expected to decrease in response to elevated atmospheric CO(2). Quantitative PCR results, using the Acidobacteria Group 1-specific PCR assays, confirmed the expected decrease and provided higher statistical confidence than the 16S rRNA gene-sequencing data. These results demonstrate a powerful capacity to address previously intractable assay design challenges.


Subject(s)
Acidobacteria/isolation & purification , DNA Primers/chemistry , Polymerase Chain Reaction/methods , Software , Soil Microbiology , Acidobacteria/genetics , Algorithms , RNA, Ribosomal, 16S/genetics , Sequence Analysis, DNA
5.
BMC Genomics ; 10: 501, 2009 Oct 29.
Article in English | MEDLINE | ID: mdl-19874620

ABSTRACT

BACKGROUND: New and improved antimicrobial countermeasures are urgently needed to counteract increased resistance to existing antimicrobial treatments and to combat currently untreatable or new emerging infectious diseases. We demonstrate that computational comparative genomics, together with experimental screening, can identify potential generic (i.e., conserved across multiple pathogen species) and novel virulence-associated genes that may serve as targets for broad-spectrum countermeasures. RESULTS: Using phylogenetic profiles of protein clusters from completed microbial genome sequences, we identified seventeen protein candidates that are common to diverse human pathogens and absent or uncommon in non-pathogens. Mutants of 13 of these candidates were successfully generated in Yersinia pseudotuberculosis and the potential role of the proteins in virulence was assayed in an animal model. Six candidate proteins are suggested to be involved in the virulence of Y. pseudotuberculosis, none of which have previously been implicated in the virulence of Y. pseudotuberculosis and three have no record of involvement in the virulence of any bacteria. CONCLUSION: This work demonstrates a strategy for the identification of potential virulence factors that are conserved across a number of human pathogenic bacterial species, confirming the usefulness of this tool.


Subject(s)
Anti-Infective Agents/pharmacology , Virulence/drug effects , Virulence/genetics , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Drug Discovery , Drug Resistance, Bacterial , Genomics , Humans , Yersinia pseudotuberculosis/genetics , Yersinia pseudotuberculosis/pathogenicity
6.
Nucleic Acids Res ; 36(12): e74, 2008 Jul.
Article in English | MEDLINE | ID: mdl-18515842

ABSTRACT

Nucleic acid-based biochemical assays are crucial to modern biology. Key applications, such as detection of bacterial, viral and fungal pathogens, require detailed knowledge of assay sensitivity and specificity to obtain reliable results. Improved methods to predict assay performance are needed for exploiting the exponentially growing amount of DNA sequence data and for reducing the experimental effort required to develop robust detection assays. Toward this goal, we present an algorithm for the calculation of sequence similarity based on DNA thermodynamics. In our approach, search queries consist of one to three oligonucleotide sequences representing either a hybridization probe, a pair of Padlock probes or a pair of PCR primers with an optional TaqMantrade mark probe (i.e. in silico or 'virtual' PCR). Matches are reported if the query and target satisfy both the thermodynamics of the assay (binding at a specified hybridization temperature and/or change in free energy) and the relevant biological constraints (assay sequences binding to the correct target duplex strands in the required orientations). The sensitivity and specificity of our method is evaluated by comparing predicted to known sequence tagged sites in the human genome. Free energy is shown to be a more sensitive and specific match criterion than hybridization temperature.


Subject(s)
Databases, Nucleic Acid , Polymerase Chain Reaction , Sequence Alignment/methods , Sequence Analysis, DNA , Algorithms , DNA/chemistry , Genome, Human , Humans , Oligonucleotide Probes/chemistry , Sequence Tagged Sites , Thermodynamics
7.
BMC Bioinformatics ; 8: 204, 2007 Jun 14.
Article in English | MEDLINE | ID: mdl-17570856

ABSTRACT

BACKGROUND: The ability to visualize genomic features and design experimental assays that can target specific regions of a genome is essential for modern biology. To assist in these tasks, we present Genomorama, a software program for interactively displaying multiple genomes and identifying potential DNA hybridization sites for assay design. RESULTS: Useful features of Genomorama include genome search by DNA hybridization (probe binding and PCR amplification), efficient multi-scale display and manipulation of multiple genomes, support for many genome file types and the ability to search for and retrieve data from the National Center for Biotechnology Information (NCBI) Entrez server. CONCLUSION: Genomorama provides an efficient computational platform for visualizing and analyzing multiple genomes.


Subject(s)
Chromosome Mapping/methods , Genome/genetics , Sequence Alignment/methods , Sequence Analysis, DNA/methods , Software , User-Computer Interface , Algorithms , Base Sequence , Database Management Systems , Databases, Genetic , Gene Targeting/methods , Molecular Sequence Data
8.
Bioinformatics ; 21(5): 680-2, 2005 Mar 01.
Article in English | MEDLINE | ID: mdl-15388520

ABSTRACT

MOTIVATION: High-throughput NMR structure determination is a goal that will require progress on many fronts, one of which is rapid resonance assignment. An important rate-limiting step in the resonance assignment process is accurate identification of resonance peaks in the NMR spectra. Peak-picking schemes range from incomplete (which lose essential assignment connectivities) to noisy (which obscure true connectivities with many false ones). We introduce an automated preassignment process that removes false peaks from noisy peak lists by requiring consensus between multiple NMR experiments and exploiting a priori information about NMR spectra. This process is designed to accept multiple input formats and generate multiple output formats, in an effort to be compatible with a variety of user preferences. RESULTS: Automated preprocessing with APART rapidly identifies and removes false peaks from initial peak lists, reduces the burden of manual data entry, and documents and standardizes the peak filtering process. Successful preprocessing is demonstrated by the increased number of correct assignments obtained when data are submitted to an automated assignment program. AVAILABILITY: APART is available from http://sir.lanl.gov/NMR/APART.htm CONTACT: npawley@lanl.gov; rmichalczyk@lanl.gov SUPPLEMENTARY INFORMATION: Manual pages with installation instructions, procedures and screen shots can also be found at http://sir.lanl.gov/NMR/APART_Manual1.pdf.


Subject(s)
Algorithms , Magnetic Resonance Spectroscopy/methods , Pattern Recognition, Automated/methods , Proteins/analysis , Proteins/chemistry , Software , Models, Chemical , Models, Statistical , Protein Conformation , Signal Processing, Computer-Assisted , Stochastic Processes
9.
J Biomol NMR ; 24(3): 215-29, 2002 Nov.
Article in English | MEDLINE | ID: mdl-12522309

ABSTRACT

An accurate description of global tumbling of a protein is essential for correct analysis and interpretation of internal dynamics and thermodynamics. The accurate fitting of global tumbling parameters is affected by the number of experimental relaxation data points available for analysis, the distribution of data points over the domain of the function describing the tumbling, the measurement error associated with the data, the error associated with use of an approximate functional form, and errors in the protein structure. We present an analysis of the influence of these factors on the error in global tumbling parameters and the corresponding error in the calculated T(1)/T(2) values. We find that reduction of experimental and approximation error can compensate for a less-than-ideal quantity or distribution of data points, and that accurate parameters can be obtained for proteins with highly anisotropic distributions of bond vectors, as illustrated using the helical bundle protein G-CSF. This indicates that proteins with anisotropic distributions, such as the helical bundle class of proteins, should not summarily be excluded when selecting proteins for dynamic and thermodynamic analyses of (15)N backbone relaxation measurements.


Subject(s)
Nuclear Magnetic Resonance, Biomolecular/methods , Anisotropy , Computer Simulation/statistics & numerical data , Granulocyte Colony-Stimulating Factor/chemistry , Models, Chemical , Models, Molecular , Statistical Distributions , Thermodynamics
SELECTION OF CITATIONS
SEARCH DETAIL
...