Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
Add more filters










Database
Language
Publication year range
1.
BMC Bioinformatics ; 24(1): 221, 2023 May 31.
Article in English | MEDLINE | ID: mdl-37259021

ABSTRACT

BACKGROUND: As genome sequencing becomes better integrated into scientific research, government policy, and personalized medicine, the primary challenge for researchers is shifting from generating raw data to analyzing these vast datasets. Although much work has been done to reduce compute times using various configurations of traditional CPU computing infrastructures, Graphics Processing Units (GPUs) offer opportunities to accelerate genomic workflows by orders of magnitude. Here we benchmark one GPU-accelerated software suite called NVIDIA Parabricks on Amazon Web Services (AWS), Google Cloud Platform (GCP), and an NVIDIA DGX cluster. We benchmarked six variant calling pipelines, including two germline callers (HaplotypeCaller and DeepVariant) and four somatic callers (Mutect2, Muse, LoFreq, SomaticSniper). RESULTS: We achieved up to 65 × acceleration with germline variant callers, bringing HaplotypeCaller runtimes down from 36 h to 33 min on AWS, 35 min on GCP, and 24 min on the NVIDIA DGX. Somatic callers exhibited more variation between the number of GPUs and computing platforms. On cloud platforms, GPU-accelerated germline callers resulted in cost savings compared with CPU runs, whereas some somatic callers were more expensive than CPU runs because their GPU acceleration was not sufficient to overcome the increased GPU cost. CONCLUSIONS: Germline variant callers scaled well with the number of GPUs across platforms, whereas somatic variant callers exhibited more variation in the number of GPUs with the fastest runtimes, suggesting that, at least with the version of Parabricks used here, these workflows are less GPU optimized and require benchmarking on the platform of choice before being deployed at production scales. Our study demonstrates that GPUs can be used to greatly accelerate genomic workflows, thus bringing closer to grasp urgent societal advances in the areas of biosurveillance and personalized medicine.


Subject(s)
Computer Graphics , Software , Workflow , Genomics
4.
Cell Syst ; 9(5): 417-421, 2019 11 27.
Article in English | MEDLINE | ID: mdl-31677972

ABSTRACT

As more digital resources are produced by the research community, it is becoming increasingly important to harmonize and organize them for synergistic utilization. The findable, accessible, interoperable, and reusable (FAIR) guiding principles have prompted many stakeholders to consider strategies for tackling this challenge. The FAIRshake toolkit was developed to enable the establishment of community-driven FAIR metrics and rubrics paired with manual and automated FAIR assessments. FAIR assessments are visualized as an insignia that can be embedded within digital-resources-hosting websites. Using FAIRshake, a variety of biomedical digital resources were manually and automatically evaluated for their level of FAIRness.


Subject(s)
Information Dissemination/methods , Internet/trends , Online Systems/standards , Health Resources/standards , Humans
5.
PLoS Biol ; 15(4): e2001818, 2017 04.
Article in English | MEDLINE | ID: mdl-28388615

ABSTRACT

The thesis presented here is that biomedical research is based on the trusted exchange of services. That exchange would be conducted more efficiently if the trusted software platforms to exchange those services, if they exist, were more integrated. While simpler and narrower in scope than the services governing biomedical research, comparison to existing internet-based platforms, like Airbnb, can be informative. We illustrate how the analogy to internet-based platforms works and does not work and introduce The Commons, under active development at the National Institutes of Health (NIH) and elsewhere, as an example of the move towards platforms for research.


Subject(s)
Biomedical Research/standards , Database Management Systems/standards , Information Dissemination/methods , Program Evaluation/standards , Social Change , Trust , Animals , Biomedical Research/trends , Communication Barriers , Database Management Systems/trends , Efficiency , Humans , Internet , National Institutes of Health (U.S.) , Periodicals as Topic/standards , Periodicals as Topic/trends , Program Evaluation/trends , Research Support as Topic/trends , Scientific Misconduct , Software , Technology Transfer , United States , Workforce
7.
Nat Biotechnol ; 29(5): 415-20, 2011 May.
Article in English | MEDLINE | ID: mdl-21552244

ABSTRACT

Here we present a standard developed by the Genomic Standards Consortium (GSC) for reporting marker gene sequences--the minimum information about a marker gene sequence (MIMARKS). We also introduce a system for describing the environment from which a biological sample originates. The 'environmental packages' apply to any genome sequence of known origin and can be used in combination with MIMARKS and other GSC checklists. Finally, to establish a unified standard for describing sequence data and to provide a single point of entry for the scientific community to access and learn about GSC checklists, we present the minimum information about any (x) sequence (MIxS). Adoption of MIxS will enhance our ability to analyze natural genetic diversity documented by massive DNA sequencing efforts from myriad ecosystems in our ever-changing biosphere.


Subject(s)
Biomarkers , Environment , Metagenomics/standards , Sequence Analysis, DNA/standards , Checklist , Databases, Genetic , Genes, rRNA , Genetic Variation , Humans , Information Storage and Retrieval/standards , Internet , Programming Languages , Software
8.
Genome Res ; 19(12): 2317-23, 2009 Dec.
Article in English | MEDLINE | ID: mdl-19819907

ABSTRACT

The Human Microbiome Project (HMP), funded as an initiative of the NIH Roadmap for Biomedical Research (http://nihroadmap.nih.gov), is a multi-component community resource. The goals of the HMP are: (1) to take advantage of new, high-throughput technologies to characterize the human microbiome more fully by studying samples from multiple body sites from each of at least 250 "normal" volunteers; (2) to determine whether there are associations between changes in the microbiome and health/disease by studying several different medical conditions; and (3) to provide both a standardized data resource and new technological approaches to enable such studies to be undertaken broadly in the scientific community. The ethical, legal, and social implications of such research are being systematically studied as well. The ultimate objective of the HMP is to demonstrate that there are opportunities to improve human health through monitoring or manipulation of the human microbiome. The history and implementation of this new program are described here.


Subject(s)
Bacteria , Gastrointestinal Tract/microbiology , Metagenome/genetics , Mouth/microbiology , National Institutes of Health (U.S.) , Skin/microbiology , Vagina/microbiology , Bacteria/classification , Bacteria/genetics , Bacteria/isolation & purification , Female , Humans , National Health Programs , RNA, Ribosomal, 16S/genetics , Sequence Analysis, DNA , United States
9.
Nucleic Acids Res ; 30(1): 129-36, 2002 Jan 01.
Article in English | MEDLINE | ID: mdl-11752274

ABSTRACT

The Celera Discovery System (CDS) is a web-accessible research workbench for mining genomic and related biological information. Users have access to the human and mouse genome sequences with annotation presented in summary form in BioMolecule Reports for genes, transcripts and proteins. Over 40 additional databases are available, including sequence, mapping, mutation, genetic variation, mRNA expression, protein structure, motif and classification data. Data are accessible by browsing reports, through a variety of interactive graphical viewers, and by advanced query capability provided by the LION SRS search engine. A growing number of sequence analysis tools are available, including sequence similarity, pattern searching, multiple sequence alignment and Hidden Markov Model search. A user workspace keeps track of queries and analyses. CDS is widely used by the academic research community and requires a subscription for access. The system and academic pricing information are available at http://cds.celera.com.


Subject(s)
Databases, Genetic , Genome , Animals , Chromosome Mapping , Chromosomes, Human , Computer Graphics , Conserved Sequence , Forecasting , Genetic Variation , Humans , Information Storage and Retrieval , Internet , Markov Chains , Mice , Mutation , Proteins/classification , Proteins/genetics , Proteins/physiology , RNA, Messenger/genetics , Sequence Alignment , Sequence Homology , Transcription, Genetic
SELECTION OF CITATIONS
SEARCH DETAIL
...