Your browser doesn't support javascript.
BIGwas: Single-command quality control and association testing for multi-cohort and biobank-scale GWAS/PheWAS data.
Kässens, Jan Christian; Wienbrandt, Lars; Ellinghaus, David.
  • Kässens JC; Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Rosalind-Franklin-Str. 12, 24105 Kiel, Germany.
  • Wienbrandt L; Haematology Lab Kiel, Klinik für Innere Medizin II, University Hospital Schleswig-Holstein, Langer Segen 8-10, 24105 Kiel, Germany.
  • Ellinghaus D; Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Rosalind-Franklin-Str. 12, 24105 Kiel, Germany.
Gigascience ; 10(6)2021 06 29.
Artículo en Inglés | MEDLINE | ID: covidwho-2161022
ABSTRACT

BACKGROUND:

Genome-wide association studies (GWAS) and phenome-wide association studies (PheWAS) involving 1 million GWAS samples from dozens of population-based biobanks present a considerable computational challenge and are carried out by large scientific groups under great expenditure of time and personnel. Automating these processes requires highly efficient and scalable methods and software, but so far there is no workflow solution to easily process 1 million GWAS samples.

RESULTS:

Here we present BIGwas, a portable, fully automated quality control and association testing pipeline for large-scale binary and quantitative trait GWAS data provided by biobank resources. By using Nextflow workflow and Singularity software container technology, BIGwas performs resource-efficient and reproducible analyses on a local computer or any high-performance compute (HPC) system with just 1 command, with no need to manually install a software execution environment or various software packages. For a single-command GWAS analysis with 974,818 individuals and 92 million genetic markers, BIGwas takes ∼16 days on a small HPC system with only 7 compute nodes to perform a complete GWAS QC and association analysis protocol. Our dynamic parallelization approach enables shorter runtimes for large HPCs.

CONCLUSIONS:

Researchers without extensive bioinformatics knowledge and with few computer resources can use BIGwas to perform multi-cohort GWAS with 1 million GWAS samples and, if desired, use it to build their own (genome-wide) PheWAS resource. BIGwas is freely available for download from http//github.com/ikmb/gwas-qc and http//github.com/ikmb/gwas-assoc.
Asunto(s)
Palabras clave

Texto completo: Disponible Colección: Bases de datos internacionales Base de datos: MEDLINE Asunto principal: Bancos de Muestras Biológicas / Estudio de Asociación del Genoma Completo Tipo de estudio: Estudio de cohorte / Estudio experimental / Estudio observacional / Estudio pronóstico / Ensayo controlado aleatorizado Límite: Humanos Idioma: Inglés Año: 2021 Tipo del documento: Artículo País de afiliación: Gigascience

Similares

MEDLINE

...
LILACS

LIS


Texto completo: Disponible Colección: Bases de datos internacionales Base de datos: MEDLINE Asunto principal: Bancos de Muestras Biológicas / Estudio de Asociación del Genoma Completo Tipo de estudio: Estudio de cohorte / Estudio experimental / Estudio observacional / Estudio pronóstico / Ensayo controlado aleatorizado Límite: Humanos Idioma: Inglés Año: 2021 Tipo del documento: Artículo País de afiliación: Gigascience