CoV-Seq, a New Tool for SARS-CoV-2 Genome Analysis and Visualization: Development and Usability Study.
Liu, Boxiang; Liu, Kaibo; Zhang, He; Zhang, Liang; Bian, Yuchen; Huang, Liang.
  • Liu B; Baidu Research, Sunnyvale, CA, United States.
  • Liu K; Baidu Research, Sunnyvale, CA, United States.
  • Zhang H; Baidu Research, Sunnyvale, CA, United States.
  • Zhang L; Baidu Research, Sunnyvale, CA, United States.
  • Bian Y; School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR, United States.
  • Huang L; Baidu Research, Sunnyvale, CA, United States.
J Med Internet Res; 22(10): e22299, 2020-10-02.
Article in English | MEDLINE | ID: covidwho-862642
ABSTRACT

BACKGROUND:

COVID-19 became a global pandemic not long after its identification in late 2019. The genomes of SARS-CoV-2 are being rapidly sequenced and shared on public repositories. To keep up with these updates, scientists need to frequently refresh and reclean data sets, which is an ad hoc and labor-intensive process. Further, scientists with limited bioinformatics or programming knowledge may find it difficult to analyze SARS-CoV-2 genomes.

OBJECTIVE:

To address these challenges, we developed CoV-Seq, an integrated web server that enables simple and rapid analysis of SARS-CoV-2 genomes.

METHODS:

CoV-Seq is implemented in Python and JavaScript. The web server and source code URLs are provided in this article.

RESULTS:

Given a new sequence, CoV-Seq automatically predicts gene boundaries and identifies genetic variants, which are displayed in an interactive genome visualizer and are downloadable for further analysis. A command-line interface is available for high-throughput processing. In addition, we aggregated all publicly available SARS-CoV-2 sequences from the Global Initiative on Sharing Avian Influenza Data (GISAID), National Center for Biotechnology Information (NCBI), European Nucleotide Archive (ENA), and China National GeneBank (CNGB), and extracted genetic variants from these sequences for download and downstream analysis. The CoV-Seq database is updated weekly.
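As a rough illustration of what "identifies genetic variants" can mean in practice, the sketch below compares a query genome against a reference, column by column, and reports each difference. It is not CoV-Seq's implementation, which this abstract does not describe. It assumes the two sequences have already been aligned to equal length with `-` as the gap character, and the function name `call_variants` and the tuple layout are hypothetical.

```python
from typing import List, Tuple

# Hypothetical record layout: (1-based reference position, ref allele, alt allele).
Variant = Tuple[int, str, str]


def call_variants(aligned_ref: str, aligned_query: str) -> List[Variant]:
    """Report per-column differences between two pre-aligned sequences.

    Assumes both strings come from the same pairwise alignment, so they
    have equal length and use '-' for gaps. Ambiguous query bases ('N')
    are skipped rather than reported as variants.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")

    variants: List[Variant] = []
    ref_pos = 0  # number of reference bases consumed so far
    for r, q in zip(aligned_ref.upper(), aligned_query.upper()):
        if r != "-":
            ref_pos += 1
        if r == q or q == "N":
            continue
        # Substitution: r and q are both bases.
        # Deletion: q is '-'. Insertion: r is '-', reported at the
        # position of the preceding reference base.
        variants.append((ref_pos, r, q))
    return variants


if __name__ == "__main__":
    for variant in call_variants("ATG-CA", "ACGTC-"):
        print(variant)
```

Each alignment column is reported separately, so a multi-base insertion or deletion appears as several adjacent records; a production tool would normally merge these and annotate them against gene boundaries.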

CONCLUSIONS:

We have developed CoV-Seq, an integrated web service for fast and easy analysis of custom SARS-CoV-2 sequences. The web server provides an interactive module for the analysis of custom sequences and a weekly updated database of genetic variants of all publicly accessible SARS-CoV-2 sequences. We believe CoV-Seq will help improve our understanding of the genetic underpinnings of COVID-19.

Full text: Available
Collection: International databases
Database: MEDLINE
Main subject: Pneumonia, Viral / Software / Genome, Viral / Coronavirus Infections / Databases, Genetic / Betacoronavirus / Data Visualization
Type of study: Observational study / Prognostic study
Topics: Variants
Limits: Humans
Language: English
Journal: J Med Internet Res
Journal subject: Medical Informatics
Year: 2020
Document Type: Article
Affiliation country: United States
