Long-read, whole-genome shotgun sequence data for five model organisms.

Kim, Kristi E; Peluso, Paul; Babayan, Primo; Yeadon, P Jane; Yu, Charles; Fisher, William W; Chin, Chen-Shan; Rapicavoli, Nicole A; Rank, David R; Li, Joachim; Catcheside, David E A; Celniker, Susan E; Phillippy, Adam M; Bergman, Casey M; Landolin, Jane M

Kim, Kristi E; Peluso, Paul; Babayan, Primo; Yeadon, P Jane; Yu, Charles; Fisher, William W; Chin, Chen-Shan; Rapicavoli, Nicole A; Rank, David R; Li, Joachim; Catcheside, David E A; Celniker, Susan E; Phillippy, Adam M; Bergman, Casey M; Landolin, Jane M.

Afiliación

Kim KE; Pacific Biosciences of California Inc. , 1380 Willow Road, Menlo Park, California 94025, USA.
Peluso P; Pacific Biosciences of California Inc. , 1380 Willow Road, Menlo Park, California 94025, USA.
Babayan P; Pacific Biosciences of California Inc. , 1380 Willow Road, Menlo Park, California 94025, USA.
Yeadon PJ; Flinders University, School of Biological Sciences , PO Box 2100, Adelaide, South Australia 5001, Australia.
Yu C; Department of Genome Dynamics, Lawrence Berkeley National Laboratory, 1 Cyclotron Road , Berkeley, California 94720, USA.
Fisher WW; Department of Genome Dynamics, Lawrence Berkeley National Laboratory, 1 Cyclotron Road , Berkeley, California 94720, USA.
Chin CS; Pacific Biosciences of California Inc. , 1380 Willow Road, Menlo Park, California 94025, USA.
Rapicavoli NA; Pacific Biosciences of California Inc. , 1380 Willow Road, Menlo Park, California 94025, USA.
Rank DR; Pacific Biosciences of California Inc. , 1380 Willow Road, Menlo Park, California 94025, USA.
Li J; Department of Microbiology and Immunology, UCSF , San Francisco, California 94158, USA.
Catcheside DE; Flinders University, School of Biological Sciences , PO Box 2100, Adelaide, South Australia 5001, Australia.
Celniker SE; Department of Genome Dynamics, Lawrence Berkeley National Laboratory, 1 Cyclotron Road , Berkeley, California 94720, USA.
Phillippy AM; National Biodefense Analysis and Countermeasures Center , 110 Thomas Johnson Drive, Frederick, Maryland 21702, USA.
Bergman CM; Faculty of Life Sciences, University of Manchester , Oxford Road, Manchester M13 9PT, UK.
Landolin JM; Pacific Biosciences of California Inc. , 1380 Willow Road, Menlo Park, California 94025, USA.

Sci Data ; 1: 140045, 2014.

Article en En | MEDLINE | ID: mdl-25977796

RESUMEN

Single molecule, real-time (SMRT) sequencing from Pacific Biosciences is increasingly used in many areas of biological research including de novo genome assembly, structural-variant identification, haplotype phasing, mRNA isoform discovery, and base-modification analyses. High-quality, public datasets of SMRT sequences can spur development of analytic tools that can accommodate unique characteristics of SMRT data (long read lengths, lack of GC or amplification bias, and a random error profile leading to high consensus accuracy). In this paper, we describe eight high-coverage SMRT sequence datasets from five organisms (Escherichia coli, Saccharomyces cerevisiae, Neurospora crassa, Arabidopsis thaliana, and Drosophila melanogaster) that have been publicly released to the general scientific community (NCBI Sequence Read Archive ID SRP040522). Data were generated using two sequencing chemistries (P4C2 and P5C3) on the PacBio RS II instrument. The datasets reported here can be used without restriction by the research community to generate whole-genome assemblies, test new algorithms, investigate genome structure and evolution, and identify base modifications in some of the most widely-studied model systems in biological research.

Asunto(s)

Arabidopsis/genética; Drosophila melanogaster/genética; Escherichia coli/genética; Genoma Bacteriano; Genoma Fúngico; Genoma de los Insectos; Genoma de Planta; Neurospora crassa/genética; Saccharomyces cerevisiae/genética; Análisis de Secuencia de ADN; Animales; Modelos Animales

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Saccharomyces cerevisiae / Genoma Fúngico / Genoma Bacteriano / Análisis de Secuencia de ADN / Arabidopsis / Genoma de Planta / Drosophila melanogaster / Escherichia coli / Genoma de los Insectos / Neurospora crassa Tipo de estudio: Prognostic_studies Límite: Animals Idioma: En Revista: Sci Data Año: 2014 Tipo del documento: Article País de afiliación: Estados Unidos Pais de publicación: Reino Unido

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google