Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 13 de 13
Filter
Add more filters










Publication year range
1.
Article in English | MEDLINE | ID: mdl-38862433

ABSTRACT

During the last decade, the generation and accumulation of petabase-scale high-throughput sequencing data have resulted in great challenges, including access to human data, as well as transfer, storage, and sharing of enormous amounts of data. To promote data-driven biological research, the Korean government announced that all biological data generated from government-funded research projects should be deposited at the Korea BioData Station (K-BDS), which consists of multiple databases for individual data types. Here, we introduce the Korean Nucleotide Archive (KoNA), a repository of nucleotide sequence data. As of July 2022, the Korean Read Archive in KoNA has collected over 477 TB of raw next-generation sequencing data from national genome projects. To ensure data quality and prepare for international alignment, a standard operating procedure was adopted, which is similar to that of the International Nucleotide Sequence Database Collaboration. The standard operating procedure includes quality control processes for submitted data and metadata using an automated pipeline, followed by manual examination. To ensure fast and stable data transfer, a high-speed transmission system called GBox is used in KoNA. Furthermore, the data uploaded to or downloaded from KoNA through GBox can be readily processed using a cloud computing service called Bio-Express. This seamless coupling of KoNA, GBox, and Bio-Express enhances the data experience, including submission, access, and analysis of raw nucleotide sequences. KoNA not only satisfies the unmet needs for a national sequence repository in Korea but also provides datasets to researchers globally and contributes to advances in genomics. The KoNA is available at https://www.kobic.re.kr/kona/.


Subject(s)
Databases, Nucleic Acid , Republic of Korea , Humans , High-Throughput Nucleotide Sequencing/methods
2.
PLoS One ; 15(10): e0240191, 2020.
Article in English | MEDLINE | ID: mdl-33112870

ABSTRACT

Functional analyses of genes are crucial for unveiling biological responses, genetic engineering, and developing new medicines. However, functional analyses have largely been restricted to model organisms, representing a major hurdle for functional studies and industrial applications. To resolve this, comparative genome analyses can be used to provide clues to gene functions as well as their evolutionary history. To this end, we present Prometheus, a web-based omics portal that contains more than 17,215 sequences from prokaryotic and eukaryotic genomes. This portal supports interkingdom comparative analyses via a domain architecture-based gene identification system and Gene Search, and users can easily and rapidly identify single or entire gene sets in specific pathways. Bioinformatics tools for further analyses are provided in Prometheus or through Bio-Express, a cloud-based bioinformatics analysis platform. Prometheus is a new paradigm for comparative analyses of large amounts of genomic information.


Subject(s)
Genomics/methods , Software , Animals , Archaea/genetics , Bacteria/genetics , Fungi/genetics , Humans , Metabolomics/methods , Plants/genetics , Sequence Alignment/methods
3.
PLoS One ; 15(5): e0232029, 2020.
Article in English | MEDLINE | ID: mdl-32374732

ABSTRACT

BACKGROUND: Translationally controlled tumor protein (TCTP) is a conserved, multifunctional protein involved in numerous cellular processes in eukaryotes. Although the functions of TCTP have been investigated sporadically in animals, invertebrates, and plants, few lineage-specific activities of this molecule, have been reported. An exception is in Arabidopsis thaliana, in which TCTP (AtTCTP1) functions in stomatal closuer by regulating microtubule stability. Further, although the development of next-generation sequencing technologies has facilitated the analysis of many eukaryotic genomes in public databases, inter-kingdom comparative analyses using available genome information are comparatively scarce. METHODOLOGY: To carry out inter-kingdom comparative analysis of TCTP, TCTP genes were identified from 377 species. Then phylogenetic analysis, prediction of protein structure, molecular docking simulation and molecular dynamics analysis were performed to investigate the evolution of TCTP genes and their binding proteins. RESULTS: A total of 533 TCTP genes were identified from 377 eukaryotic species, including protozoa, fungi, invertebrates, vertebrates, and plants. Phylogenetic and secondary structure analyses reveal lineage-specific evolution of TCTP, and inter-kingdom comparisons highlight the lineage-specific emergence of, or changes in, secondary structure elements in TCTP proteins from different kingdoms. Furthermore, secondary structure comparisons between TCTP proteins within each kingdom, combined with measurements of the degree of sequence conservation, suggest that TCTP genes have evolved to conserve protein secondary structures in a lineage-specific manner. Additional tertiary structure analysis of TCTP-binding proteins and their interacting partners and docking simulations between these proteins further imply that TCTP gene variation may influence the tertiary structures of TCTP-binding proteins in a lineage-specific manner. CONCLUSIONS: Our analysis suggests that TCTP has undergone lineage-specific evolution and that structural changes in TCTP proteins may correlate with the tertiary structure of TCTP-binding proteins and their binding partners in a lineage-specific manner.


Subject(s)
Biomarkers, Tumor/genetics , Biomarkers, Tumor/physiology , Evolution, Molecular , Genetic Speciation , Amino Acid Sequence , Animals , Biomarkers, Tumor/chemistry , Conserved Sequence , Eukaryotic Cells/classification , Eukaryotic Cells/metabolism , Fungi/classification , Fungi/genetics , Humans , Invertebrates/classification , Invertebrates/genetics , Mammals/classification , Mammals/genetics , Molecular Docking Simulation , Phylogeny , Plant Cells/classification , Plant Cells/metabolism , Prokaryotic Cells/classification , Prokaryotic Cells/metabolism , Protein Binding , Protein Structure, Secondary , Species Specificity , Tumor Protein, Translationally-Controlled 1
4.
PLoS One ; 15(2): e0229416, 2020.
Article in English | MEDLINE | ID: mdl-32084241

ABSTRACT

Acinetobacter baumannii is a highly potent nosocomial pathogen that is associated with increased in-hospital mortality. Here, we investigated the changes in molecular characteristics of carbapenem-resistant A. baumannii (CRAB) isolated from the blood samples of patients admitted to a tertiary hospital in South Korea from January 2009 to July 2015. Whole genome sequencing using the Illumina MiSeq platform and multi-locus sequence typing (MLST) were performed for 98 CRAB clinical isolates. In silico analyses for the prediction of antimicrobial resistance and virulence factor genes were performed. Plasmid sequences, including complete forms, were reconstructed from the sequence reads. Epidemiologic data were collected from the hospital database. MLST using the Oxford scheme revealed 10 sequence types of CRAB, of which ST191 was the dominant type (n = 59). Although blaOXA-23 was shared by most analysed strains, the compositions of antimicrobial resistance determinants differed among sequence types. ST447 and ST451/ST1809 with a few resistance genes were isolated during the later years of the study period. The number of virulence genes increased, while that of ST191 did not change significantly over the investigation period. Intriguingly MLST types, compositions of antimicrobial resistance genes, and virulence genes had no association with clinical outcomes of CRAB bacteraemia. In conclusion, active changes in or accumulations of antimicrobial resistance determinants and virulence genes in CRAB were not observed during the research period. Molecular characteristics of CRAB had no association with clinical outcomes of CRAB bacteraemia.


Subject(s)
Acinetobacter Infections/microbiology , Acinetobacter baumannii/genetics , Bacterial Proteins/genetics , Carbapenems/pharmacology , Cross Infection/microbiology , Drug Resistance, Bacterial/genetics , Genomics/methods , Acinetobacter Infections/epidemiology , Acinetobacter baumannii/drug effects , Acinetobacter baumannii/isolation & purification , Cross Infection/epidemiology , Female , Humans , Male , Middle Aged , Multilocus Sequence Typing , Republic of Korea/epidemiology , Virulence Factors/genetics
5.
Sci Data ; 7(1): 17, 2020 01 13.
Article in English | MEDLINE | ID: mdl-31932587

ABSTRACT

Peppers (Capsicum annuum L.), belonging to the Solanaceae family, are one of the most economically important crops globally. Like other crops, peppers are threatened by diverse environmental conditions due to different pathogens and abiotic stresses. High-quality reference genomes with massive datasets of transcriptomes from various conditions can provide clues to preferred agronomic traits for breeding. However, few global gene expression profiling datasets have been published to examine the environmental stress-resistant mechanisms in peppers. In this study, we report the RNA-seq analyses of peppers treated with heat, cold, salinity, and osmotic stress at six different time points. RNA-seq libraries from 78 RNA samples containing three biological replicates per time point for each of the abiotic stresses and a mock control were constructed. A total of 204.68 Gb of transcriptome data were verified by differentially expressed genes and gene ontology enrichment analysis. Analyses of the transcriptome data in this study will provide useful information for basic studies of various stimuli to facilitate the development of stress-resistant pepper cultivars.


Subject(s)
Capsicum/genetics , Gene Expression Regulation, Plant , Stress, Physiological , Transcriptome , Gene Expression Profiling , Hot Temperature , Osmotic Pressure , RNA-Seq , Salinity
6.
Sci Data ; 6(1): 220, 2019 10 22.
Article in English | MEDLINE | ID: mdl-31641135

ABSTRACT

Oriental melon (Cucumis melo L. var. makuwa) is one of the most important cultivated cucurbits, and is grown widely in Northeast Asian countries. With increasing interest in its biological properties and economic importance, oriental melon has become an attractive model crop for studying various horticultural traits. A previous genome sequence of the melon was constructed from a homozygous double-haploid line. Thus, individual reference genomes are required to perform functional studies and further breeding applications. Here, we report draft genome sequences of two oriental melons, Chang Bougi and SW3. The assembled 344 Mb genome of Chang Bougi was obtained with scaffold N50 1.0 Mb, and 36,235 genes were annotated. The 354 Mb genome of SW3 was assembled with scaffold N50 1.6 Mb, and has 38,173 genes. These newly constructed genomes will enable studies of fruit development, disease resistance, and breeding applications in the oriental melon.


Subject(s)
Cucumis melo/genetics , Genome, Plant , Chromosome Mapping
7.
Front Microbiol ; 8: 585, 2017.
Article in English | MEDLINE | ID: mdl-28421066

ABSTRACT

There have been extensive genome sequencing studies for Escherichia coli strains, particularly for pathogenic isolates, because fast determination of pathogenic potential and/or drug resistance and their propagation routes is crucial. For laboratory E. coli strains, however, genome sequence information is limited except for several well-known strains. We determined the complete genome sequence of laboratory E. coli strain RR1 (HB101 RecA+), which has long been used as a general cloning host. A hybrid genome sequence of K-12 MG1655 and B BL21(DE3) was constructed based on the initial mapping of Illumina HiSeq reads to each reference, and iterative rounds of read mapping, variant detection, and consensus extraction were carried out. Finally, PCR and Sanger sequencing-based finishing were applied to resolve non-single nucleotide variant regions with aberrant read depths and breakpoints, most of them resulting from prophages and insertion sequence transpositions that are not present in the reference genome sequence. We found that 96.9% of the RR1 genome is derived from K-12, and identified exact crossover junctions between K-12 and B genomic fragments. However, because RR1 has experienced a series of genetic manipulations since branching from the common ancestor, it has a set of mutations different from those found in K-12 MG1655. As well as identifying all known genotypes of RR1 on the basis of genomic context, we found novel mutations. Our results extend current knowledge of the genotype of RR1 and its relatives, and provide insights into the pedigree, genomic background, and physiology of common laboratory strains.

8.
Genome Announc ; 3(3)2015 May 21.
Article in English | MEDLINE | ID: mdl-25999562

ABSTRACT

Bacillus subtilis ATCC 6051a (=KCTC 1028), which is less domesticated than strain 168, is widely used for the secretory expression of industrial enzymes. Herein, we present the complete genome sequence of the Bacillus subtilis strain ATCC 6051a.

9.
BMC Genomics ; 15: 286, 2014 Apr 16.
Article in English | MEDLINE | ID: mdl-24735464

ABSTRACT

BACKGROUND: Chorella is the representative taxon of Chlorellales in Trebouxiophyceae, and its chloroplast (cp) genomic information has been thought to depend only on studies concerning Chlorella vulgaris and GenBank information of C. variablis. Mitochondrial (mt) genomic information regarding Chlorella is currently unavailable. To elucidate the evolution of organelle genomes and genetic information of Chlorella, we have sequenced and characterized the cp and mt genomes of Arctic Chlorella sp. ArM0029B. RESULTS: The 119,989-bp cp genome lacking inverted repeats and 65,049-bp mt genome were sequenced. The ArM0029B cp genome contains 114 conserved genes, including 32 tRNA genes, 3 rRNA genes, and 79 genes encoding proteins. Chlorella cp genomes are highly rearranged except for a Chlorella-specific six-gene cluster, and the ArM0029B plastid resembles that of Chlorella variabilis except for a 15-kb gene cluster inversion. In the mt genome, 62 conserved genes, including 27 tRNA genes, 3 rRNA genes, and 32 genes encoding proteins were determined. The mt genome of ArM0029B is similar to that of the non-photosynthetic species Prototheca and Heicosporidium. The ArM0029B mt genome contains a group I intron, with an ORF containing two LAGLIDADG motifs, in cox1. The intronic ORF is shared by C. vulgaris and Prototheca. The phylogeny of the plastid genome reveals that ArM0029B showed a close relationship of Chlorella to Parachlorella and Oocystis within Chlorellales. The distribution of the cox1 intron at 721 support membership in the order Chlorellales. Mitochondrial phylogenomic analyses, however, indicated that ArM0029B shows a greater affinity to MX-AZ01 and Coccomyxa than to the Helicosporidium-Prototheca clade, although the detailed phylogenetic relationships among the three taxa remain to be resolved. CONCLUSIONS: The plastid genome of ArM0029B is similar to that of C. variabilis. The mt sequence of ArM0029B is the first genome to be reported for Chlorella. Chloroplast genome phylogeny supports monophyly of the seven investigated members of Chlorellales. The presence of the cox1 intron at 721 in all four investigated Chlorellales taxa indicates that the cox1 intron had been introduced in early Chorellales as a cis-splice form and that the cis-splicing intron was inherited to recent Chlorellales and was recently trans-spliced in Helicosporidium.


Subject(s)
Chlorella/genetics , Genome, Mitochondrial , Genome, Plastid , Genomics , Amino Acid Motifs , Amino Acid Sequence , Base Sequence , Chlorella/classification , Evolution, Molecular , Gene Order , Gene Rearrangement , Genes, Plant , Genome, Chloroplast , Introns , Molecular Sequence Data , Multigene Family , Nucleic Acid Conformation , Phylogeny , Sequence Alignment
10.
J Microbiol Biotechnol ; 24(4): 563-7, 2014 Apr.
Article in English | MEDLINE | ID: mdl-24444998

ABSTRACT

Pseudomonas syringae pv. syringae (Psy) is a major bacterial pathogen of many economically important plant species. Despite the severity of its impact, the genome sequence of the type strain has not been reported. Here, we present the draft genome sequence of Psy ATCC 19310. Comparative genomic analysis revealed that Psy ATCC 19310 is closely related to Psy B728a. However, only a few type III effectors, which are key virulence factors, are shared by the two strains, indicating the possibility of host-pathogen specificity and genome dynamics, even under the pathovar level.


Subject(s)
DNA, Bacterial/chemistry , DNA, Bacterial/genetics , Genome, Bacterial , Pseudomonas syringae/genetics , Sequence Analysis, DNA , Computational Biology , Molecular Sequence Data , Pseudomonas syringae/isolation & purification
11.
Genome Announc ; 1(4)2013 Aug 15.
Article in English | MEDLINE | ID: mdl-23950128

ABSTRACT

Lysinibacillus sphaericus is a heterogeneous species that includes strains that produce mosquitocidal toxin proteins. Herein, we report the 4.56-Mb draft genome sequence of the nonpathogenic L. sphaericus strain KCTC 3346(T), which provides clues for the phylogenetic reassessment of L. sphaericus species and an understanding of its physiological properties.

12.
Genome Announc ; 1(4)2013 Aug 01.
Article in English | MEDLINE | ID: mdl-23908288

ABSTRACT

Strains of Amycolatopsis orientalis produce vancomycin or other related glycopeptide antibiotic compounds. Here we report the draft genome sequences of glycopeptide nonproducers Amycolatopsis orientalis subsp. orientalis DSM 43388 and DSM 46075. Their genome information will provide insights into the acquisition and regulation of glycopeptide antibiotic resistance genes.

13.
Genome Announc ; 1(3)2013 Jun 27.
Article in English | MEDLINE | ID: mdl-23814036

ABSTRACT

Amycolatopsis orientalis is the producer of vancomycin, a glycopeptide antibiotic that is used for the treatment of serious infections with Gram-positive bacteria. Here we present the next-generation sequencing (NGS)-based 9.06-Mb draft genome sequence of the type strain Amycolatopsis orientalis subsp. orientalis KCTC 9412 (DSM 40040; ATCC 19795).

SELECTION OF CITATIONS
SEARCH DETAIL
...