1.
Article in English | MEDLINE | ID: mdl-38862433

ABSTRACT

During the last decade, the generation and accumulation of petabase-scale high-throughput sequencing data have resulted in great challenges, including access to human data, as well as transfer, storage, and sharing of enormous amounts of data. To promote data-driven biological research, the Korean government announced that all biological data generated from government-funded research projects should be deposited at the Korea BioData Station (K-BDS), which consists of multiple databases for individual data types. Here, we introduce the Korean Nucleotide Archive (KoNA), a repository of nucleotide sequence data. As of July 2022, the Korean Read Archive in KoNA has collected over 477 TB of raw next-generation sequencing data from national genome projects. To ensure data quality and prepare for international alignment, a standard operating procedure was adopted, which is similar to that of the International Nucleotide Sequence Database Collaboration. The standard operating procedure includes quality control processes for submitted data and metadata using an automated pipeline, followed by manual examination. To ensure fast and stable data transfer, a high-speed transmission system called GBox is used in KoNA. Furthermore, the data uploaded to or downloaded from KoNA through GBox can be readily processed using a cloud computing service called Bio-Express. This seamless coupling of KoNA, GBox, and Bio-Express enhances the data experience, including submission, access, and analysis of raw nucleotide sequences. KoNA not only satisfies the unmet needs for a national sequence repository in Korea but also provides datasets to researchers globally and contributes to advances in genomics. The KoNA is available at https://www.kobic.re.kr/kona/.


Subject(s)
Databases, Nucleic Acid , Republic of Korea , Humans , High-Throughput Nucleotide Sequencing/methods
2.
BMB Rep ; 56(10): 563-568, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37574809

ABSTRACT

DNA methylation regulates gene expression and contributes to tumorigenesis in the early stages of cancer. In colorectal cancer (CRC), CpG island methylator phenotype (CIMP) is recognized as a distinct subset that is associated with specific molecular and clinical features. In this study, we investigated the genome-wide DNA methylation patterns among patients with CRC. The methylation data of 1 unmatched normal, 142 adjacent normal, and 294 tumor samples were analyzed. We identified 40,003 differentially methylated positions with 6,933 (79.8%) hypermethylated and 16,145 (51.6%) hypomethylated probes in the genic region. Hypermethylated probes were predominantly found in promoter-like regions, CpG islands, and N shore sites; hypomethylated probes were enriched in open-sea regions. CRC tumors were categorized into three CIMP subgroups, with 90 (30.6%) in the CIMP-high (CIMP-H), 115 (39.1%) in the CIMP-low (CIMP-L), and 89 (30.3%) in the non-CIMP group. The CIMP-H group was associated with microsatellite instability-high tumors, hypermethylation of MLH1, older age, and right-sided tumors. Our results showed that genome-wide methylation analyses classified patients with CRC into three subgroups according to CIMP levels, with clinical and molecular features consistent with previous data. [BMB Reports 2023; 56(10): 563-568].


Subject(s)
Colorectal Neoplasms , DNA Methylation , Humans , DNA Methylation/genetics , Colorectal Neoplasms/genetics , Colorectal Neoplasms/pathology , CpG Islands/genetics , Phenotype , Epigenesis, Genetic/genetics , Republic of Korea
3.
Genomics Inform ; 21(1): e12, 2023 Mar.
Article in English | MEDLINE | ID: mdl-37037470

ABSTRACT

A wave of new technologies has created opportunities for the cost-effective generation of high-throughput profiles of biological systems, foreshadowing a "data-driven science" era. The large variety of data available from biological research is also a rich resource that can be used for innovative endeavors. However, we are facing considerable challenges in big data deposition, integration, and translation due to the complexity of biological data and its production at unprecedented exponential rates. To address these problems, in 2020, the Korean government officially announced a national strategy to collect and manage the biological data produced through national R&D fund allocations and provide the collected data to researchers. To this end, the Korea Bioinformation Center (KOBIC) developed a new biological data repository, the Korea BioData Station (K-BDS), for sharing data from individual researchers and research programs to create a data-driven biological study environment. The K-BDS is dedicated to providing free open access to a suite of featured data resources in support of worldwide activities in both academia and industry.

5.
PLoS One ; 15(10): e0240191, 2020.
Article in English | MEDLINE | ID: mdl-33112870

ABSTRACT

Functional analyses of genes are crucial for unveiling biological responses, genetic engineering, and developing new medicines. However, functional analyses have largely been restricted to model organisms, representing a major hurdle for functional studies and industrial applications. To resolve this, comparative genome analyses can be used to provide clues to gene functions as well as their evolutionary history. To this end, we present Prometheus, a web-based omics portal that contains more than 17,215 sequences from prokaryotic and eukaryotic genomes. This portal supports interkingdom comparative analyses via a domain architecture-based gene identification system and Gene Search, and users can easily and rapidly identify single or entire gene sets in specific pathways. Bioinformatics tools for further analyses are provided in Prometheus or through Bio-Express, a cloud-based bioinformatics analysis platform. Prometheus is a new paradigm for comparative analyses of large amounts of genomic information.


Subject(s)
Genomics/methods , Software , Animals , Archaea/genetics , Bacteria/genetics , Fungi/genetics , Humans , Metabolomics/methods , Plants/genetics , Sequence Alignment/methods
6.
PLoS One ; 15(5): e0232029, 2020.
Article in English | MEDLINE | ID: mdl-32374732

ABSTRACT

BACKGROUND: Translationally controlled tumor protein (TCTP) is a conserved, multifunctional protein involved in numerous cellular processes in eukaryotes. Although the functions of TCTP have been investigated sporadically in animals, invertebrates, and plants, few lineage-specific activities of this molecule have been reported. An exception is in Arabidopsis thaliana, in which TCTP (AtTCTP1) functions in stomatal closure by regulating microtubule stability. Further, although the development of next-generation sequencing technologies has facilitated the analysis of many eukaryotic genomes in public databases, inter-kingdom comparative analyses using available genome information are comparatively scarce. METHODOLOGY: To carry out inter-kingdom comparative analysis of TCTP, TCTP genes were identified from 377 species. Then phylogenetic analysis, prediction of protein structure, molecular docking simulation and molecular dynamics analysis were performed to investigate the evolution of TCTP genes and their binding proteins. RESULTS: A total of 533 TCTP genes were identified from 377 eukaryotic species, including protozoa, fungi, invertebrates, vertebrates, and plants. Phylogenetic and secondary structure analyses reveal lineage-specific evolution of TCTP, and inter-kingdom comparisons highlight the lineage-specific emergence of, or changes in, secondary structure elements in TCTP proteins from different kingdoms. Furthermore, secondary structure comparisons between TCTP proteins within each kingdom, combined with measurements of the degree of sequence conservation, suggest that TCTP genes have evolved to conserve protein secondary structures in a lineage-specific manner. Additional tertiary structure analysis of TCTP-binding proteins and their interacting partners and docking simulations between these proteins further imply that TCTP gene variation may influence the tertiary structures of TCTP-binding proteins in a lineage-specific manner.
CONCLUSIONS: Our analysis suggests that TCTP has undergone lineage-specific evolution and that structural changes in TCTP proteins may correlate with the tertiary structure of TCTP-binding proteins and their binding partners in a lineage-specific manner.


Subject(s)
Biomarkers, Tumor/genetics , Biomarkers, Tumor/physiology , Evolution, Molecular , Genetic Speciation , Amino Acid Sequence , Animals , Biomarkers, Tumor/chemistry , Conserved Sequence , Eukaryotic Cells/classification , Eukaryotic Cells/metabolism , Fungi/classification , Fungi/genetics , Humans , Invertebrates/classification , Invertebrates/genetics , Mammals/classification , Mammals/genetics , Molecular Docking Simulation , Phylogeny , Plant Cells/classification , Plant Cells/metabolism , Prokaryotic Cells/classification , Prokaryotic Cells/metabolism , Protein Binding , Protein Structure, Secondary , Species Specificity , Tumor Protein, Translationally-Controlled 1
7.
Front Genet ; 11: 590924, 2020.
Article in English | MEDLINE | ID: mdl-33584793

ABSTRACT

Lennox-Gastaut syndrome (LGS) is a severe type of childhood-onset epilepsy characterized by multiple types of seizures, specific discharges on electroencephalography, and intellectual disability. Most patients with LGS do not respond well to drug treatment and show poor long-term prognosis. Approximately 30% of patients without brain abnormalities have unidentifiable causes. Therefore, accurate diagnosis and treatment of LGS remain challenging. To identify causative mutations of LGS, we analyzed the whole-exome sequencing data of 17 unrelated Korean families, including patients with LGS and LGS-like epilepsy without brain abnormalities, using the Genome Analysis Toolkit. We identified 14 mutations in 14 genes as causes of LGS or LGS-like epilepsy. Sixty-four percent of the identified genes were reported as LGS or epilepsy-related genes. Many of these variations were novel and considered pathogenic or likely pathogenic. Network analysis was performed to classify the identified genes into two network clusters: neuronal signal transmission or neuronal development. Additionally, knockdown of two candidate genes with insufficient evidence of neuronal functions, SLC25A39 and TBC1D8, decreased neurite outgrowth and the expression level of MAP2, a neuronal marker. These results expand the spectrum of genetic variations and may aid the diagnosis and management of individuals with LGS.

8.
Genet Test Mol Biomarkers ; 24(1): 54-58, 2020 Jan.
Article in English | MEDLINE | ID: mdl-31829726

ABSTRACT

Aim: Lennox-Gastaut syndrome (LGS) is a severe type of childhood-onset epilepsy with multiple types of seizures, specific discharges on electroencephalography, and intellectual disability. However, LGS-related genes are largely unknown. To identify causative genes related to LGS, we collected and analyzed data from a three-generation Korean family in which one member had LGS and two had intellectual disability. Methods: Genomic DNAs were extracted from blood samples of all participants and used in whole-exome sequencing (WES). Genetic variants were detected by the Genome Analysis Toolkit and confirmed by Sanger sequencing. Variant pathogenicity was evaluated by prediction programs and the American College of Medical Genetics criteria. The LGS patient had generalized slow spike-and-wave discharges, multiple types of seizures, and developmental delay. Results: Analyses of the WES data from the family revealed a novel variant (c.1048G>A, p.Ala350Thr) in the IQ motif and Sec7 domain 2 (IQSEC2). This variant is within a highly evolutionarily conserved IQ-like motif, indicating a decrease in the calmodulin-binding capacity or α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid transmission. The hemizygous variant in the male with LGS was a maternally inherited X-linked variant from the heterozygous maternal grandmother and mother, both of whom had intellectual disability. Conclusion: These findings indicate that the variant of IQSEC2 triggered both LGS and intellectual disability dependent on sex in this family. We report a novel X-linked inherited IQSEC2 variant for LGS and intellectual disability, which enhances the spectrum of variants in the IQ-like motif of IQSEC2.


Subject(s)
Guanine Nucleotide Exchange Factors/genetics , Intellectual Disability/genetics , Lennox Gastaut Syndrome/genetics , Adult , Child , Epilepsy/genetics , Family , Female , Genes, X-Linked/genetics , Guanine Nucleotide Exchange Factors/metabolism , Humans , Male , Pedigree , Republic of Korea , Exome Sequencing
9.
DNA Res ; 24(1): 71-80, 2017 Feb 01.
Article in English | MEDLINE | ID: mdl-28011721

ABSTRACT

Hibiscus syriacus (L.) (rose of Sharon) is one of the most widespread garden shrubs in the world. We report a draft of the H. syriacus genome comprised of a 1.75 Gb assembly that covers 92% of the genome with only 1.7% (33 Mb) gap sequences. Predicted gene modeling detected 87,603 genes, mostly supported by deep RNA sequencing data. To define gene family distribution among relatives of H. syriacus, orthologous gene sets containing 164,660 genes in 21,472 clusters were identified by OrthoMCL analysis of five plant species, including H. syriacus, Arabidopsis thaliana, Gossypium raimondii, Theobroma cacao and Amborella trichopoda. We inferred their evolutionary relationships based on divergence times among Malvaceae plant genes and found that gene families involved in flowering regulation and disease resistance were more highly divergent and expanded in H. syriacus than in its close relatives, G. raimondii (DD) and T. cacao. Clustered gene families and gene collinearity analysis revealed that two recent rounds of whole-genome duplication were followed by diploidization of the H. syriacus genome after speciation. Copy number variation and phylogenetic divergence indicate that WGDs and subsequent diploidization led to unequal duplication and deletion of flowering-related genes in H. syriacus and may affect its unique floral morphology.


Subject(s)
Flowers/growth & development , Genome, Plant , Hibiscus/genetics , Polyploidy , DNA-Binding Proteins/genetics , Hibiscus/physiology , Multigene Family , RNA-Binding Proteins/genetics , Transcriptome