Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 7 de 7
Filter
Add more filters










Database
Language
Publication year range
2.
Sci Rep ; 14(1): 13954, 2024 06 17.
Article in English | MEDLINE | ID: mdl-38886537

ABSTRACT

China, is characterized by its remarkable ethnical diversity, which necessitates whole genome variation data from multiple populations as crucial tools for advancing population genetics and precision medical research. However, there has been a scarcity of research concentrating on the whole genome of ethnic minority groups. To fill this gap, we developed the Guizhou Multi-ethnic Genome Database (GMGD). It comprises whole genome sequencing data from 476 healthy unrelated individuals spanning 11 ethnic minorities groups in Guizhou Province, Southwest China, including Bouyei, Dong, Miao, Yi, Bai, Gelo, Zhuang, Tujia, Yao, Hui, and Sui. The GMGD database comprises more than 16.33 million variants in GRCh38 and 16.20 million variants in GRCh37. Among these, approximately 11.9% (1,956,322) of the variants in GRCh38 and 18.5% (3,009,431) of the variants in GRCh37 are entirely new and do not exist in the dbSNP database. These novel variants shed light on the genetic diversity landscape across these populations, providing valuable insights with an average coverage of 5.5 ×. This makes GMGD the largest genome-wide database encompassing the most diverse ethnic groups to date. The GMGD interactive interface facilitates researchers with multi-dimensional mutation search methods and displays population frequency differences among global populations. Furthermore, GMGD is equipped with a genotype-imputation function, enabling enhanced capabilities for low-depth genomic research or targeted region capture studies. GMGD offers unique insights into the genomic variation landscape of different ethnic groups, which are freely accessible at https://db.cngb.org/pop/gmgd/ .


Subject(s)
Databases, Genetic , Ethnicity , Genome, Human , Humans , Ethnicity/genetics , China/ethnology , Genetics, Population/methods , Whole Genome Sequencing/methods , Genetic Variation , Minority Groups , Polymorphism, Single Nucleotide
3.
Yi Chuan ; 43(7): 665-679, 2021 Jul 20.
Article in English | MEDLINE | ID: mdl-34284982

ABSTRACT

Glioblastoma (GBM) is the most common primary intracranial tumor with extremely high malignancy and poor prognosis. In order to identify the GBM prognostic biomarkers and establish a prognostic model, we analyzed the expression profile data of GBM in The Cancer Genome Atlas (TCGA) database as the experimental group. First, we identified the differentially expressed genes of different survival periods among the GBM patients. The GISTIC software and Kaplan Meier (KM) survival curve were used to analyze the copy number variation of GBM to identify the survival-associated amplified gene (SAG). We selected the intersection genes of up-regulated ones in short survival group and SAG, performed univariate Cox regression and iterative Lasso regression with them to identify the important candidate genes and establish a prognostic model. Based on the model, the prognostic score was calculated. The patients were divided into high-risk and low-risk groups according to the median prognostic score. Meanwhile ROC curve was used to evaluate the validity of the model, applying the KM survival analysis of the high-risk and low-risk groups. Multivariate Cox regression analysis was used to determine the independence of the prognostic score. All the data were verified with three external datasets: GEO GSE16011, CGGA, and Rembrandt. The results showed that differential expression analysis of different survival periods of GBM identified 426 up-regulated genes and 65 down-regulated genes in the TCGA GBM dataset. The intersection of up-regulated genes in short survival group and SAG yielded 47 genes. After the screening, the six-gene combination (EN2,PPBP,LRRC61,SEL1L3,CPA4,DDIT4L) prognostic model was finally determined. The area under ROC curve of the model in TCGA experimental group and three external validation group were all greater than 0.6, even reaching 0.912. KM analysis showed that the prognosis of the high-risk and low-risk groups was significant different (P<0.05). In the multivariate Cox regression analysis, the six-gene prognostic score was an independent factor influencing the prognosis of GBM patients (P<0.05). In summary, this study established a prognostic model of six-gene (EN2,PPBP,LRRC61,SEL1L3,CPA4,DDIT4L) for GBM. This six-gene model has good predictive ability and could be used as an independent prognostic marker for GBM patients.


Subject(s)
Brain Neoplasms , Glioblastoma , Adaptor Proteins, Signal Transducing , Brain Neoplasms/genetics , DNA Copy Number Variations , Gene Expression Regulation, Neoplastic , Glioblastoma/genetics , Humans , Prognosis
4.
Nucleic Acids Res ; 47(D1): D989-D993, 2019 01 08.
Article in English | MEDLINE | ID: mdl-30321400

ABSTRACT

DNA methylation, the most intensively studied epigenetic modification, plays an important role in understanding the molecular basis of diseases. Furthermore, epigenome-wide association study (EWAS) provides a systematic approach to identify epigenetic variants underlying common diseases/phenotypes. However, there is no comprehensive database to archive the results of EWASs. To fill this gap, we developed the EWASdb, which is a part of 'The EWAS Project', to store the epigenetic association results of DNA methylation from EWASs. In its current version (v 1.0, up to July 2018), the EWASdb has curated 1319 EWASs associated with 302 diseases/phenotypes. There are three types of EWAS results curated in this database: (i) EWAS for single marker; (ii) EWAS for KEGG pathway and (iii) EWAS for GO (Gene Ontology) category. As the first comprehensive EWAS database, EWASdb has been searched or downloaded by researchers from 43 countries to date. We believe that EWASdb will become a valuable resource and significantly contribute to the epigenetic research of diseases/phenotypes and have potential clinical applications. EWASdb is freely available at http://www.ewas.org.cn/ewasdb or http://www.bioapp.org/ewasdb.


Subject(s)
DNA Methylation , Databases, Genetic , Epigenesis, Genetic , Epigenome , Disease/classification , Disease/genetics , Gene Ontology , Genetic Association Studies , Phenotype , User-Computer Interface
5.
Brief Bioinform ; 19(5): 811-820, 2018 09 28.
Article in English | MEDLINE | ID: mdl-28334239

ABSTRACT

The murine model serves as an important experimental system in biomedical science because of its high degree of similarities at the sequence level with human. Recent studies have compared the transcriptional landscapes between human and mouse, but the general co-expression landscapes have not been characterized. Here, we calculated the general co-expression coefficients and constructed the general co-expression maps for human and mouse. The differences and similarities of the general co-expression maps between the two species were compared in detail. The results showed low similarities in the human and mouse, with only about 36.54% of the co-expression relationships conserved between the two species. These results indicate that researchers should pay attention to these differences when performing research using the expression data of human and mouse. To facilitate use of this information, we also developed the human-mouse general co-expression difference database (coexpressMAP) to search differences in co-expression between human and mouse. This database is freely available at http://www.bioapp.org/coexpressMAP.


Subject(s)
Databases, Genetic , Gene Expression Profiling/methods , Animals , Computational Biology/methods , Databases, Genetic/statistics & numerical data , Gene Expression Profiling/statistics & numerical data , Humans , Mice , Species Specificity
6.
Brief Bioinform ; 19(1): 89-100, 2018 01 01.
Article in English | MEDLINE | ID: mdl-27760738

ABSTRACT

At present, understanding of DNA methylation at the population level is still limited. Here, we first extended the classical framework of population genetics, such as single nucleotide polymorphism allele frequency, linkage disequilibrium (LD), LD block and haplotype, to epigenetics. Then, as an example, we compared the DNA methylation disequilibrium (MD) maps between HapMap CEU (Caucasian residents of European ancestry from Utah) population and YRI (Yoruba people from Ibadan) population (lymphoblastoid cell lines). We analyzed the differences and similarities between CEU and YRI from the following aspects: SMP (single methylation polymorphism) allele frequency, SMP allele association, MD, MD block and methylation haplotype (meplotype) frequency. The results showed that CEU and YRI had similar distribution of SMP allele frequency, and shared many MD block region. We believe that the framework of population genetics can be used in the population epigenetics. The population epigenetic framework also has potential prospects in the study of complex diseases, such as epigenome-wide association study.


Subject(s)
Black People/genetics , Epigenesis, Genetic , Genetics, Population/methods , Polymorphism, Single Nucleotide , White People/genetics , Alleles , Cells, Cultured , DNA Methylation , HapMap Project , Haplotypes , Humans , Linkage Disequilibrium , Lymphocytes/cytology , Lymphocytes/metabolism
7.
Sci Rep ; 6: 37951, 2016 11 28.
Article in English | MEDLINE | ID: mdl-27892496

ABSTRACT

Similar to the SNP (single nucleotide polymorphism) data, there is non-random association of the DNA methylation level (we call it methylation disequilibrium, MD) between neighboring methylation loci. For the case-control study of complex diseases, it is important to identify the association between methylation levels combination types (we call it methylecomtype) and diseases/phenotypes. We extended the classical framework of SNP haplotype-based association study in population genetics to DNA methylation level data, and developed a software EWAS to identify the disease-related methylecomtypes. EWAS can provide the following basic functions: (1) calculating the DNA methylation disequilibrium coefficient between two CpG loci; (2) identifying the MD blocks across the whole genome; (3) carrying out case-control association study of methylecomtypes and identifying the disease-related methylecomtypes. For a DNA methylation level data set including 689 samples (354 cases and 335 controls) and 473864 CpG loci, it takes only about 25 min to complete the full scan. EWAS v1.0 can rapidly identify the association between combinations of methylation levels (methylecomtypes) and diseases. EWAS v1.0 is freely available at: http://www.ewas.org.cn or http://www.bioapp.org/ewas.


Subject(s)
DNA Methylation , Epigenesis, Genetic , Genome-Wide Association Study/methods , Software , Case-Control Studies , CpG Islands , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...