Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
Article in English | MEDLINE | ID: mdl-36572336

ABSTRACT

Biological databases serve as a global fundamental infrastructure for the worldwide scientific community, which dramatically aid the transformation of big data into knowledge discovery and drive significant innovations in a wide range of research fields. Given the rapid data production, biological databases continue to increase in size and importance. To build a catalog of worldwide biological databases, therefore, we curate a total of 5825 biological databases from 8931 publications, which are geographically distributed in 72 countries/regions and developed by 1975 institutions (as of September 20, 2022). We further devise a z-index, a novel index to characterize the scientific impact of a database, and rank all these biological databases as well as their hosting institutions and countries in terms of citation and z-index. Consequently, we present a series of statistics and trends of worldwide biological databases, yielding a global perspective to better understand their status and impact for life and health sciences. An up-to-date catalog of worldwide biological databases as well as their curated meta-information and derived statistics is publicly available at Database Commons (https://ngdc.cncb.ac.cn/databasecommons/).

2.
Curr Protoc Bioinformatics ; 67(1): e82, 2019 09.
Article in English | MEDLINE | ID: mdl-31524988

ABSTRACT

In recent years, the number of human long noncoding RNAs (lncRNAs) that have been identified has increased exponentially. However, these lncRNAs are poorly annotated compared to protein-coding genes, posing great challenges for a better understanding of their functional significance and elucidating their complex functioning molecular mechanisms. Here we employ both community and expert curation to yield a comprehensive collection of human lncRNAs and their annotations. Specifically, LncRNAWiki (http://lncrna.big.ac.cn/index.php/Main_Page) uses a wiki-based community curation model, thus showing great promise in dealing with the flood of biological knowledge, while LncBook (http://bigd.big.ac.cn/lncbook) is an expert curation-based database that provides a complement to LncRNAWiki. LncBook features a comprehensive collection of human lncRNAs and a systematic curation of lncRNAs by multi-omics data integration, functional annotation, and disease association. These protocols provide step-by-step instructions on how to browse and search a specific lncRNA and how to obtain a range of related information including expression, methylation, variation, function, and disease association. © 2019 by John Wiley & Sons, Inc.


Subject(s)
Community-Based Participatory Research , Molecular Sequence Annotation/methods , Data Management , Databases, Nucleic Acid , Humans , RNA, Long Noncoding/genetics
3.
Bioinformatics ; 35(17): 2949-2956, 2019 09 01.
Article in English | MEDLINE | ID: mdl-30649200

ABSTRACT

MOTIVATION: The significance of long non-coding RNAs (lncRNAs) in many biological processes and diseases has gained intense interests over the past several years. However, computational identification of lncRNAs in a wide range of species remains challenging; it requires prior knowledge of well-established sequences and annotations or species-specific training data, but the reality is that only a limited number of species have high-quality sequences and annotations. RESULTS: Here we first characterize lncRNAs in contrast to protein-coding RNAs based on feature relationship and find that the feature relationship between open reading frame length and guanine-cytosine (GC) content presents universally substantial divergence in lncRNAs and protein-coding RNAs, as observed in a broad variety of species. Based on the feature relationship, accordingly, we further present LGC, a novel algorithm for identifying lncRNAs that is able to accurately distinguish lncRNAs from protein-coding RNAs in a cross-species manner without any prior knowledge. As validated on large-scale empirical datasets, comparative results show that LGC outperforms existing algorithms by achieving higher accuracy, well-balanced sensitivity and specificity, and is robustly effective (>90% accuracy) in discriminating lncRNAs from protein-coding RNAs across diverse species that range from plants to mammals. To our knowledge, this study, for the first time, differentially characterizes lncRNAs and protein-coding RNAs based on feature relationship, which is further applied in computational identification of lncRNAs. Taken together, our study represents a significant advance in characterization and identification of lncRNAs and LGC thus bears broad potential utility for computational analysis of lncRNAs in a wide range of species. AVAILABILITY AND IMPLEMENTATION: LGC web server is publicly available at http://bigd.big.ac.cn/lgc/calculator. The scripts and data can be downloaded at http://bigd.big.ac.cn/biocode/tools/BT000004. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Algorithms , Open Reading Frames , RNA, Long Noncoding , Animals , Mammals , Plants , Proteins
4.
Dev Growth Differ ; 55(8): 699-709, 2013 Oct.
Article in English | MEDLINE | ID: mdl-24102645

ABSTRACT

The zinc finger transcription factor Gli3 is an important mediator of Sonic hedgehog (Shh) signaling. During early embryonic development Gli3 participates in patterning and growth of the central nervous system, face, skeleton, limb, tooth and gut. Precise regulation of the temporal and spatial expression of Gli3 is crucial for the proper specification of these structures in mammals and other vertebrates. Previously we reported a set of human intronic cis-regulators controlling almost the entire known repertoire of endogenous Gli3 expression in mouse neural tube and limbs. However, the genetic underpinning of GLI3 expression in other embryonic domains such as craniofacial structures and internal organs remain elusive. Here we demonstrate in a transgenic mice assay the potential of a subset of human/fish conserved non-coding sequences (CNEs) residing within GLI3 intronic intervals to induce reporter gene expression at known regions of endogenous Gli3 transcription in embryonic domains other than central nervous system (CNS) and limbs. Highly specific reporter expression was observed in craniofacial structures, eye, gut, and genitourinary system. Moreover, the comparison of expression patterns directed by these intronic cis-acting regulatory elements in mouse and zebrafish embryos suggests that in accordance with sequence conservation, the target site specificity of a subset of these elements remains preserved among these two lineages. Taken together with our recent investigations, it is proposed here that during vertebrate evolution the Gli3 expression control acquired multiple, independently acting, intronic enhancers for spatiotemporal patterning of CNS, limbs, craniofacial structures and internal organs.


Subject(s)
Gene Expression Regulation, Developmental/physiology , Introns/physiology , Kruppel-Like Transcription Factors/biosynthesis , Nerve Tissue Proteins/biosynthesis , Organogenesis/physiology , Skull/embryology , Transcription, Genetic/physiology , Animals , Humans , Kruppel-Like Transcription Factors/genetics , Mice , Mice, Transgenic , Nerve Tissue Proteins/genetics , Neural Tube/embryology , Organ Specificity/physiology , Zinc Finger Protein Gli3
5.
BMC Dev Biol ; 10: 44, 2010 Apr 28.
Article in English | MEDLINE | ID: mdl-20426846

ABSTRACT

BACKGROUND: The zinc-finger transcription factor GLI3 is an important mediator of Sonic hedgehog signaling and crucial for patterning of many aspects of the vertebrate body plan. In vertebrates, the mechanism of SHH signal transduction and its action on target genes by means of activating or repressing forms of GLI3 have been studied most extensively during limb development and the specification of the central nervous system. From these studies it has emerged, that Gli3 expression must be subject to a tight spatiotemporal regulation. However, the genetic mechanisms and the cis-acting elements controlling the expression of Gli3 remained largely unknown. RESULTS: Here, we demonstrate in chicken and mouse transgenic embryos that human GLI3-intronic conserved non-coding sequence elements (CNEs) autonomously control individual aspects of Gli3 expression. Their combined action shows many aspects of a Gli3-specific pattern of transcriptional activity. In the mouse limb bud, different CNEs enhance Gli3-specific expression in evolutionary ancient stylopod and zeugopod versus modern skeletal structures of the autopod. Limb bud specificity is also found in chicken but had not been detected in zebrafish embryos. Three of these elements govern central nervous system specific gene expression during mouse embryogenesis, each targeting a subset of endogenous Gli3 transcription sites. Even though fish, birds, and mammals share an ancient repertoire of gene regulatory elements within Gli3, the functions of individual enhancers from this catalog have diverged significantly. During evolution, ancient broad-range regulatory elements within Gli3 attained higher specificity, critical for patterning of more specialized structures, by abolishing the potential for redundant expression control. CONCLUSION: These results not only demonstrate the high level of complexity in the genetic mechanisms controlling Gli3 expression, but also reveal the evolutionary significance of cis-acting regulatory networks of early developmental regulators in vertebrates.


Subject(s)
Central Nervous System/embryology , Enhancer Elements, Genetic , Extremities/embryology , Gene Expression Regulation, Developmental , Kruppel-Like Transcription Factors/genetics , Nerve Tissue Proteins/genetics , Animals , Animals, Genetically Modified , Chick Embryo , Humans , Mice , Mice, Transgenic , Zinc Finger Protein Gli3
SELECTION OF CITATIONS
SEARCH DETAIL
...