Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 4 de 4
Filter
Add more filters










Database
Language
Publication year range
1.
Bioinformatics ; 39(1)2023 01 01.
Article in English | MEDLINE | ID: mdl-36571493

ABSTRACT

MOTIVATION: Recent experimental evidence has shown that some long non-coding RNAs (lncRNAs) contain small open reading frames (sORFs) that are translated into functional micropeptides, suggesting that these lncRNAs are misannotated as non-coding. Current methods to detect misannotated lncRNAs rely on ribosome-profiling (Ribo-Seq) and mass-spectrometry experiments, which are cell-type dependent and expensive. RESULTS: Here, we propose a computational method to identify possible misannotated lncRNAs from sequence information alone. Our approach first builds deep learning models to discriminate coding and non-coding transcripts and leverages these models' training dynamics to identify misannotated lncRNAs-i.e. lncRNAs with coding potential. The set of misannotated lncRNAs we identified significantly overlap with experimentally validated ones and closely resemble coding protein sequences as evidenced by significant BLAST hits. Our analysis on a subset of misannotated lncRNA candidates also shows that some ORFs they contain yield high confidence folded structures as predicted by AlphaFold2. This methodology offers promising potential for assisting experimental efforts in characterizing the hidden proteome encoded by misannotated lncRNAs and for curating better datasets for building coding potential predictors. AVAILABILITY AND IMPLEMENTATION: Source code is available at https://github.com/nabiafshan/DetectingMisannotatedLncRNAs. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Deep Learning , RNA, Long Noncoding , RNA, Long Noncoding/genetics , Amino Acid Sequence , Proteome/genetics , Open Reading Frames , Micropeptides
2.
Genome ; 65(2): 57-74, 2022 Feb.
Article in English | MEDLINE | ID: mdl-34606733

ABSTRACT

Human Angiotensin I Converting Enzyme 2 (ACE2) plays an essential role in blood pressure regulation and SARS-CoV-2 entry. ACE2 has a highly conserved, one-to-one ortholog (ace2) in zebrafish, which is an important model for human diseases. However, the zebrafish ace2 expression profile has not yet been studied during early development, between genders, across different genotypes, or in disease. Moreover, a network-based meta-analysis for the extraction of functionally enriched pathways associated with differential ace2 expression is lacking in the literature. Herein, we first identified significant development-, tissue-, genotype-, and gender-specific modulations in ace2 expression via meta-analysis of zebrafish Affymetrix transcriptomics datasets (ndatasets = 107); and the correlation analysis of ace2 meta-differential expression profile revealed distinct positively and negatively correlated local functionally enriched gene networks. Moreover, we demonstrated that ace2 expression was significantly modulated under different physiological and pathological conditions related to development, tissue, gender, diet, infection, and inflammation using additional RNA-seq datasets. Our findings implicate a novel translational role for zebrafish ace2 in organ differentiation and pathologies observed in the intestines and liver.


Subject(s)
Angiotensin-Converting Enzyme 2/genetics , Zebrafish Proteins/genetics , Zebrafish , Animals , Female , Gene Expression Regulation, Developmental , Male , RNA-Seq , Zebrafish/genetics
3.
4.
BMC Bioinformatics ; 22(1): 294, 2021 Jun 02.
Article in English | MEDLINE | ID: mdl-34078267

ABSTRACT

BACKGROUND: While some non-coding RNAs (ncRNAs) are assigned critical regulatory roles, most remain functionally uncharacterized. This presents a challenge whenever an interesting set of ncRNAs needs to be analyzed in a functional context. Transcripts located close-by on the genome are often regulated together. This genomic proximity on the sequence can hint at a functional association. RESULTS: We present a tool, NoRCE, that performs cis enrichment analysis for a given set of ncRNAs. Enrichment is carried out using the functional annotations of the coding genes located proximal to the input ncRNAs. Other biologically relevant information such as topologically associating domain (TAD) boundaries, co-expression patterns, and miRNA target prediction information can be incorporated to conduct a richer enrichment analysis. To this end, NoRCE includes several relevant datasets as part of its data repository, including cell-line specific TAD boundaries, functional gene sets, and expression data for coding & ncRNAs specific to cancer. Additionally, the users can utilize custom data files in their investigation. Enrichment results can be retrieved in a tabular format or visualized in several different ways. NoRCE is currently available for the following species: human, mouse, rat, zebrafish, fruit fly, worm, and yeast. CONCLUSIONS: NoRCE is a platform-independent, user-friendly, comprehensive R package that can be used to gain insight into the functional importance of a list of ncRNAs of any type. The tool offers flexibility to conduct the users' preferred set of analyses by designing their own pipeline of analysis. NoRCE is available in Bioconductor and https://github.com/guldenolgun/NoRCE .


Subject(s)
MicroRNAs , Zebrafish , Animals , Genome , Mice , RNA, Untranslated/genetics , Rats , Zebrafish/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...