Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 1 de 1
Filter
Add more filters










Database
Language
Publication year range
1.
Int J Mol Sci ; 16(1): 1466-81, 2015 Jan 08.
Article in English | MEDLINE | ID: mdl-25580537

ABSTRACT

The discovery of novel microRNA (miRNA) and piwi-interacting RNA (piRNA) is an important task for the understanding of many biological processes. Most of the available miRNA and piRNA identification methods are dependent on the availability of the organism's genome sequence and the quality of its annotation. Therefore, an efficient prediction method based solely on the short RNA reads and requiring no genomic information is highly desirable. In this study, we propose an approach that relies primarily on the nucleotide composition of the read and does not require reference genomes of related species for prediction. Using an empirical Bayesian kernel method and the error correcting output codes framework, compact models suitable for large-scale analyses are built on databases of known mature miRNAs and piRNAs. We found that the usage of an L1-based Gaussian kernel can double the true positive rate compared to the standard L2-based Gaussian kernel. Our approach can increase the true positive rate by at most 60% compared to the existing piRNA predictor based on the analysis of a hold-out test set. Using experimental data, we also show that our approach can detect about an order of magnitude or more known miRNAs than the mature miRNA predictor, miRPlex.


Subject(s)
MicroRNAs/metabolism , RNA, Small Interfering/metabolism , Animals , Caenorhabditis elegans/genetics , Databases, Genetic , Drosophila melanogaster/genetics , Genome , MicroRNAs/genetics , Normal Distribution , RNA, Small Interfering/genetics , ROC Curve , Support Vector Machine
SELECTION OF CITATIONS
SEARCH DETAIL
...