Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
Curr Genomics ; 18(4): 322-331, 2017 Aug.
Article in English | MEDLINE | ID: mdl-29081688

ABSTRACT

BACKGROUNDS: With the advent of the post genomic era, the research for the genetic mechanism of the diseases has found to be increasingly depended on the studies of the genes, the gene-networks and gene-protein interaction networks. To explore gene expression and regulation, the researchers have carried out many studies on transcription factors and their binding sites (TFBSs). Based on the large amount of transcription factor binding sites predicting values in the deep learning models, further computation and analysis have been done to reveal the relationship between the gene mutation and the occurrence of the disease. It has been demonstrated that based on the deep learning methods, the performances of the prediction for the functions of the noncoding variants are outperforming than those of the conventional methods. The research on the prediction for functions of Single Nucleotide Polymorphisms (SNPs) is expected to uncover the mechanism of the gene mutation affection on traits and diseases of human beings. RESULTS: We reviewed the conventional TFBSs identification methods from different perspectives. As for the deep learning methods to predict the TFBSs, we discussed the related problems, such as the raw data preprocessing, the structure design of the deep convolution neural network (CNN) and the model performance measure et al. And then we summarized the techniques that usually used in finding out the functional noncoding variants from de novo sequence. CONCLUSION: Along with the rapid development of the high-throughout assays, more and more sample data and chromatin features would be conducive to improve the prediction accuracy of the deep convolution neural network for TFBSs identification. Meanwhile, getting more insights into the deep CNN framework itself has been proved useful for both the promotion on model performance and the development for more suitable design to sample data. Based on the feature values predicted by the deep CNN model, the prioritization model for functional noncoding variants would contribute to reveal the affection of gene mutation on the diseases.

2.
J Theor Biol ; 395: 23-30, 2016 Apr 21.
Article in English | MEDLINE | ID: mdl-26829314

ABSTRACT

This study presents an alternative alignment-free relative feature analysis method based on the minimal absent word, which has potential advantages over the local alignment method in local analysis. Smooth-local-analysis-curve and similarity-distribution are constructed for a fast, efficient, and visual comparison. Moreover, when the multi-sequence-comparison is needed, the local-analysis-curves can illustrate some interesting zones.


Subject(s)
Genome, Human , Models, Genetic , Sequence Analysis, DNA , Animals , Humans
3.
J Theor Biol ; 380: 473-9, 2015 Sep 07.
Article in English | MEDLINE | ID: mdl-26100179

ABSTRACT

The function of an RNA-molecule is mainly determined by its tertiary structures. And its secondary structure is an important determinant of its tertiary structure. The comparative methods usually give better results than the single-sequence methods. Based on minimum and suboptimal free energy structures, the paper presents a novel method for predicting conserved secondary structure of a group of related RNAs. In the method, the information from the known RNA structures is used as training data in a SVM (Support Vector Machine) classifier. Our method has been tested on the benchmark dataset given by Puton et al. The results show that the average sensitivity of our method is higher than that of other comparative methods such as CentroidAlifold, MXScrana, RNAalifold, and TurboFold.


Subject(s)
Models, Chemical , Nucleic Acid Conformation , RNA/chemistry
4.
Genet Mol Res ; 8(3): 848-60, 2009 Jul 21.
Article in English | MEDLINE | ID: mdl-19731207

ABSTRACT

At present, studies on microRNA mainly focus on the identification of microRNA genes and their mRNA targets. Although researchers have identified many microRNA genes, relatively few microRNA targets have been identified by experimental methods. Computational programs designed for predicting potential microRNA targets provide numerous targets for experimental validation. We used a Markov model to examine base-pairing binding patterns of known microRNA targets. Using this model, potential microRNA targets in human species predicted by four well-known computational programs were assessed. Each potential target was assigned a score reflecting consistency with known target binding patterns. Targets with scores higher than the cutoff value would be identified by our model. The predicted targets identified by our model have base-pairing binding patterns consistent with known targets. This model was efficient for evaluating the extent to which a potential target was accurately predicted.


Subject(s)
Markov Chains , MicroRNAs/genetics , Models, Genetic , Base Sequence , Computational Biology , Databases, Nucleic Acid , Humans , Molecular Sequence Data
5.
J Biomol Struct Dyn ; 26(6): 763-9, 2009 Jun.
Article in English | MEDLINE | ID: mdl-19385704

ABSTRACT

Protein sequences are treated as stochastic processes on the basis of a reduced amino acid alphabet of 10 types of amino acids. The realization of a stochastic process is described by associated transition probability matrix that corresponds to the process uniquely. Then new distances between transition probability matrices are defined for sequences similarity analysis. Two separate datasets are prepared and tested to identify the validity of the method. The results demonstrate the new method is powerful and efficient.


Subject(s)
Algorithms , Amino Acids/analysis , Proteins/analysis , Sequence Alignment/methods , Amino Acid Sequence , Amino Acids/genetics , Animals , Humans , Membrane Glycoproteins/analysis , Membrane Glycoproteins/classification , Membrane Glycoproteins/genetics , Molecular Sequence Data , Phylogeny , Proteins/classification , Proteins/genetics , Reproducibility of Results , Spike Glycoprotein, Coronavirus , Transferrin/analysis , Transferrin/classification , Transferrin/genetics , Viral Envelope Proteins/analysis , Viral Envelope Proteins/classification , Viral Envelope Proteins/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...