Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 2 de 2
Filter
Add more filters











Database
Language
Publication year range
1.
Stat Appl Genet Mol Biol ; 15(5): 381-400, 2016 10 01.
Article in English | MEDLINE | ID: mdl-27337743

ABSTRACT

The aim of this study was to show that amino acid sequences have a latent periodicity with insertions and deletions of amino acids in unknown positions of the analyzed sequence. Genetic algorithm, dynamic programming and random weight matrices were used to develop a new mathematical algorithm for latent periodicity search. A multiple alignment of periods was calculated with help of the direct optimization of the position-weight matrix without using pairwise alignments. The developed algorithm was applied to analyze amino acid sequences of a small number of proteins. This study showed the presence of latent periodicity with insertions and deletions in the amino acid sequences of such proteins, for which the presence of latent periodicity was not previously known. The origin of latent periodicity with insertions and deletions is discussed.


Subject(s)
Algorithms , Amino Acid Sequence , Computational Biology/methods , Models, Genetic , Models, Statistical , Mutagenesis, Insertional , Sequence Deletion
2.
Comput Biol Chem ; 51: 12-21, 2014 Aug.
Article in English | MEDLINE | ID: mdl-24840641

ABSTRACT

We describe a new mathematical method for finding very diverged short tandem repeats containing a single indel. The method involves comparison of two frequency matrices: a first matrix for a subsequence before shift and a second one for a subsequence after it. A measure of comparison is based on matrix similarity. The approach developed was applied to analysis of the genomes of Caenorhabditis elegans, Drosophila melanogaster and Saccharomyces cerevisiae. They were investigated regarding the presence of tandem repeats having repeat length equal to 2 - 11 nucleotides except equal to 3, 6 and 9 nucleotides. A number of phase shift regions for these genomes was approximately 2.2 × 10(4), 1.5 × 10(4) and 1.7 × 10(2), respectively. Type I error was less than 5%. The mean length of fuzzy periodicity and phase shift regions was about 220 nucleotides. The regions of fuzzy periodicity having single insertion or deletion occupy substantial parts of the genomes: 5%, 3% and 0.3%, respectively. Only less than 10% of these regions have been detected previously. That is, the number of such regions in the genomes of C. elegans, D. melanogaster and S. cerevisiae is dramatically higher than it has been revealed by any known methods. We suppose that some found regions of fuzzy periodicity could be the regions for protein binding.


Subject(s)
Caenorhabditis elegans/genetics , Drosophila melanogaster/genetics , Frameshift Mutation , Genome , Microsatellite Repeats , Saccharomyces cerevisiae/genetics , Animals , Base Sequence , INDEL Mutation , Molecular Sequence Data , Monte Carlo Method , Mutagenesis, Insertional
SELECTION OF CITATIONS
SEARCH DETAIL