Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Int J Mol Sci ; 23(7)2022 Mar 29.
Artículo en Inglés | MEDLINE | ID: mdl-35409113

RESUMEN

In the course of evolution, both flowering plants and some gymnosperms have developed such an adaptation to winter and unfavorable living conditions as deciduousness. Of particular interest is Siberian larch (Larix sibirica Ledeb.), which is the only species in the pine family (Pinaceae) with a seasonal deciduousness. New generation sequencing technologies make it possible to study this phenomenon at the genomic level and to reveal the genetic mechanisms of leaf and needle aging in angiosperms and gymnosperms. Using a comparative analysis of the genomes of evergreen and deciduous trees, it was found that the genes that control EXORDIUM LIKE 2 (EXL2) and DORMANCY-ASSOCIATED PROTEIN 1 (DRM1) proteins are most represented in Siberian larch, while an excess of genes that control proteins acting as immune receptors were found in evergreens. Orthologs from the family of genes that control leucine-rich repeat receptor-like kinases (LRR-RLK) contributed mostly to the distinction between evergreens and deciduous plants.


Asunto(s)
Larix , Magnoliopsida , Tracheophyta , Bosques , Genómica , Larix/genética , Estaciones del Año , Árboles/genética
2.
PLoS One ; 17(3): e0264640, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35259178

RESUMEN

The SARS-CoV-2 is the third coronavirus in addition to SARS-CoV and MERS-CoV that causes severe respiratory syndrome in humans. All of them likely crossed the interspecific barrier between animals and humans and are of zoonotic origin, respectively. The origin and evolution of viruses and their phylogenetic relationships are of great importance for study of their pathogenicity and development of antiviral drugs and vaccines. The main objective of the presented study was to compare two methods for identifying relationships between coronavirus genomes: phylogenetic one based on the whole genome alignment followed by molecular phylogenetic tree inference and alignment-free clustering of triplet frequencies, respectively, using 69 coronavirus genomes selected from two public databases. Both approaches resulted in well-resolved robust classifications. In general, the clusters identified by the first approach were in good agreement with the classes identified by the second using K-means and the elastic map method, but not always, which still needs to be explained. Both approaches demonstrated also a significant divergence of genomes on a taxonomic level, but there was less correspondence between genomes regarding the types of diseases they caused, which may be due to the individual characteristics of the host. This research showed that alignment-free methods are efficient in combination with alignment-based methods. They have a significant advantage in computational complexity and provide valuable additional alternative information on the genomes relationships.


Asunto(s)
Hibridación Genómica Comparativa/métodos , Coronavirus/genética , Genoma Viral , Mapeo Cromosómico , Análisis por Conglomerados , Coronavirus/clasificación , Humanos , Filogenia , SARS-CoV-2/clasificación , SARS-CoV-2/genética , Alineación de Secuencia
3.
BMC Genomics ; 21(1): 654, 2020 Sep 23.
Artículo en Inglés | MEDLINE | ID: mdl-32972367

RESUMEN

BACKGROUND: Plant mitochondrial genomes (mitogenomes) can be structurally complex while their size can vary from ~ 222 Kbp in Brassica napus to 11.3 Mbp in Silene conica. To date, in comparison with the number of plant species, only a few plant mitogenomes have been sequenced and released, particularly for conifers (the Pinaceae family). Conifers cover an ancient group of land plants that includes about 600 species, and which are of great ecological and economical value. Among them, Siberian larch (Larix sibirica Ledeb.) represents one of the keystone species in Siberian boreal forests. Yet, despite its importance for evolutionary and population studies, the mitogenome of Siberian larch has not yet been assembled and studied. RESULTS: Two sources of DNA sequences were used to search for mitochondrial DNA (mtDNA) sequences: mtDNA enriched samples and nucleotide reads generated in the de novo whole genome sequencing project, respectively. The assembly of the Siberian larch mitogenome contained nine contigs, with the shortest and the largest contigs being 24,767 bp and 4,008,762 bp, respectively. The total size of the genome was estimated at 11.7 Mbp. In total, 40 protein-coding, 34 tRNA, and 3 rRNA genes and numerous repetitive elements (REs) were annotated in this mitogenome. In total, 864 C-to-U RNA editing sites were found for 38 out of 40 protein-coding genes. The immense size of this genome, currently the largest reported, can be partly explained by variable numbers of mobile genetic elements, and introns, but unlikely by plasmid-related sequences. We found few plasmid-like insertions representing only 0.11% of the entire Siberian larch mitogenome. CONCLUSIONS: Our study showed that the size of the Siberian larch mitogenome is much larger than in other so far studied Gymnosperms, and in the same range as for the annual flowering plant Silene conica (11.3 Mbp). Similar to other species, the Siberian larch mitogenome contains relatively few genes, and despite its huge size, the repeated and low complexity regions cover only 14.46% of the mitogenome sequence.


Asunto(s)
Tamaño del Genoma , Genoma Mitocondrial , Larix/genética , Mapeo Contig , Anotación de Secuencia Molecular , Proteínas de Plantas/genética , ARN Ribosómico/genética , ARN de Transferencia/genética , Secuencias Repetitivas de Ácidos Nucleicos
4.
BMC Bioinformatics ; 21(Suppl 2): 83, 2020 Mar 11.
Artículo en Inglés | MEDLINE | ID: mdl-32164552

RESUMEN

BACKGROUND: Previously, a seven-cluster pattern claiming to be a universal one in bacterial genomes has been reported. Keeping in mind the most popular theory of chloroplast origin, we checked whether a similar pattern is observed in chloroplast genomes. RESULTS: Surprisingly, eight cluster structure has been found, for chloroplasts. The pattern observed for chloroplasts differs rather significantly, from bacterial one, and from that latter observed for cyanobacteria. The structure is provided by clustering of the fragments of equal length isolated within a genome so that each fragment is converted in triplet frequency dictionary with non-overlapping triplets with no gaps in frame tiling. The points in 63-dimensional space were clustered due to elastic map technique. The eight cluster found in chloroplasts comprises the fragments of a genome bearing tRNA genes and exhibiting excessively high GC-content, in comparison to the entire genome. CONCLUSION: Chloroplasts exhibit very specific symmetry type in distribution of coding and non-coding fragments of a genome in the space of triplet frequencies: this is mirror symmetry. Cyanobacteria may have both mirror symmetry, and the rotational symmetry typical for other bacteria.


Asunto(s)
Cloroplastos/genética , Genoma del Cloroplasto , Composición de Base , Análisis por Conglomerados , Cianobacterias/genética , ARN de Transferencia/genética
5.
Theory Biosci ; 127(2): 69-78, 2008 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-18443840

RESUMEN

Information capacity of nucleotide sequences measures the unexpectedness of a continuation of a given string of nucleotides, thus having a sound relation to a variety of biological issues. A continuation is defined in a way maximizing the entropy of the ensemble of such continuations. The capacity is defined as a mutual entropy of real frequency dictionary of a sequence with respect to the one bearing the most expected continuations; it does not depend on the length of strings contained in a dictionary. Various genomes exhibit a multi-minima pattern of the dependence of information capacity on the string length, thus reflecting an order within a sequence. The strings with significant deviation of an expected frequency from the real one are the words of increased information value. Such words exhibit a non-random distribution alongside a sequence, thus making it possible to retrieve the correlation between a structure, and a function encoded within a sequence.


Asunto(s)
Algoritmos , Mapeo Cromosómico/métodos , Biología Computacional/métodos , ADN/genética , Almacenamiento y Recuperación de la Información/métodos , Modelos Genéticos , Análisis de Secuencia de ADN/métodos , Secuencia de Bases , Simulación por Computador , Datos de Secuencia Molecular , Relación Estructura-Actividad
6.
Bull Math Biol ; 65(2): 309-22, 2003 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-12675334

RESUMEN

A new method to compare two (or several) symbol sequences is developed. The method is based on the comparison of the frequencies of the small fragments of the compared sequences; it requires neither string editing, nor other transformations of the compared objects. The comparison is executed through a calculation of the specific entropy of a frequency dictionary against the special dictionary called the hybrid one; this latter is the statistical ancestor of the group of sequences under comparison. Some applications of the developed method in the fields of genetics and bioinformatics are discussed.


Asunto(s)
Nucleótidos/química , Análisis de Secuencia/métodos , Animales , Secuencia de Bases , Bases de Datos Factuales , Entropía , Alineación de Secuencia/métodos , Homología de Secuencia de Ácido Nucleico , Estadística como Asunto
7.
J Biol Phys ; 29(1): 23-38, 2003 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-23345817

RESUMEN

The information capacity of nucleotide sequences is defined through the calculation of specific entropy of their frequency dictionary. The specificentropy of the frequency dictionary is calculated against the reconstructeddictionary; this latter bears the most probable continuations of the shorterstrings. This developed measure allows to distinguish the sequences both from the randons ones, and from those with high level of (rather simple) order. Some implications of the developed methodology in the fields of genetics,bioinformatics, and molecular biology are discussed.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...