1.
Bioinformatics ; 39(10)2023 10 03.
Article in English | MEDLINE | ID: mdl-37846049

ABSTRACT

SUMMARY: Pangenomes are replacing single reference genomes as the definitive representation of DNA sequence within a species or clade. Pangenome analysis predominantly leverages graph-based methods that require computationally intensive multiple genome alignments, do not scale to highly complex eukaryotic genomes, limit their scope to identifying structural variants (SVs), or incur bias by relying on a reference genome. Here, we present PanKmer, a toolkit designed for reference-free analysis of pangenome datasets consisting of dozens to thousands of individual genomes. PanKmer decomposes a set of input genomes into a table of observed k-mers and their presence-absence values in each genome. These are stored in an efficient k-mer index data format that encodes SNPs, INDELs, and SVs. PanKmer also includes functions for downstream analysis of the k-mer index, such as calculating sequence similarity statistics between individuals at whole-genome or local scales. For example, k-mers can be "anchored" in any individual genome to quantify sequence variability or conservation at a specific locus. This facilitates workflows with various biological applications, e.g. identifying cases of hybridization between plant species. PanKmer provides researchers with a valuable and convenient means to explore the full scope of genetic variation in a population, without reference bias. AVAILABILITY AND IMPLEMENTATION: PanKmer is implemented as a Python package with components written in Rust, released under a BSD license. The source code is available from the Python Package Index (PyPI) at https://pypi.org/project/pankmer/ as well as GitLab at https://gitlab.com/salk-tm/pankmer. Full documentation is available at https://salk-tm.gitlab.io/pankmer/.


Subjects
Genome , Software , Humans , Eukaryota , Documentation , Sequence Analysis, DNA/methods
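
The k-mer presence-absence table described in the PanKmer abstract can be illustrated with a minimal sketch. This is not PanKmer's implementation or API: the function names (`canonical_kmers`, `build_presence_table`, `jaccard`), the use of canonical k-mers, and the choice of Jaccard similarity as the whole-genome statistic are all assumptions made for illustration only.

```python
def canonical_kmers(seq, k):
    """Yield canonical k-mers: the lexicographically smaller of each k-mer
    and its reverse complement. (Canonical form is an assumption here.)"""
    comp = str.maketrans("ACGT", "TGCA")
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        # Skip windows containing ambiguous bases such as N.
        if set(kmer) <= set("ACGT"):
            rc = kmer.translate(comp)[::-1]
            yield min(kmer, rc)


def build_presence_table(genomes, k):
    """Return (names, table), where table maps each observed k-mer to a
    tuple of 0/1 presence flags, one per genome in input order."""
    names = list(genomes)
    kmer_sets = {n: set(canonical_kmers(genomes[n], k)) for n in names}
    all_kmers = set().union(*kmer_sets.values())
    table = {
        km: tuple(int(km in kmer_sets[n]) for n in names)
        for km in sorted(all_kmers)
    }
    return names, table


def jaccard(table, i, j):
    """Jaccard similarity between genomes i and j: shared k-mers divided
    by k-mers present in either genome."""
    shared = sum(1 for row in table.values() if row[i] and row[j])
    union = sum(1 for row in table.values() if row[i] or row[j])
    return shared / union if union else 0.0


if __name__ == "__main__":
    # Toy sequences, hypothetical and far shorter than real genomes.
    toy = {"g1": "ACGTACGGA", "g2": "ACGTACCGA"}
    names, table = build_presence_table(toy, k=4)
    print(names, len(table), jaccard(table, 0, 1))
```

A table of this shape holds every k-mer set in memory, so it only suits toy inputs; the abstract states that PanKmer stores the same information in an efficient k-mer index format.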
2.
New Phytol ; 239(1): 116-131, 2023 07.
Article in English | MEDLINE | ID: mdl-37149888

ABSTRACT

Over 15 families of aquatic plants are known to use a strategy of developmental switching upon environmental stress to produce dormant propagules called turions. However, few molecular details of turion biology have been elucidated because of the difficulty of isolating high-quality nucleic acids from this tissue. We developed a new protocol to isolate high-quality transcripts and carried out RNA-seq analysis of mature turions from the Greater Duckweed Spirodela polyrhiza. Turion transcriptomes were compared with those of fronds, the actively growing leaf-like tissue. Bioinformatic analysis of high-confidence, differentially expressed transcripts between frond and mature turion tissues revealed major pathways related to stress tolerance, starch and lipid metabolism, and dormancy that are mobilized to reprogram frond meristems for turion differentiation. We identified the key genes that are likely to drive starch and lipid accumulation during turion formation, as well as those in pathways for starch and lipid utilization upon turion germination. Comparison of genome-wide cytosine methylation levels also revealed evidence for epigenetic changes in the formation of turion tissues. Similarities between turions and seeds provide evidence that key regulators for seed maturation and germination were retooled for their function in turion biology.


Subjects
Araceae , Germination , Germination/genetics , Araceae/genetics , Genomics , Starch/metabolism , Lipids , Plant Dormancy/genetics
3.
PLoS One ; 17(3): e0264966, 2022.
Article in English | MEDLINE | ID: mdl-35255111

ABSTRACT

Cranberry (Vaccinium macrocarpon), a member of the heath family (Ericaceae), is a temperate, low-growing woody perennial native to North America that is economically important and has significant health benefits. While some native varieties are still grown today, breeding programs over the past 50 years have made significant contributions to improving disease resistance, fruit quality and yield. An initial genome sequence of an inbred line of the wild selection 'Ben Lear,' which is parent to multiple breeding programs, provided insight into the gene repertoire as well as a platform for molecular breeding. Recent breeding efforts have focused on leveraging the circumboreal V. oxycoccos, which forms interspecific hybrids with V. macrocarpon, offering to bring in novel fruit chemistry and other desirable traits. Here we present an updated, chromosome-resolved V. macrocarpon reference genome, and compare it to a high-quality draft genome of V. oxycoccos. Leveraging the chromosome-resolved cranberry reference genome, we confirmed that the Ericaceae has undergone two whole genome duplications that are shared with blueberry and rhododendron. Leveraging resequencing data for 'Ben Lear' inbred lines, as well as several wild and elite selections, we identified common regions that are targets of improvement. The same syntenic regions were identified in V. oxycoccos and represent environmental response and plant architecture genes. These data provide insight into early genomic selection in the domestication of a native North American berry crop.


Subjects
Ericaceae , Vaccinium macrocarpon , Domestication , Ericaceae/genetics , Fruit/genetics , Genome, Plant , Plant Breeding , Plant Extracts/analysis , Vaccinium macrocarpon/chemistry , Vaccinium macrocarpon/genetics
4.
Plant Physiol ; 188(2): 879-897, 2022 02 04.
Article in English | MEDLINE | ID: mdl-34893913

ABSTRACT

The ability to trace every cell in some model organisms has led to the fundamental understanding of development and cellular function. However, in plants the complexity of cell number, organ size, and developmental time makes this a challenge even in the diminutive model plant Arabidopsis (Arabidopsis thaliana). Duckweed, basal nongrass aquatic monocots, provide an opportunity to follow every cell of an entire plant due to their small size, reduced body plan, and fast clonal growth habit. Here we present a chromosome-resolved genome for the highly invasive Lesser Duckweed (Lemna minuta) and generate a preliminary cell atlas leveraging low cell coverage single-nuclei sequencing. We resolved the 360 megabase genome into 21 chromosomes, revealing a core nonredundant gene set with only the ancient tau whole-genome duplication shared with all monocots, and paralog expansion as a result of tandem duplications related to phytoremediation. Leveraging SMARTseq2 single-nuclei sequencing, which provided higher gene coverage yet lower cell count, we profiled 269 nuclei covering 36.9% (8,457) of the L. minuta transcriptome. Since molecular validation was not possible in this nonmodel plant, we leveraged gene orthology with model organism single-cell expression datasets, gene ontology, and cell trajectory analysis to define putative cell types. We found that the tissue that we computationally defined as mesophyll expressed high levels of elemental transport genes consistent with this tissue playing a role in L. minuta wastewater detoxification. The L. minuta genome and preliminary cell map provide a paradigm to decipher developmental genes and pathways for an entire plant.


Subjects
Araceae/genetics , Introduced Species , Plant Dispersal/genetics , Transcriptome , Genome, Plant
5.
Science ; 374(6569): eabi7489, 2021 Nov 12.
Article in English | MEDLINE | ID: mdl-34762468

ABSTRACT

Centromeres attach chromosomes to spindle microtubules during cell division and, despite this conserved role, show paradoxically rapid evolution and are typified by complex repeats. We used long-read sequencing to generate the Col-CEN Arabidopsis thaliana genome assembly that resolves all five centromeres. The centromeres consist of megabase-scale tandemly repeated satellite arrays, which support CENTROMERE SPECIFIC HISTONE H3 (CENH3) occupancy and are densely DNA methylated, with satellite variants private to each chromosome. CENH3 preferentially occupies satellites that show the least amount of divergence and occur in higher-order repeats. The centromeres are invaded by ATHILA retrotransposons, which disrupt genetic and epigenetic organization. Centromeric crossover recombination is suppressed, yet low levels of meiotic DNA double-strand breaks occur that are regulated by DNA methylation. We propose that Arabidopsis centromeres are evolving through cycles of satellite homogenization and retrotransposon-driven diversification.


Subjects
Arabidopsis/genetics , Centromere/genetics , Chromosomes, Plant/genetics , Epigenesis, Genetic , Arabidopsis/ultrastructure , Centromere/chemistry , DNA Methylation , DNA, Satellite , Evolution, Molecular , Genome, Plant , Histones/analysis , Meiosis , Recombination, Genetic , Retroelements , Sequence Analysis, DNA
6.
Nat Commun ; 12(1): 6348, 2021 11 03.
Article in English | MEDLINE | ID: mdl-34732722

ABSTRACT

To conserve water in arid environments, numerous plant lineages have independently evolved Crassulacean Acid Metabolism (CAM). Interestingly, Isoetes, an aquatic lycophyte, can also perform CAM as an adaptation to low CO2 availability underwater. However, little is known about the evolution of CAM in aquatic plants and the lack of genomic data has hindered comparison between aquatic and terrestrial CAM. Here, we investigate underwater CAM in Isoetes taiwanensis by generating a high-quality genome assembly and RNA-seq time course. Despite broad similarities between CAM in Isoetes and terrestrial angiosperms, we identify several key differences. Notably, Isoetes may have recruited the lesser-known 'bacterial-type' PEPC, along with the 'plant-type' exclusively used in other CAM and C4 plants for carboxylation of PEP. Furthermore, we find that circadian control of key CAM pathway genes has diverged considerably in Isoetes relative to flowering plants. This suggests the existence of more evolutionary paths to CAM than previously recognized.


Subjects
Crassulacean Acid Metabolism/physiology , Photosynthesis/physiology , Tracheophyta/genetics , Tracheophyta/metabolism , Carbon Dioxide/metabolism , Crassulacean Acid Metabolism/genetics , Evolution, Molecular , Gene Expression , Genome , Genome Size , Lignin/biosynthesis , Magnoliopsida , Plants/metabolism , Taiwan , Water , Whole Genome Sequencing
7.
Genome Res ; 31(2): 225-238, 2021 Feb.
Article in English | MEDLINE | ID: mdl-33361111

ABSTRACT

Rootless plants in the genus Wolffia are some of the fastest-growing known plants on Earth. Wolffia have a reduced body plan, primarily multiplying through a budding type of asexual reproduction. Here, we generated draft reference genomes for Wolffia australiana (Benth.) Hartog & Plas, which has the smallest genome size in the genus at 357 Mb and has a reduced set of predicted protein-coding genes at about 15,000. Comparison between multiple high-quality draft genome sequences from W. australiana clones confirmed loss of several hundred genes that are highly conserved among flowering plants, including genes involved in root developmental and light signaling pathways. Wolffia has also lost most of the conserved nucleotide-binding leucine-rich repeat (NLR) genes that are known to be involved in innate immunity, as well as those involved in terpene biosynthesis, while having a significant overrepresentation of genes in the sphingolipid pathways that may signify an alternative defense system. Diurnal expression analysis revealed that only 13% of Wolffia genes are expressed in a time-of-day (TOD) fashion, which is less than the typical ∼40% found in several model plants under the same condition. In contrast to the model plants Arabidopsis and rice, many of the pathways associated with multicellular and developmental processes are not under TOD control in W. australiana, where genes that cycle under the conditions tested predominantly have carbon processing and chloroplast-related functions. The Wolffia genome and TOD expression data set thus provide insight into the interplay between a streamlined plant body plan and optimized growth.

8.
New Phytol ; 227(6): 1709-1724, 2020 09.
Article in English | MEDLINE | ID: mdl-32112414

ABSTRACT

Plants are continuously exposed to diurnal fluctuations in light and temperature, and spontaneous changes in their physical or biotic environment. The circadian clock coordinates regulation of gene expression with a 24 h period, enabling the anticipation of these events. We used RNA sequencing to characterize the Brachypodium distachyon transcriptome under light and temperature cycles, as well as under constant conditions. Approximately 3% of the transcriptome was regulated by the circadian clock, a smaller proportion than reported in most other species. For most transcripts that were rhythmic under all conditions, including many known clock genes, the period of gene expression lengthened from 24 to 27 h in the absence of external cues. To functionally characterize the cyclic transcriptome in B. distachyon, we used Gene Ontology enrichment analysis, and found several terms significantly associated with peak expression at particular times of the day. Furthermore, we identified sequence motifs enriched in the promoters of similarly phased genes, some potentially associated with transcription factors. When considering the overlap in rhythmic gene expression and specific pathway behavior, thermocycles were the prevailing cue that controlled diurnal gene regulation. Taken together, our characterization of the rhythmic B. distachyon transcriptome represents a foundational resource with implications for other grass species.


Subjects
Brachypodium , Brachypodium/genetics , Circadian Rhythm/genetics , Cues , Gene Expression Regulation , Gene Expression Regulation, Plant , Temperature
9.
Bioinformatics ; 32(22): 3380-3387, 2016 11 15.
Article in English | MEDLINE | ID: mdl-27466620

ABSTRACT

MOTIVATION: Plasmids are stably maintained extra-chromosomal genetic elements that replicate independently from the host cell's chromosomes. Although plasmids harbor biomedically important genes (such as genes involved in virulence and antibiotic resistance), there is a shortage of specialized software tools for extracting and assembling plasmid data from whole genome sequencing projects. RESULTS: We present the plasmidSPAdes algorithm and software tool for assembling plasmids from whole genome sequencing data and benchmark its performance on a diverse set of bacterial genomes. AVAILABILITY AND IMPLEMENTATION: plasmidSPAdes is publicly available at http://spades.bioinf.spbau.ru/plasmidSPAdes/ CONTACT: d.antipov@spbu.ru SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Genome, Bacterial , Plasmids/genetics , Algorithms , Sequence Analysis, DNA , Software