RESUMO
The genus Passiflora comprises a large group of plants popularly known as passionfruit, much appreciated for their exotic flowers and edible fruits. The species (â¼500) are morphologically variable (e.g., growth habit, size, and color of flowers) and are adapted to distinct tropical ecosystems. In this study, we generated the genome of the wild diploid species Passiflora organensis Gardner by adopting a hybrid assembly approach. Passiflora organensis has a small genome of 259 Mbp and a heterozygosity rate of 81%, consistent with its reproductive system. Most of the genome sequences could be integrated into its chromosomes with cytogenomic markers (satellite DNA) as references. The repeated sequences accounted for 58.55% of the total DNA analyzed, and the Tekay lineage was the prevalent retrotransposon. In total, 25,327 coding genes were predicted. Passiflora organensis retains 5,609 singletons and 15,671 gene families. We focused on the genes potentially involved in the locus determining self-incompatibility and the MADS-box gene family, allowing us to infer expansions and contractions within specific subfamilies. Finally, we recovered the organellar DNA. Structural rearrangements and two mitoviruses, besides relics of other mobile elements, were found in the chloroplast and mt-DNA molecules, respectively. This study presents the first draft genome assembly of a wild Passiflora species, providing a valuable sequence resource for genomic and evolutionary studies on the genus, and support for breeding cropped passionfruit species.
Assuntos
Passiflora , Diploide , Ecossistema , Passiflora/genética , Melhoramento Vegetal , RetroelementosRESUMO
Chloroplast genomes (cpDNA) in angiosperms are usually highly conserved. Although rearrangements have been observed in some lineages, such as Passiflora, the mechanisms that lead to rearrangements are still poorly elucidated. In the present study, we obtained 20 new chloroplast genomes (18 species from the genus Passiflora, and Dilkea retusa and Mitostemma brevifilis from the family Passifloraceae) in order to investigate cpDNA evolutionary history in this group. Passiflora cpDNAs vary in size considerably, with â¼50 kb between shortest and longest. Large inverted repeat (IR) expansions were identified, and at the extreme opposite, the loss of an IR was detected for the first time in Passiflora, a rare event in angiosperms. The loss of an IR region was detected in Passiflora capsularis and Passiflora costaricensis, a species in which occasional biparental chloroplast inheritance has previously been reported. A repertory of rearrangements such as inversions and gene losses were detected, making Passiflora one of the few groups with complex chloroplast genome evolution. We also performed a phylogenomic study based on all the available cp genomes and our analysis implies that there is a need to reconsider the taxonomic classifications of some species in the group.
Assuntos
DNA de Cloroplastos/química , Rearranjo Gênico , Genoma de Cloroplastos , Passiflora/genética , Filogenia , Sequências Repetidas Invertidas , Passiflora/química , Passiflora/classificaçãoRESUMO
A significant proportion of plant genomes is consists of transposable elements (TEs), especially LTR retrotransposons (LTR-RTs) which are known to drive genome evolution. However, not much information is available on the structure and evolutionary role of TEs in the Passifloraceae family (Malpighiales order). Against this backdrop, we identified, characterized, and inferred the potential genomic impact of the TE repertoire found in the available genomic resources for Passiflora edulis, a tropical fruit species. A total of 250 different TE sequences were identified (96% Class I, and 4% Class II), corresponding to ~ 19% of the P. edulis draft genome. TEs were found preferentially in intergenic spaces (70.4%), but also overlapping genes (30.6%). LTR-RTs accounted for 181 single elements corresponding to ~ 13% of the draft genome. A phylogenetic inference of the reverse transcriptase domain of the LTR-RT revealed association of 37 elements with the Copia superfamily (Angela, Ale, Tork, and Sire) and 128 with the Gypsy (Del, Athila, Reina, CRM, and Galadriel) superfamily, and Del elements were the most frequent. Interestingly, according to insertion time analysis, the majority (95.9%) of the LTR-RTs were recently inserted into the P. edulis genome (< 2.0 Mya), and with the exception of the Athila lineage, all LTR-RTs are transcriptionally active. Moreover, functional analyses disclosed that the Angela, Del, CRM and Tork lineages are conserved in wild Passiflora species, supporting the idea of a common expansion of Copia and Gypsy superfamilies. Overall, this is the first study describing the P. edulis TE repertoire, and it also lends weight to the suggestion that LTR-RTs had a recent expansion into the analyzed gene-rich region of the P. edulis genome, possibly along WGD (Whole genome duplication) events, but are under negative selection due to their potential deleterious impact on gene regions.
Assuntos
Elementos de DNA Transponíveis , Evolução Molecular , Frutas/genética , Passiflora/genética , Retroelementos , Sequências Repetidas Terminais , Mutagênese Insercional , Passiflora/classificação , Filogenia , Transcrição GênicaRESUMO
Sugarcane (Saccharum spp.) is highly polyploid and aneuploid. Modern cultivars are derived from hybridization between S. officinarum and S. spontaneum. This combination results in a genome exhibiting variable ploidy among different loci, a huge genome size (~10 Gb) and a high content of repetitive regions. An approach using genomic, transcriptomic, and genetic mapping can improve our knowledge of the behavior of genetics in sugarcane. The hypothetical HP600 and Centromere Protein C (CENP-C) genes from sugarcane were used to elucidate the allelic expression and genomic and genetic behaviors of this complex polyploid. The physically linked side-by-side genes HP600 and CENP-C were found in two different homeologous chromosome groups with ploidies of eight and ten. The first region (Region01) was a Sorghum bicolor ortholog region with all haplotypes of HP600 and CENP-C expressed, but HP600 exhibited an unbalanced haplotype expression. The second region (Region02) was a scrambled sugarcane sequence formed from different noncollinear genes containing partial duplications of HP600 and CENP-C (paralogs). This duplication resulted in a non-expressed HP600 pseudogene and a recombined fusion version of CENP-C and the orthologous gene Sobic.003G299500 with at least two chimeric gene haplotypes expressed. It was also determined that it occurred before Saccharum genus formation and after the separation of sorghum and sugarcane. A linkage map was constructed using markers from nonduplicated Region01 and for the duplication (Region01 and Region02). We compare the physical and linkage maps, demonstrating the possibility of mapping markers located in duplicated regions with markers in nonduplicated region. Our results contribute directly to the improvement of linkage mapping in complex polyploids and improve the integration of physical and genetic data for sugarcane breeding programs. Thus, we describe the complexity involved in sugarcane genetics and genomics and allelic dynamics, which can be useful for understanding complex polyploid genomes.
RESUMO
Passiflora edulis is the most widely cultivated species of passionflowers, cropped mainly for industrialized juice production and fresh fruit consumption. Despite its commercial importance, little is known about the genome structure of P. edulis. To fill in this gap in our knowledge, a genomic library was built, and now completely sequenced over 100 large-inserts. Sequencing data were assembled from long sequence reads, and structural sequence annotation resulted in the prediction of about 1,900 genes, providing data for subsequent functional analysis. The richness of repetitive elements was also evaluated. Microsyntenic regions of P. edulis common to Populus trichocarpa and Manihot esculenta, two related Malpighiales species with available fully sequenced genomes were examined. Overall, gene order was well conserved, with some disruptions of collinearity identified as rearrangements, such as inversion and translocation events. The microsynteny level observed between the P. edulis sequences and the compared genomes is surprising, given the long divergence time that separates them from the common ancestor. P. edulis gene-rich segments are more compact than those of the other two species, even though its genome is much larger. This study provides a first accurate gene set for P. edulis, opening the way for new studies on the evolutionary issues in Malpighiales genomes.