ABSTRACT
The genome sequence of Leifsonia xyli subsp. xyli, which causes ratoon stunting disease and affects sugarcane worldwide, was determined. The single circular chromosome of Leifsonia xyli subsp. xyli CTCB07 was 2.6 Mb in length with a GC content of 68% and 2,044 predicted open reading frames. The analysis also revealed 307 predicted pseudogenes, which is more than any bacterial plant pathogen sequenced to date. Many of these pseudogenes, if functional, would likely be involved in the degradation of plant heteropolysaccharides, uptake of free sugars, and synthesis of amino acids. Although L. xyli subsp. xyli has only been identified colonizing the xylem vessels of sugarcane, the numbers of predicted regulatory genes and sugar transporters are similar to those in free-living organisms. Some of the predicted pathogenicity genes appear to have been acquired by lateral transfer and include genes for cellulase, pectinase, wilt-inducing protein, lysozyme, and desaturase. The presence of the latter may contribute to stunting, since it is likely involved in the synthesis of abscisic acid, a hormone that arrests growth. Our findings are consistent with the nutritionally fastidious behavior exhibited by L. xyli subsp. xyli and suggest an ongoing adaptation to the restricted ecological niche it inhabits.
Subject(s)
Actinomycetales/genetics , Genome, Bacterial , Actinomycetales/classification , Base Composition , Genes, Bacterial , Molecular Sequence Data , Pseudogenes , Saccharum/microbiologyABSTRACT
Over 40,000 sugarcane (Saccharum officinarum) consensus sequences assembled from 237,954 expressed sequence tags were compared with the protein and DNA sequences from other angiosperms, including the genomes of Arabidopsis and rice (Oryza sativa). Approximately two-thirds of the sugarcane transcriptome have similar sequences in Arabidopsis. These sequences may represent a core set of proteins or protein domains that are conserved among monocots and eudicots and probably encode for essential angiosperm functions. The remaining sequences represent putative monocot-specific genetic material, one-half of which were found only in sugarcane. These monocot-specific cDNAs represent either novelties or, in many cases, fast-evolving sequences that diverged substantially from their eudicot homologs. The wide comparative genome analysis presented here provides information on the evolutionary changes that underlie the divergence of monocots and eudicots. Our comparative analysis also led to the identification of several not yet annotated putative genes and possible gene loss events in Arabidopsis.
Subject(s)
Magnoliopsida/classification , Magnoliopsida/genetics , Saccharum/classification , Saccharum/genetics , Arabidopsis/classification , Arabidopsis/genetics , Chromosomes, Plant/genetics , Consensus Sequence , Evolution, Molecular , Expressed Sequence Tags , Genome, Plant , Oryza/classification , Oryza/genetics , Transcription, GeneticABSTRACT
To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembling. This indicated that possibly 33,620 unique genes had been identified and indicated that >90% of the sugarcane expressed genes were tagged.