RESUMEN
BACKGROUND: Sugarcane cultivars are polyploid interspecific hybrids of giant genomes, typically with 10-13 sets of chromosomes from 2 Saccharum species. The ploidy, hybridity, and size of the genome, estimated to have >10 Gb, pose a challenge for sequencing. RESULTS: Here we present a gene space assembly of SP80-3280, including 373,869 putative genes and their potential regulatory regions. The alignment of single-copy genes in diploid grasses to the putative genes indicates that we could resolve 2-6 (up to 15) putative homo(eo)logs that are 99.1% identical within their coding sequences. Dissimilarities increase in their regulatory regions, and gene promoter analysis shows differences in regulatory elements within gene families that are expressed in a species-specific manner. We exemplify these differences for sucrose synthase (SuSy) and phenylalanine ammonia-lyase (PAL), 2 gene families central to carbon partitioning. SP80-3280 has particular regulatory elements involved in sucrose synthesis not found in the ancestor Saccharum spontaneum. PAL regulatory elements are found in co-expressed genes related to fiber synthesis within gene networks defined during plant growth and maturation. Comparison with sorghum reveals predominantly bi-allelic variations in sugarcane, consistent with the formation of 2 "subgenomes" after their divergence â¼3.8-4.6 million years ago and reveals single-nucleotide variants that may underlie their differences. CONCLUSIONS: This assembly represents a large step towards a whole-genome assembly of a commercial sugarcane cultivar. It includes a rich diversity of genes and homo(eo)logous resolution for a representative fraction of the gene space, relevant to improve biomass and food production.
Asunto(s)
Mapeo Contig/métodos , Glucosiltransferasas/genética , Fenilanina Amoníaco-Liasa/genética , Saccharum/crecimiento & desarrollo , Biomasa , Productos Agrícolas/genética , Productos Agrícolas/crecimiento & desarrollo , Variación Genética , Tamaño del Genoma , Genoma de Planta , Familia de Multigenes , Proteínas de Plantas/genética , Poliploidía , Regiones Promotoras Genéticas , Saccharum/genéticaRESUMEN
Background: Sugarcane cultivars are polyploid interspecific hybrids of giant genomes, typically with 10–13 sets of chromosomes from 2 Saccharum species. The ploidy, hybridity, and size of the genome, estimated to have >10 Gb, pose a challenge for sequencing. Results: Here we present a gene space assembly of SP80-3280, including 373,869 putative genes and their potential regulatory regions. The alignment of single-copy genes in diploid grasses to the putative genes indicates that we could resolve 2–6 (up to 15) putative homo(eo)logs that are 99.1% identical within their coding sequences. Dissimilarities increase in their regulatory regions, and gene promoter analysis shows differences in regulatory elements within gene families that are expressed in a species-specific manner. We exemplify these differences for sucrose synthase (SuSy) and phenylalanine ammonia-lyase (PAL), 2 gene families central to carbon partitioning. SP80-3280 has particular regulatory elements involved in sucrose synthesis not found in the ancestor Saccharum spontaneum. PAL regulatory elements are found in co-expressed genes related to fiber synthesis within gene networks defined during plant growth and maturation. Comparison with sorghum reveals predominantly bi-allelic variations in sugarcane, consistent with the formation of 2 "subgenomes" after their divergence ~3.8–4.6 million years ago and reveals single-nucleotide variants that may underlie their differences. Conclusions: This assembly represents a large step towards a whole-genome assembly of a commercial sugarcane cultivar. It includes a rich diversity of genes and homo(eo)logous resolution for a representative fraction of the gene space, relevant to improve biomass and food production.
RESUMEN
Johnsongrass (Sorghum halepense) is a striking example of a post-Columbian founder event. This natural experiment within ecological time-scales provides a unique opportunity for understanding patterns of continent-wide genetic diversity following range expansion. Microsatellite markers were used for population genetic analyses including leaf-optimized Neighbor-Joining tree, pairwise FST, mismatch analysis, principle coordinate analysis, Tajima's D, Fu's F and Bayesian clusterings of population structure. Evidence indicates two geographically distant introductions of divergent genotypes, which spread across much of the US in <200 years. Based on geophylogeny, gene flow patterns can be inferred to have involved five phases. Centers of genetic diversity have shifted from two introduction sites separated by ~2000 miles toward the middle of the range, consistent with admixture between genotypes from the respective introductions. Genotyping provides evidence for a 'habitat switch' from agricultural to non-agricultural systems and may contribute to both Johnsongrass ubiquity and aggressiveness. Despite lower and more structured diversity at the invasion front, Johnsongrass continues to advance northward into cooler and drier habitats. Association genetic approaches may permit identification of alleles contributing to the habitat switch or other traits important to weed/invasive management and/or crop improvement.
Asunto(s)
Ecosistema , Variación Genética , Sorghum/genética , Teorema de Bayes , Colombia , Genotipo , Especies Introducidas , Desequilibrio de Ligamiento , Repeticiones de Microsatélite/genética , Análisis de Componente Principal , Sorghum/crecimiento & desarrollo , Estados UnidosRESUMEN
Among the seven tetraploid cotton species, little is known about transmission genetics and genome organization in Gossypium mustelinum, the species most distant from the source of most cultivated cotton, G. hirsutum In this research, an F2 population was developed from an interspecific cross between G. hirsutum and G. mustelinum (HM). A genetic linkage map was constructed mainly using simple sequence repeat (SSRs) and restriction fragment length polymorphism (RFLP) DNA markers. The arrangements of most genetic loci along the HM chromosomes were identical to those of other tetraploid cotton species. However, both major and minor structural rearrangements were also observed, for which we propose a parsimony-based model for structural divergence of tetraploid cottons from common ancestors. Sequences of mapped markers were used for alignment with the 26 scaffolds of the G. hirsutum draft genome, and showed high consistency. Quantitative trait locus (QTL) mapping of fiber elongation in advanced backcross populations derived from the same parents demonstrated the value of the HM map. The HM map will serve as a valuable resource for QTL mapping and introgression of G. mustelinum alleles into G. hirsutum, and help clarify evolutionary relationships between the tetraploid cotton genomes.
Asunto(s)
Mapeo Cromosómico , Genoma de Planta , Genómica , Gossypium/genética , Sitios de Carácter Cuantitativo , Cromosomas de las Plantas , Cruzamientos Genéticos , Estudios de Asociación Genética , Ligamiento Genético , Genómica/métodos , Gossypium/clasificación , Filogenia , Mapeo Físico de Cromosoma , Carácter Cuantitativo Heredable , TetraploidíaRESUMEN
BACKGROUND AND AIMS: Peanut (Arachis hypogaea) is an allotetraploid (AABB-type genome) of recent origin, with a genome of about 2·8 Gb and a high repetitive content. This study reports an analysis of the repetitive component of the peanut A genome using bacterial artificial chromosome (BAC) clones from A. duranensis, the most probable A genome donor, and the probable consequences of the activity of these elements since the divergence of the peanut A and B genomes. METHODS: The repetitive content of the A genome was analysed by using A. duranensis BAC clones as probes for fluorescence in situ hybridization (BAC-FISH), and by sequencing and characterization of 12 genomic regions. For the analysis of the evolutionary dynamics, two A genome regions are compared with their B genome homeologues. KEY RESULTS: BAC-FISH using 27 A. duranensis BAC clones as probes gave dispersed and repetitive DNA characteristic signals, predominantly in interstitial regions of the peanut A chromosomes. The sequences of 14 BAC clones showed complete and truncated copies of ten abundant long terminal repeat (LTR) retrotransposons, characterized here. Almost all dateable transposition events occurred <3·5 million years ago, the estimated date of the divergence of A and B genomes. The most abundant retrotransposon is Feral, apparently parasitic on the retrotransposon FIDEL, followed by Pipa, also non-autonomous and probably parasitic on a retrotransposon we named Pipoka. The comparison of the A and B genome homeologous regions showed conserved segments of high sequence identity, punctuated by predominantly indel regions without significant similarity. CONCLUSIONS: A substantial proportion of the highly repetitive component of the peanut A genome appears to be accounted for by relatively few LTR retrotransposons and their truncated copies or solo LTRs. The most abundant of the retrotransposons are non-autonomous. The activity of these retrotransposons has been a very significant driver of genome evolution since the evolutionary divergence of the A and B genomes.
Asunto(s)
Arachis/genética , ADN Intergénico , Evolución Molecular , Genoma de Planta , Cromosomas Artificiales Bacterianos/genética , Hibridación Fluorescente in Situ , Filogenia , Secuencias Repetitivas de Ácidos Nucleicos , Retroelementos/fisiologíaRESUMEN
Cultivated peanut is an allotetraploid with an AB-genome. In order to learn more of the genomic structure of peanut, we characterized and studied the evolution of a retrotransposon originally isolated from a resistance gene analog (RGA)-containing bacterial artificial chromosome (BAC) clone. It is a moderate copy number Ty1-copia retrotransposon from the Bianca lineage and we named it Matita. Fluorescent in situ hybridization (FISH) experiments showed that Matita is mainly located on the distal regions of chromosome arms and is of approximately equal frequency on both A- and B-chromosomes. Its chromosome-specific hybridization pattern facilitates the identification of individual chromosomes, a useful cytogenetic tool considering that chromosomes in peanut are mostly metacentric and of similar size. Phylogenetic analysis of Matita elements, molecular dating of transposition events, and an estimation of the evolutionary divergence of the most probable A- and B-donor species suggest that Matita underwent its last major burst of transposition activity at around the same time of the A- and B-genome divergence about 3.5 million years ago. By probing BAC libraries with overgos probes for Matita, resistance gene analogues, and single- or low-copy genes, it was demonstrated that Matita is not randomly distributed in the genome but exhibits a significant tendency of being more abundant near resistance gene homologues than near single-copy genes. The described work is a further step towards broadening the knowledge on genomic and chromosomal structure of peanut and on its evolution.