Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
Add more filters










Publication year range
1.
Nat Plants ; 2024 May 30.
Article in English | MEDLINE | ID: mdl-38816498

ABSTRACT

Cotton (Gossypium hirsutum L.) is the key renewable fibre crop worldwide, yet its yield and fibre quality show high variability due to genotype-specific traits and complex interactions among cultivars, management practices and environmental factors. Modern breeding practices may limit future yield gains due to a narrow founding gene pool. Precision breeding and biotechnological approaches offer potential solutions, contingent on accurate cultivar-specific data. Here we address this need by generating high-quality reference genomes for three modern cotton cultivars ('UGA230', 'UA48' and 'CSX8308') and updating the 'TM-1' cotton genetic standard reference. Despite hypothesized genetic uniformity, considerable sequence and structural variation was observed among the four genomes, which overlap with ancient and ongoing genomic introgressions from 'Pima' cotton, gene regulatory mechanisms and phenotypic trait divergence. Differentially expressed genes across fibre development correlate with fibre production, potentially contributing to the distinctive fibre quality traits observed in modern cotton cultivars. These genomes and comparative analyses provide a valuable foundation for future genetic endeavours to enhance global cotton yield and sustainability.

2.
Proc Natl Acad Sci U S A ; 121(4): e2312607121, 2024 Jan 23.
Article in English | MEDLINE | ID: mdl-38236735

ABSTRACT

Homosporous lycophytes (Lycopodiaceae) are a deeply diverged lineage in the plant tree of life, having split from heterosporous lycophytes (Selaginella and Isoetes) ~400 Mya. Compared to the heterosporous lineage, Lycopodiaceae has markedly larger genome sizes and remains the last major plant clade for which no chromosome-level assembly has been available. Here, we present chromosomal genome assemblies for two homosporous lycophyte species, the allotetraploid Huperzia asiatica and the diploid Diphasiastrum complanatum. Remarkably, despite that the two species diverged ~350 Mya, around 30% of the genes are still in syntenic blocks. Furthermore, both genomes had undergone independent whole genome duplications, and the resulting intragenomic syntenies have likewise been preserved relatively well. Such slow genome evolution over deep time is in stark contrast to heterosporous lycophytes and is correlated with a decelerated rate of nucleotide substitution. Together, the genomes of H. asiatica and D. complanatum not only fill a crucial gap in the plant genomic landscape but also highlight a potentially meaningful genomic contrast between homosporous and heterosporous species.


Subject(s)
Genome, Plant , Genomics , Genome, Plant/genetics , Genome Size , Phylogeny , Evolution, Molecular
3.
Nat Commun ; 15(1): 579, 2024 Jan 17.
Article in English | MEDLINE | ID: mdl-38233380

ABSTRACT

Frogs are an ecologically diverse and phylogenetically ancient group of anuran amphibians that include important vertebrate cell and developmental model systems, notably the genus Xenopus. Here we report a high-quality reference genome sequence for the western clawed frog, Xenopus tropicalis, along with draft chromosome-scale sequences of three distantly related emerging model frog species, Eleutherodactylus coqui, Engystomops pustulosus, and Hymenochirus boettgeri. Frog chromosomes have remained remarkably stable since the Mesozoic Era, with limited Robertsonian (i.e., arm-preserving) translocations and end-to-end fusions found among the smaller chromosomes. Conservation of synteny includes conservation of centromere locations, marked by centromeric tandem repeats associated with Cenp-a binding surrounded by pericentromeric LINE/L1 elements. This work explores the structure of chromosomes across frogs, using a dense meiotic linkage map for X. tropicalis and chromatin conformation capture (Hi-C) data for all species. Abundant satellite repeats occupy the unusually long (~20 megabase) terminal regions of each chromosome that coincide with high rates of recombination. Both embryonic and differentiated cells show reproducible associations of centromeric chromatin and of telomeres, reflecting a Rabl-like configuration. Our comparative analyses reveal 13 conserved ancestral anuran chromosomes from which contemporary frog genomes were constructed.


Subject(s)
Chromatin , Evolution, Molecular , Animals , Chromatin/genetics , Genome/genetics , Anura/genetics , Xenopus/genetics , Centromere/genetics
4.
Plant J ; 116(4): 1003-1017, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37675609

ABSTRACT

Populus species play a foundational role in diverse ecosystems and are important renewable feedstocks for bioenergy and bioproducts. Hybrid aspen Populus tremula × P. alba INRA 717-1B4 is a widely used transformation model in tree functional genomics and biotechnology research. As an outcrossing interspecific hybrid, its genome is riddled with sequence polymorphisms which present a challenge for sequence-sensitive analyses. Here we report a telomere-to-telomere genome for this hybrid aspen with two chromosome-scale, haplotype-resolved assemblies. We performed a comprehensive analysis of the repetitive landscape and identified both tandem repeat array-based and array-less centromeres. Unexpectedly, the most abundant satellite repeats in both haplotypes lie outside of the centromeres, consist of a 147 bp monomer PtaM147, frequently span >1 megabases, and form heterochromatic knobs. PtaM147 repeats are detected exclusively in aspens (section Populus) but PtaM147-like sequences occur in LTR-retrotransposons of closely related species, suggesting their origin from the retrotransposons. The genomic resource generated for this transformation model genotype has greatly improved the design and analysis of genome editing experiments that are highly sensitive to sequence polymorphisms. The work should motivate future hypothesis-driven research to probe into the function of the abundant and aspen-specific PtaM147 satellite DNA.


Subject(s)
DNA, Satellite , Populus , DNA, Satellite/genetics , Haplotypes/genetics , Populus/genetics , Ecosystem , Retroelements , Centromere/genetics
5.
Nucleic Acids Res ; 51(16): 8383-8401, 2023 09 08.
Article in English | MEDLINE | ID: mdl-37526283

ABSTRACT

Gene functional descriptions offer a crucial line of evidence for candidate genes underlying trait variation. Conversely, plant responses to environmental cues represent important resources to decipher gene function and subsequently provide molecular targets for plant improvement through gene editing. However, biological roles of large proportions of genes across the plant phylogeny are poorly annotated. Here we describe the Joint Genome Institute (JGI) Plant Gene Atlas, an updateable data resource consisting of transcript abundance assays spanning 18 diverse species. To integrate across these diverse genotypes, we analyzed expression profiles, built gene clusters that exhibited tissue/condition specific expression, and tested for transcriptional response to environmental queues. We discovered extensive phylogenetically constrained and condition-specific expression profiles for genes without any previously documented functional annotation. Such conserved expression patterns and tightly co-expressed gene clusters let us assign expression derived additional biological information to 64 495 genes with otherwise unknown functions. The ever-expanding Gene Atlas resource is available at JGI Plant Gene Atlas (https://plantgeneatlas.jgi.doe.gov) and Phytozome (https://phytozome.jgi.doe.gov/), providing bulk access to data and user-specified queries of gene sets. Combined, these web interfaces let users access differentially expressed genes, track orthologs across the Gene Atlas plants, graphically represent co-expressed genes, and visualize gene ontology and pathway enrichments.


Subject(s)
Genes, Plant , Transcriptome , Gene Expression Regulation, Plant , Genome, Plant , Phylogeny , Software , Transcriptome/genetics , Atlases as Topic
6.
Genome Res ; 32(10): 1952-1964, 2022 10.
Article in English | MEDLINE | ID: mdl-36109148

ABSTRACT

We assembled the 9.8-Gbp genome of western redcedar (WRC; Thuja plicata), an ecologically and economically important conifer species of the Cupressaceae. The genome assembly, derived from a uniquely inbred tree produced through five generations of self-fertilization (selfing), was determined to be 86% complete by BUSCO analysis, one of the most complete genome assemblies for a conifer. Population genomic analysis revealed WRC to be one of the most genetically depauperate wild plant species, with an effective population size of approximately 300 and no significant genetic differentiation across its geographic range. Nucleotide diversity, π, is low for a continuous tree species, with many loci showing zero diversity, and the ratio of π at zero- to fourfold degenerate sites is relatively high (approximately 0.33), suggestive of weak purifying selection. Using an array of genetic lines derived from up to five generations of selfing, we explored the relationship between genetic diversity and mating system. Although overall heterozygosity was found to decline faster than expected during selfing, heterozygosity persisted at many loci, and nearly 100 loci were found to deviate from expectations of genetic drift, suggestive of associative overdominance. Nonreference alleles at such loci often harbor deleterious mutations and are rare in natural populations, implying that balanced polymorphisms are maintained by linkage to dominant beneficial alleles. This may account for how WRC remains responsive to natural and artificial selection, despite low genetic diversity.


Subject(s)
Tracheophyta , Tracheophyta/genetics , Self-Fertilization/genetics , Alleles , Heterozygote , Polymorphism, Genetic , Genetic Variation , Selection, Genetic
7.
Nat Commun ; 12(1): 4125, 2021 07 05.
Article in English | MEDLINE | ID: mdl-34226565

ABSTRACT

Genome-enabled biotechnologies have the potential to accelerate breeding efforts in long-lived perennial crop species. Despite the transformative potential of molecular tools in pecan and other outcrossing tree species, highly heterozygous genomes, significant presence-absence gene content variation, and histories of interspecific hybridization have constrained breeding efforts. To overcome these challenges, here, we present diploid genome assemblies and annotations of four outbred pecan genotypes, including a PacBio HiFi chromosome-scale assembly of both haplotypes of the 'Pawnee' cultivar. Comparative analysis and pan-genome integration reveal substantial and likely adaptive interspecific genomic introgressions, including an over-retained haplotype introgressed from bitternut hickory into pecan breeding pedigrees. Further, by leveraging our pan-genome presence-absence and functional annotation database among genomes and within the two outbred haplotypes of the 'Lakota' genome, we identify candidate genes for pest and pathogen resistance. Combined, these analyses and resources highlight significant progress towards functional and quantitative genomics in highly diverse and outbred crops.


Subject(s)
Carya/genetics , Chromosomes , Genome, Plant , Genomics , Plant Breeding , Diploidy , Disease Resistance/genetics , Genetic Variation , Genotype , Haplotypes , Phenotype
8.
HGG Adv ; 2(2)2021 Apr 08.
Article in English | MEDLINE | ID: mdl-33937879

ABSTRACT

Exome and genome sequencing have proven to be effective tools for the diagnosis of neurodevelopmental disorders (NDDs), but large fractions of NDDs cannot be attributed to currently detectable genetic variation. This is likely, at least in part, a result of the fact that many genetic variants are difficult or impossible to detect through typical short-read sequencing approaches. Here, we describe a genomic analysis using Pacific Biosciences circular consensus sequencing (CCS) reads, which are both long (>10 kb) and accurate (>99% bp accuracy). We used CCS on six proband-parent trios with NDDs that were unexplained despite extensive testing, including genome sequencing with short reads. We identified variants and created de novo assemblies in each trio, with global metrics indicating these datasets are more accurate and comprehensive than those provided by short-read data. In one proband, we identified a likely pathogenic (LP), de novo L1-mediated insertion in CDKL5 that results in duplication of exon 3, leading to a frameshift. In a second proband, we identified multiple large de novo structural variants, including insertion-translocations affecting DGKB and MLLT3, which we show disrupt MLLT3 transcript levels. We consider this extensive structural variation likely pathogenic. The breadth and quality of variant detection, coupled to finding variants of clinical and research interest in two of six probands with unexplained NDDs, support the hypothesis that long-read genome sequencing can substantially improve rare disease genetic discovery rates.

9.
Plant Cell ; 33(6): 1888-1906, 2021 07 19.
Article in English | MEDLINE | ID: mdl-33710295

ABSTRACT

Sequence assembly of large and repeat-rich plant genomes has been challenging, requiring substantial computational resources and often several complementary sequence assembly and genome mapping approaches. The recent development of fast and accurate long-read sequencing by circular consensus sequencing (CCS) on the PacBio platform may greatly increase the scope of plant pan-genome projects. Here, we compare current long-read sequencing platforms regarding their ability to rapidly generate contiguous sequence assemblies in pan-genome studies of barley (Hordeum vulgare). Most long-read assemblies are clearly superior to the current barley reference sequence based on short-reads. Assemblies derived from accurate long reads excel in most metrics, but the CCS approach was the most cost-effective strategy for assembling tens of barley genomes. A downsampling analysis indicated that 20-fold CCS coverage can yield very good sequence assemblies, while even five-fold CCS data may capture the complete sequence of most genes. We present an updated reference genome assembly for barley with near-complete representation of the repeat-rich intergenic space. Long-read assembly can underpin the construction of accurate and complete sequences of multiple genomes of a species to build pan-genome infrastructures in Triticeae crops and their wild relatives.


Subject(s)
Genomics/methods , High-Throughput Nucleotide Sequencing/methods , Hordeum/genetics , Computational Biology/methods , DNA, Intergenic , Genome, Plant , Molecular Sequence Annotation , Retroelements , Sequence Analysis, DNA , Terminal Repeat Sequences
10.
Nature ; 588(7837): 284-289, 2020 12.
Article in English | MEDLINE | ID: mdl-33239781

ABSTRACT

Genetic diversity is key to crop improvement. Owing to pervasive genomic structural variation, a single reference genome assembly cannot capture the full complement of sequence diversity of a crop species (known as the 'pan-genome'1). Multiple high-quality sequence assemblies are an indispensable component of a pan-genome infrastructure. Barley (Hordeum vulgare L.) is an important cereal crop with a long history of cultivation that is adapted to a wide range of agro-climatic conditions2. Here we report the construction of chromosome-scale sequence assemblies for the genotypes of 20 varieties of barley-comprising landraces, cultivars and a wild barley-that were selected as representatives of global barley diversity. We catalogued genomic presence/absence variants and explored the use of structural variants for quantitative genetic analysis through whole-genome shotgun sequencing of 300 gene bank accessions. We discovered abundant large inversion polymorphisms and analysed in detail two inversions that are frequently found in current elite barley germplasm; one is probably the product of mutation breeding and the other is tightly linked to a locus that is involved in the expansion of geographical range. This first-generation barley pan-genome makes previously hidden genetic variation accessible to genetic studies and breeding.


Subject(s)
Chromosomes, Plant/genetics , Genome, Plant/genetics , Hordeum/genetics , Internationality , Mutation , Plant Breeding , Chromosome Inversion/genetics , Chromosome Mapping , Genetic Loci/genetics , Genotype , Hordeum/classification , Polymorphism, Genetic/genetics , Reference Standards , Seed Bank , Sequence Inversion , Whole Genome Sequencing
11.
Nat Genet ; 52(5): 525-533, 2020 05.
Article in English | MEDLINE | ID: mdl-32313247

ABSTRACT

Polyploidy is an evolutionary innovation for many animals and all flowering plants, but its impact on selection and domestication remains elusive. Here we analyze genome evolution and diversification for all five allopolyploid cotton species, including economically important Upland and Pima cottons. Although these polyploid genomes are conserved in gene content and synteny, they have diversified by subgenomic transposon exchanges that equilibrate genome size, evolutionary rate heterogeneities and positive selection between homoeologs within and among lineages. These differential evolutionary trajectories are accompanied by gene-family diversification and homoeolog expression divergence among polyploid lineages. Selection and domestication drive parallel gene expression similarities in fibers of two cultivated cottons, involving coexpression networks and N6-methyladenosine RNA modifications. Furthermore, polyploidy induces recombination suppression, which correlates with altered epigenetic landscapes and can be overcome by wild introgression. These genomic insights will empower efforts to manipulate genetic recombination and modify epigenetic landscapes and target genes for crop improvement.


Subject(s)
Genome, Plant/genetics , Gossypium/genetics , Cotton Fiber , Domestication , Epigenomics/methods , Evolution, Molecular , Gene Expression Regulation, Plant/genetics , Genomics/methods , Phylogeny , Polyploidy
12.
Plant J ; 100(5): 1066-1082, 2019 12.
Article in English | MEDLINE | ID: mdl-31433882

ABSTRACT

We report reference-quality genome assemblies and annotations for two accessions of soybean (Glycine max) and for one accession of Glycine soja, the closest wild relative of G. max. The G. max assemblies provided are for widely used US cultivars: the northern line Williams 82 (Wm82) and the southern line Lee. The Wm82 assembly improves the prior published assembly, and the Lee and G. soja assemblies are new for these accessions. Comparisons among the three accessions show generally high structural conservation, but nucleotide difference of 1.7 single-nucleotide polymorphisms (snps) per kb between Wm82 and Lee, and 4.7 snps per kb between these lines and G. soja. snp distributions and comparisons with genotypes of the Lee and Wm82 parents highlight patterns of introgression and haplotype structure. Comparisons against the US germplasm collection show placement of the sequenced accessions relative to global soybean diversity. Analysis of a pan-gene collection shows generally high conservation, with variation occurring primarily in genomically clustered gene families. We found approximately 40-42 inversions per chromosome between either Lee or Wm82v4 and G. soja, and approximately 32 inversions per chromosome between Wm82 and Lee. We also investigated five domestication loci. For each locus, we found two different alleles with functional differences between G. soja and the two domesticated accessions. The genome assemblies for multiple cultivated accessions and for the closest wild ancestor of soybean provides a valuable set of resources for identifying causal variants that underlie traits for the domestication and improvement of soybean, serving as a basis for future research and crop improvement efforts for this important crop species.


Subject(s)
Fabaceae/genetics , Genetic Variation , Genome, Plant , Alleles , Centromere/genetics , Disease Resistance/genetics , Genetics, Population , Genotype , Haplotypes , Hardness , Multigene Family , Phylogeny , Polymorphism, Single Nucleotide , Quantitative Trait Loci , Repetitive Sequences, Nucleic Acid , Seed Bank/classification , Sequence Inversion , Telomere/genetics
13.
Nat Commun ; 9(1): 5213, 2018 12 06.
Article in English | MEDLINE | ID: mdl-30523281

ABSTRACT

Environmental stress is a major driver of ecological community dynamics and agricultural productivity. This is especially true for soil water availability, because drought is the greatest abiotic inhibitor of worldwide crop yields. Here, we test the genetic basis of drought responses in the genetic model for C4 perennial grasses, Panicum hallii, through population genomics, field-scale gene-expression (eQTL) analysis, and comparison of two complete genomes. While gene expression networks are dominated by local cis-regulatory elements, we observe three genomic hotspots of unlinked trans-regulatory loci. These regulatory hubs are four times more drought responsive than the genome-wide average. Additionally, cis- and trans-regulatory networks are more likely to have opposing effects than expected under neutral evolution, supporting a strong influence of compensatory evolution and stabilizing selection. These results implicate trans-regulatory evolution as a driver of drought responses and demonstrate the potential for crop improvement in drought-prone regions through modification of gene regulatory networks.


Subject(s)
Droughts , Gene Expression Regulation, Plant , Genomics/methods , Panicum/genetics , Stress, Physiological , Gene Regulatory Networks , Genes, Plant/genetics , Genotype , Panicum/classification , Phylogeny , Quantitative Trait Loci/genetics , Species Specificity
14.
Genome Res ; 26(4): 510-8, 2016 Apr.
Article in English | MEDLINE | ID: mdl-26953271

ABSTRACT

Climatic adaptation is an example of a genotype-by-environment interaction (G×E) of fitness. Selection upon gene expression regulatory variation can contribute to adaptive phenotypic diversity; however, surprisingly few studies have examined how genome-wide patterns of gene expression G×E are manifested in response to environmental stress and other selective agents that cause climatic adaptation. Here, we characterize drought-responsive expression divergence between upland (drought-adapted) and lowland (mesic) ecotypes of the perennial C4 grass,Panicum hallii, in natural field conditions. Overall, we find that cis-regulatory elements contributed to gene expression divergence across 47% of genes, 7.2% of which exhibit drought-responsive G×E. While less well-represented, we observe 1294 genes (7.8%) with transeffects.Trans-by-environment interactions are weaker and much less common than cis G×E, occurring in only 0.7% oft rans-regulated genes. Finally, gene expression heterosis is highly enriched in expression phenotypes with significant G×E. As such, modes of inheritance that drive heterosis, such as dominance or overdominance, may be common among G×E genes. Interestingly, motifs specific to drought-responsive transcription factors are highly enriched in the promoters of genes exhibiting G×E and transregulation, indicating that expression G×E and heterosis may result from the evolution of transcription factors or their binding sites.P. hallii serves as the genomic model for its close relative and emerging biofuel crop, switchgrass (Panicum virgatum). Accordingly, the results here not only aid in the discovery of the genetic mechanisms that underlie local adaptation but also provide a foundation to improve switchgrass yield under water-limited conditions.


Subject(s)
Droughts , Ecotype , Gene Expression Regulation, Plant , Poaceae/genetics , Alleles , Climate , Gene-Environment Interaction , Genes, Plant , Genotype , Hybridization, Genetic
SELECTION OF CITATIONS
SEARCH DETAIL
...