Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 28
Filter
Add more filters











Publication year range
1.
G3 (Bethesda) ; 11(12)2021 12 08.
Article in English | MEDLINE | ID: mdl-34515796

ABSTRACT

Aegilops tauschii is the donor of the D subgenome of hexaploid wheat and an important genetic resource. The reference-quality genome sequence Aet v4.0 for Ae. tauschii acc. AL8/78 was therefore an important milestone for wheat biology and breeding. Further advances in sequencing acc. AL8/78 and release of the Aet v5.0 sequence assembly are reported here. Two new optical maps were constructed and used in the revision of pseudomolecules. Gaps were closed with Pacific Biosciences long-read contigs, decreasing the gap number by 38,899. Transposable elements and protein-coding genes were reannotated. The number of annotated high-confidence genes was reduced from 39,635 in Aet v4.0 to 32,885 in Aet v5.0. A total of 2245 biologically important genes, including those affecting plant phenology, grain quality, and tolerance of abiotic stresses in wheat, was manually annotated and disease-resistance genes were annotated by a dedicated pipeline. Disease-resistance genes encoding nucleotide-binding site domains, receptor-like protein kinases, and receptor-like proteins were preferentially located in distal chromosome regions, whereas those encoding transmembrane coiled-coil proteins were dispersed more evenly along the chromosomes. Discovery, annotation, and expression analyses of microRNA (miRNA) precursors, mature miRNAs, and phasiRNAs are reported, including miRNA target genes. Other small RNAs, such as hc-siRNAs and tRFs, were characterized. These advances enhance the utility of the Ae. tauschii genome sequence for wheat genetics, biotechnology, and breeding.


Subject(s)
Aegilops , Genome, Plant , Plant Breeding , Poaceae/genetics , Triticum/genetics
2.
Plant J ; 107(1): 303-314, 2021 07.
Article in English | MEDLINE | ID: mdl-33893684

ABSTRACT

Until recently, achieving a reference-quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large genome size and > 80% repetitive sequence content. The release of the chromosome scale 14.5-Gb IWGSC RefSeq v1.0 genome sequence of bread wheat cv. Chinese Spring (CS) was, therefore, a milestone. Here, we used a direct label and stain (DLS) optical map of the CS genome together with a prior nick, label, repair and stain (NLRS) optical map, and sequence contigs assembled with Pacific Biosciences long reads, to refine the v1.0 assembly. Inconsistencies between the sequence and maps were reconciled and gaps were closed. Gap filling and anchoring of 279 unplaced scaffolds increased the total length of pseudomolecules by 168 Mb (excluding Ns). Positions and orientations were corrected for 233 and 354 scaffolds, respectively, representing 10% of the genome sequence. The accuracy of the remaining 90% of the assembly was validated. As a result of the increased contiguity, the numbers of transposable elements (TEs) and intact TEs have increased in IWGSC RefSeq v2.1 compared with v1.0. In total, 98% of the gene models identified in v1.0 were mapped onto this new assembly through development of a dedicated approach implemented in the MAGAAT pipeline. The numbers of high-confidence genes on pseudomolecules have increased from 105 319 to 105 534. The reconciled assembly enhances the utility of the sequence for genetic mapping, comparative genomics, gene annotation and isolation, and more general studies on the biology of wheat.


Subject(s)
Chromosome Mapping/methods , Genome, Plant , Triticum/genetics , Chromosomes, Artificial, Bacterial , Chromosomes, Plant/chemistry , DNA Transposable Elements , Molecular Sequence Annotation
3.
Plants (Basel) ; 9(10)2020 Oct 09.
Article in English | MEDLINE | ID: mdl-33050178

ABSTRACT

MicroRNAs (miRNAs) are important factors for the post-transcriptional regulation of protein-coding genes in plants and animals. They are discovered either by sequencing small RNAs or computationally. We employed a sequence-homology-based computational approach to identify conserved miRNAs and their target genes in Persian (English) walnut, Juglans regia, and its North American wild relative, J. microcarpa. A total of 119 miRNA precursors (pre-miRNAs) were detected in the J. regia genome and 121 in the J. microcarpa genome and miRNA target genes were predicted and their functional annotations were performed in both genomes. In the J. regia genome, 325 different genes were targets; 87.08% were regulated by transcript cleavage and 12.92% by translation repression. In the J. microcarpa genome, 316 different genes were targets; 88.92% were regulated by transcript cleavage and 11.08% were regulated by translation repression. Totals of 1.3% and 2.0% of all resistance gene analogues (RGA) and 2.7% and 2.6% of all transcription factors (TFs) were regulated by miRNAs in the J. regia and J. microcarpa genomes, respectively. Juglans genomes evolved by a whole genome duplication (WGD) and consist of eight pairs of fractionated homoeologous chromosomes. Within each pair, the chromosome that has more genes with greater average transcription also harbors more pre-miRNAs and more target genes than its homoeologue. While only minor differences were detected in pre-miRNAs between the J. regia and J. microcarpa genomes, about one-third of the pre-miRNA loci were not conserved between homoeologous chromosome within each genome. Pre-miRNA and their corresponding target genes showed a tendency to be collocated within a subgenome.

4.
New Phytol ; 228(3): 1011-1026, 2020 11.
Article in English | MEDLINE | ID: mdl-32569398

ABSTRACT

Powdery mildew poses severe threats to wheat production. The most sustainable way to control this disease is through planting resistant cultivars. We report the map-based cloning of the powdery mildew resistance allele Pm5e from a Chinese wheat landrace. We applied a two-step bulked segregant RNA sequencing (BSR-Seq) approach in developing tightly linked or co-segregating markers to Pm5e. The first BSR-Seq used phenotypically contrasting bulks of recombinant inbred lines (RILs) to identify Pm5e-linked markers. The second BSR-Seq utilized bulks of genetic recombinants screened from a fine-mapping population to precisely quantify the associated genomic variation in the mapping interval, and identified the Pm5e candidate genes. The function of Pm5e was validated by transgenic assay, loss-of-function mutants and haplotype association analysis. Pm5e encodes a nucleotide-binding domain leucine-rich-repeat-containing (NLR) protein. A rare nonsynonymous single nucleotide variant (SNV) within the C-terminal leucine rich repeat (LRR) domain is responsible for the gain of powdery mildew resistance function of Pm5e, an allele endemic to wheat landraces of Shaanxi province of China. Results from this study demonstrate the value of landraces in discovering useful genes for modern wheat breeding. The key SNV associated with powdery mildew resistance will be useful for marker-assisted selection of Pm5e in wheat breeding programs.


Subject(s)
Disease Resistance , Triticum , China , Disease Resistance/genetics , Genes, Plant , Nucleotides , Plant Breeding , Plant Diseases/genetics , Triticum/genetics
5.
Theor Appl Genet ; 133(9): 2545-2554, 2020 Sep.
Article in English | MEDLINE | ID: mdl-32494869

ABSTRACT

KEY MESSAGE: A locus for perennial growth was mapped on Lophopyrum elongatum chromosome arm 4ES and introgressed into the wheat genome. Evidence was obtained that in addition to chromosome 4E, other L. elongatum chromosomes control perennial growth. Monocarpy versus polycarpy is one of the fundamental developmental dichotomies in flowering plants. Advances in the understanding of the genetic basis of this dichotomy are important for basic biological reasons and practically for genetic manipulation of growth development in economically important plants. Nine wheat introgression lines (ILs) harboring germplasm of the Lophopyrum elongatum genome present in the octoploid amphiploid Triticum aestivum cv. Chinese Spring (subgenomes AABBDD) × L. elongatum (genomes EE) were selected from a population of ILs developed earlier. These ILs were employed here in genomic analyses of post-sexual cycle regrowth (PSCR), which is a component of polycarpy in caespitose L. elongatum. Analyses of disomic substitution (DS) lines confirmed that L. elongatum chromosome 4E confers PSCR on wheat. The gene was mapped into a short distal region of L. elongatum arm 4ES and was tentatively named Pscr1. ILs harboring recombined chromosomes with 4ES segments, including Pscr1, incorporated into the distal part of the 4DS chromosome arm were identified. Based on the location, Pscr1 is not orthologous with the rice rhizome-development gene Rhz2 located on rice chromosome Os3, which is homoeologous with chromosome 4E, but it may correspond to the Teosinte branched1 (TB1) gene, which is located in the introgressed region in the L. elongatum and Ae. tauschii genomes. A hexaploid IL harboring a large portion of the E-genome but devoid of chromosome 4E also expressed PSCR, which provided evidence that perennial growth is controlled by genes on other L. elongatum chromosomes in addition to 4E.


Subject(s)
Genes, Plant , Plant Breeding , Poaceae/growth & development , Triticum/genetics , Chromosome Mapping , Chromosomes, Plant , Genotype , Poaceae/genetics , Polymorphism, Single Nucleotide , Polyploidy
6.
Nat Commun ; 11(1): 680, 2020 02 03.
Article in English | MEDLINE | ID: mdl-32015344

ABSTRACT

Powdery mildew, caused by Blumeria graminis f. sp. tritici (Bgt), is one of the most destructive diseases that pose a great threat to wheat production. Wheat landraces represent a rich source of powdery mildew resistance. Here, we report the map-based cloning of powdery mildew resistance gene Pm24 from Chinese wheat landrace Hulutou. It encodes a tandem kinase protein (TKP) with putative kinase-pseudokinase domains, designated WHEAT TANDEM KINASE 3 (WTK3). The resistance function of Pm24 was validated by transgenic assay, independent mutants, and allelic association analyses. Haplotype analysis revealed that a rare 6-bp natural deletion of lysine-glycine codons, endemic to wheat landraces of Shaanxi Province, China, in the kinase I domain (Kin I) of WTK3 is critical for the resistance function. Transgenic assay of WTK3 chimeric variants revealed that only the specific two amino acid deletion, rather than any of the single or more amino acid deletions, in the Kin I of WTK3 is responsible for gaining the resistance function of WTK3 against the Bgt fungus.


Subject(s)
Disease Resistance/genetics , Gain of Function Mutation , Genes, Plant/genetics , Plant Diseases/microbiology , Triticum/genetics , Ascomycota/pathogenicity , China , Hydrogen Peroxide/metabolism , Mutagenesis , Plant Immunity/genetics , Plant Proteins/genetics , Plants, Genetically Modified , Protein Domains , Protein Kinases/genetics , Transformation, Genetic
7.
Theor Appl Genet ; 133(4): 1227-1241, 2020 Apr.
Article in English | MEDLINE | ID: mdl-31980837

ABSTRACT

KEY MESSAGE: We introgressed wheatgrass germplasm from the octoploid amphiploid Triticum aestivum× Lophopyrum elongatum into wheat by manipulating the wheat Ph1 gene and discovered and characterized 130 introgression lines harboring single or, in various combinations, complete and recombined L. elongatum chromosomes. Diploid wheatgrass Lophopyrum elongatum (genomes EE) possesses valuable traits for wheat genetics and breeding. We evaluated several strategies for introgression of this germplasm into wheat. To detect it, we developed and validated multiplexed sets of Sequenom MassARRAY single nucleotide polymorphism (SNP) markers, which differentiated disomic and monosomic L. elongatum chromosomes from wheat chromosomes. We identified 130 introgression lines (ILs), which harbored 108 complete and 89 recombined L. elongatum chromosomes. Of the latter, 59 chromosomes were recombined by one or more crossovers and 30 were involved in centromeric (Robertsonian) translocations or were telocentric. To identify wheat chromosomes substituted for or recombined with L. elongatum chromosomes, we genotyped the ILs with the wheat 90-K Infinium SNP array. We found that most of the wheat 90-K probes correctly detected their targets in the L. elongatum genome and showed that some wheat SNPs are ancient and had originated prior to the divergence of the wheat and L. elongatum lineages. Of the 130 ILs, 52% were homozygous for Ph1 deletion and thus are staged to be recombined further. We failed to detect in the L. elongatum genome the 4/5 reciprocal translocation that has been reported in Thinopyrum bessarabicum and several other Triticeae genomes.


Subject(s)
Crosses, Genetic , Genome, Plant , Inbreeding , Ploidies , Poaceae/genetics , Triticum/genetics , Bread , Chromosomes, Plant/genetics , Genetic Markers , Polymorphism, Single Nucleotide/genetics
8.
Hortic Res ; 6: 55, 2019.
Article in English | MEDLINE | ID: mdl-30937174

ABSTRACT

Members of the genus Juglans are monecious wind-pollinated trees in the family Juglandaceae with highly heterozygous genomes, which greatly complicates genome sequence assembly. The genomes of interspecific hybrids are usually comprised of haploid genomes of parental species. We exploited this attribute of interspecific hybrids to avoid heterozygosity and sequenced an interspecific hybrid Juglans microcarpa × J. regia using a novel combination of single-molecule sequencing and optical genome mapping technologies. The resulting assemblies of both genomes were remarkably complete including chromosome termini and centromere regions. Chromosome termini consisted of arrays of telomeric repeats about 8 kb long and heterochromatic subtelomeric regions about 10 kb long. The centromeres consisted of arrays of a centromere-specific Gypsy retrotransposon and most contained genes, many of them transcribed. Juglans genomes evolved by a whole-genome-duplication dating back to the Cretaceous-Paleogene boundary and consist of two subgenomes, which were fractionated by numerous short gene deletions evenly distributed along the length of the chromosomes. Fractionation was shown to be asymmetric with one subgenome exhibiting greater gene loss than the other. The asymmetry of the process is ongoing and mirrors an asymmetry in gene expression between the subgenomes. Given the importance of J. microcarpa × J. regia hybrids as potential walnut rootstocks, we catalogued disease resistance genes in the parental genomes and studied their chromosomal distribution. We also estimated the molecular clock rates for woody perennials and deployed them in estimating divergence times of Juglans genomes and those of other woody perennials.

9.
G3 (Bethesda) ; 9(3): 619-624, 2019 03 07.
Article in English | MEDLINE | ID: mdl-30622124

ABSTRACT

Wild emmer (Triticum turgidum ssp. dicoccoides) is the progenitor of all modern cultivated tetraploid wheat. Its genome is large (> 10 Gb) and contains over 80% repeated sequences. The successful whole-genome-shotgun assembly of the wild emmer (accession Zavitan) genome sequence (WEW_v1.0) was an important milestone for wheat genomics. In an effort to improve this assembly, an optical map of accession Zavitan was constructed using Bionano Direct Label and Stain (DLS) technology. The map spanned 10.4 Gb. This map and another map produced earlier by us with the Bionano's Nick Label Repair and Stain (NLRS) technology were used to improve the current wild emmer assembly. The WEW_v1.0 assembly consisted of 151,912 scaffolds. Of them, 3,102 could be confidently aligned on the optical maps. Forty-seven were chimeric. They were disjoined and new scaffolds were assembled with the aid of the optical maps. The total number of scaffolds was reduced from 151,912 to 149,252 and N50 increased from 6.96 Mb to 72.63 Mb. Of the 149,252 scaffolds, 485 scaffolds, which accounted for 97% of the total genome length, were aligned and oriented on genetic maps, and new WEW_v2.0 pseudomolecules were constructed. The new pseudomolecules included 333 scaffolds (68.51 Mb) which were originally unassigned, 226 scaffolds (554.84 Mb) were placed into new locations, and 332 scaffolds (394.83 Mb) were re-oriented. The improved wild emmer genome assembly is an important resource for understanding genomic modification that occurred by domestication.


Subject(s)
Genome, Plant , Triticum/genetics , Whole Genome Sequencing
10.
Theor Appl Genet ; 131(11): 2451-2462, 2018 Nov.
Article in English | MEDLINE | ID: mdl-30141064

ABSTRACT

KEY MESSAGE: Comparison of genome sequences of wild emmer wheat and Aegilops tauschii suggests a novel scenario of the evolution of rearranged wheat chromosomes 4A, 5A, and 7B. Past research suggested that wheat chromosome 4A was subjected to a reciprocal translocation T(4AL;5AL)1 that occurred in the diploid progenitor of the wheat A subgenome and to three major rearrangements that occurred in polyploid wheat: pericentric inversion Inv(4AS;4AL)1, paracentric inversion Inv(4AL;4AL)1, and reciprocal translocation T(4AL;7BS)1. Gene collinearity along the pseudomolecules of tetraploid wild emmer wheat (Triticum turgidum ssp. dicoccoides, subgenomes AABB) and diploid Aegilops tauschii (genomes DD) was employed to confirm these rearrangements and to analyze the breakpoints. The exchange of distal regions of chromosome arms 4AS and 4AL due to pericentric inversion Inv(4AS;4AL)1 was detected, and breakpoints were validated with an optical Bionano genome map. Both breakpoints contained satellite DNA. The breakpoints of reciprocal translocation T(4AL;7BS)1 were also found. However, the breakpoints that generated paracentric inversion Inv(4AL;4AL)1 appeared to be collocated with the 4AL breakpoints that had produced Inv(4AS;4AL)1 and T(4AL;7BS)1. Inv(4AS;4AL)1, Inv(4AL;4AL)1, and T(4AL;7BS)1 either originated sequentially, and Inv(4AL;4AL)1 was produced by recurrent chromosome breaks at the same breakpoints that generated Inv(4AS;4AL)1 and T(4AL;7BS)1, or Inv(4AS;4AL)1, Inv(4AL;4AL)1, and T(4AL;7BS)1 originated simultaneously. We prefer the latter hypothesis since it makes fewer assumptions about the sequence of events that produced these chromosome rearrangements.


Subject(s)
Chromosome Inversion , Chromosomes, Plant/genetics , Evolution, Molecular , Translocation, Genetic , Triticum/genetics , Chromosome Mapping , DNA, Satellite/genetics , Genome, Plant , Poaceae/genetics
11.
Genome ; 61(8): 559-565, 2018 Aug.
Article in English | MEDLINE | ID: mdl-29883550

ABSTRACT

Brachypodium distachyon (n = 5) is a diploid and has been widely used as a genetic model. Brachypodium stacei (n = 10) and B. hybridum (n = 15) are species that are related to B. distachyon, leading to an hypothesis that they are part of a polyploid series based on x = 5. Several lines of evidence suggest that this hypothesis is incorrect and that the genomes of the three taxa may have evolved by a more complex process. We constructed an optical whole-genome BioNano genome (BNG) map for each species and did pairwise alignment of the BNG maps. The maps showed that B. distachyon and B. stacei are both diploid, in spite of B. stacei having twice as many chromosomes as B. distachyon, and that B. hybridum is an allopolyploid formed from hybridization between B. distachyon and B. stacei. This study also demonstrated the use of BNG maps in the detection and quantification of structural variants among the genomes.


Subject(s)
Brachypodium/genetics , Evolution, Molecular , Genome/genetics , Phylogeny , Brachypodium/classification , Chromosomes, Plant/genetics , Diploidy , Polyploidy , Species Specificity
12.
Plant J ; 95(3): 487-503, 2018 08.
Article in English | MEDLINE | ID: mdl-29770515

ABSTRACT

Homology was searched with genes annotated in the Aegilops tauschii pseudomolecules against genes annotated in the pseudomolecules of tetraploid wild emmer wheat, Brachypodium distachyon, sorghum and rice. Similar searches were performed with genes annotated in the rice pseudomolecules. Matrices of collinear genes and rearrangements in their order were constructed. Optical BioNano genome maps were constructed and used to validate rearrangements unique to the wild emmer and Ae. tauschii genomes. Most common rearrangements were short paracentric inversions and short intrachromosomal translocations. Intrachromosomal translocations outnumbered segmental intrachromosomal duplications. The densities of paracentric inversion lengths were approximated by exponential distributions in all six genomes. Densities of collinear genes along the Ae. tauschii chromosomes were highly correlated with meiotic recombination rates but those of rearrangements were not, suggesting different causes of the erosion of gene collinearity and evolution of major chromosome rearrangements. Frequent rearrangements sharing breakpoints suggested that chromosomes have been rearranged recurrently at some sites. The distal 4 Mb of the short arms of rice chromosomes Os11 and Os12 and corresponding regions in the sorghum, B. distachyon and Triticeae genomes contain clusters of interstitial translocations including from 1 to 7 collinear genes. The rates of acquisition of major rearrangements were greater in the large wild emmer wheat and Ae. tauschii genomes than in the lineage preceding their divergence or in the B. distachyon, rice and sorghum lineages. It is suggested that synergy between large quantities of dynamic transposable elements and annual growth habit have been the primary causes of the fast evolution of the Triticeae genomes.


Subject(s)
Evolution, Molecular , Genome, Plant/genetics , Genomics , Poaceae/genetics , Aegilops/genetics , Brachypodium/genetics , Chromosome Mapping , Genes, Plant/genetics , Oryza/genetics , Sequence Analysis, DNA , Sorghum/genetics , Triticum/genetics
13.
Nature ; 551(7681): 498-502, 2017 11 23.
Article in English | MEDLINE | ID: mdl-29143815

ABSTRACT

Aegilops tauschii is the diploid progenitor of the D genome of hexaploid wheat (Triticum aestivum, genomes AABBDD) and an important genetic resource for wheat. The large size and highly repetitive nature of the Ae. tauschii genome has until now precluded the development of a reference-quality genome sequence. Here we use an array of advanced technologies, including ordered-clone genome sequencing, whole-genome shotgun sequencing, and BioNano optical genome mapping, to generate a reference-quality genome sequence for Ae. tauschii ssp. strangulata accession AL8/78, which is closely related to the wheat D genome. We show that compared to other sequenced plant genomes, including a much larger conifer genome, the Ae. tauschii genome contains unprecedented amounts of very similar repeated sequences. Our genome comparisons reveal that the Ae. tauschii genome has a greater number of dispersed duplicated genes than other sequenced genomes and its chromosomes have been structurally evolving an order of magnitude faster than those of other grass genomes. The decay of colinearity with other grass genomes correlates with recombination rates along chromosomes. We propose that the vast amounts of very similar repeated sequences cause frequent errors in recombination and lead to gene duplications and structural chromosome changes that drive fast genome evolution.


Subject(s)
Genome, Plant , Phylogeny , Poaceae/genetics , Triticum/genetics , Chromosome Mapping , Diploidy , Evolution, Molecular , Gene Duplication , Genes, Plant/genetics , Genomics/standards , Poaceae/classification , Recombination, Genetic/genetics , Sequence Analysis, DNA/standards , Triticum/classification
14.
Front Plant Sci ; 8: 2163, 2017.
Article in English | MEDLINE | ID: mdl-29326749

ABSTRACT

Meiotic pairing between homoeologous chromosomes in polyploid wheat is inhibited by the Ph1 locus on the long arm of chromosome 5 in the B genome. Aegilops speltoides (genomes SS), the closest relative of the progenitor of the wheat B genome, is polymorphic for genetic suppression of Ph1. Using this polymorphism, two major suppressor loci, Su1-Ph1 and Su2-Ph1, have been mapped in Ae. speltoides. Su1-Ph1 is located in the distal, high-recombination region of the long arm of the Ae. speltoides chromosome 3S. Its location and tight linkage to marker Xpsr1205-3S makes Su1-Ph1 a suitable target for introgression into wheat. Here, Xpsr1205-3S was introgressed into hexaploid bread wheat cv. Chinese Spring (CS) and from there into tetraploid durum wheat cv. Langdon (LDN). Sequential fluorescence in situ hybridization and genomic in situ hybridization showed that an Ae. speltoides segment with Xpsr1205-3S replaced the distal end of the long arm of chromosome 3A. In the CS genetic background, the chromosome induced homoeologous chromosome pairing in interspecific hybrids with Ae. peregrina but not in progenies from crosses involving alien disomic substitution lines. In the LDN genetic background, the chromosome induced homoeologous chromosome pairing in both interspecific hybrids and progenies from crosses involving alien disomic substitution lines. We conclude that the recombined chromosome harbors Su1-Ph1 but its expression requires expression of complementary gene that is present in LDN but absent in CS. We suggest that it is unlikely that Su1-Ph1 and ZIP4-1, a paralog of Ph1 located on wheat chromosomes 3A and 3B and Ae. tauschii chromosome 3D, are equivalent. The utility of Su1-Ph1 for induction of recombination between homoeologous chromosomes in wheat is illustrated.

15.
J Genet Genomics ; 44(1): 51-61, 2017 01 20.
Article in English | MEDLINE | ID: mdl-27765484

ABSTRACT

Bread wheat (Triticum aestivum, AABBDD) is an allohexaploid species derived from two rounds of interspecific hybridizations. A high-quality genome sequence assembly of diploid Aegilops tauschii, the donor of the wheat D genome, will provide a useful platform to study polyploid wheat evolution. A combined approach of BAC pooling and next-generation sequencing technology was employed to sequence the minimum tiling path (MTP) of 3176 BAC clones from the short arm of Ae. tauschii chromosome 3 (At3DS). The final assembly of 135 super-scaffolds with an N50 of 4.2 Mb was used to build a 247-Mb pseudomolecule with a total of 2222 predicted protein-coding genes. Compared with the orthologous regions of rice, Brachypodium, and sorghum, At3DS contains 38.67% more genes. In comparison to At3DS, the short arm sequence of wheat chromosome 3B (Ta3BS) is 95-Mb large in size, which is primarily due to the expansion of the non-centromeric region, suggesting that transposable element (TE) bursts in Ta3B likely occurred there. Also, the size increase is accompanied by a proportional increase in gene number in Ta3BS. We found that in the sequence of short arm of wheat chromosome 3D (Ta3DS), there was only less than 0.27% gene loss compared to At3DS. Our study reveals divergent evolution of grass genomes and provides new insights into sequence changes in the polyploid wheat genome.


Subject(s)
Chromosomes, Plant/genetics , Evolution, Molecular , Genome, Plant/genetics , Poaceae/genetics , Sequence Analysis , Triticum/genetics , INDEL Mutation , Polymorphism, Single Nucleotide , Synteny
16.
Methods Mol Biol ; 1429: 103-17, 2016.
Article in English | MEDLINE | ID: mdl-27511170

ABSTRACT

Application of optical mapping based on BioNano Genomics Irys(®) technology ( http://www.bionanogenomics.com/ ) is growing rapidly since its debut in November 2012. The technology can be used to facilitate genome sequence assembly and analysis of genome structural variations. We describe here the detailed protocol that we used to generate a whole genome BioNano map for Aegilops tauschii, the D genome progenitor of hexaploid wheat (Triticum aestivum). We are using the whole genome BioNano map to validate sequence assembly based on the next-generation sequencing, order sequence scaffolds, and ultimately build pseudomolecules for the genome.


Subject(s)
Chromosome Mapping/methods , Genome, Plant , Genomics/methods , High-Throughput Nucleotide Sequencing/methods , Nanotechnology/methods , Sequence Analysis, DNA/methods , Triticum/genetics , Optics and Photonics
17.
Proc Natl Acad Sci U S A ; 110(19): 7940-5, 2013 May 07.
Article in English | MEDLINE | ID: mdl-23610408

ABSTRACT

The current limitations in genome sequencing technology require the construction of physical maps for high-quality draft sequences of large plant genomes, such as that of Aegilops tauschii, the wheat D-genome progenitor. To construct a physical map of the Ae. tauschii genome, we fingerprinted 461,706 bacterial artificial chromosome clones, assembled contigs, designed a 10K Ae. tauschii Infinium SNP array, constructed a 7,185-marker genetic map, and anchored on the map contigs totaling 4.03 Gb. Using whole genome shotgun reads, we extended the SNP marker sequences and found 17,093 genes and gene fragments. We showed that collinearity of the Ae. tauschii genes with Brachypodium distachyon, rice, and sorghum decreased with phylogenetic distance and that structural genome evolution rates have been high across all investigated lineages in subfamily Pooideae, including that of Brachypodieae. We obtained additional information about the evolution of the seven Triticeae chromosomes from 12 ancestral chromosomes and uncovered a pattern of centromere inactivation accompanying nested chromosome insertions in grasses. We showed that the density of noncollinear genes along the Ae. tauschii chromosomes positively correlates with recombination rates, suggested a cause, and showed that new genes, exemplified by disease resistance genes, are preferentially located in high-recombination chromosome regions.


Subject(s)
Contig Mapping , Genome, Plant , Poaceae/genetics , Centromere/ultrastructure , Chromosomes, Artificial, Bacterial , Chromosomes, Plant/ultrastructure , Evolution, Molecular , Genes, Plant , Genetic Markers , Polymorphism, Single Nucleotide , Recombination, Genetic , Sequence Analysis, DNA , Triticum/genetics
18.
PLoS One ; 8(2): e55864, 2013.
Article in English | MEDLINE | ID: mdl-23405223

ABSTRACT

Next-generation sequencing (NGS) technologies have enabled high-throughput and low-cost generation of sequence data; however, de novo genome assembly remains a great challenge, particularly for large genomes. NGS short reads are often insufficient to create large contigs that span repeat sequences and to facilitate unambiguous assembly. Plant genomes are notorious for containing high quantities of repetitive elements, which combined with huge genome sizes, makes accurate assembly of these large and complex genomes intractable thus far. Using two-color genome mapping of tiling bacterial artificial chromosomes (BAC) clones on nanochannel arrays, we completed high-confidence assembly of a 2.1-Mb, highly repetitive region in the large and complex genome of Aegilops tauschii, the D-genome donor of hexaploid wheat (Triticum aestivum). Genome mapping is based on direct visualization of sequence motifs on single DNA molecules hundreds of kilobases in length. With the genome map as a scaffold, we anchored unplaced sequence contigs, validated the initial draft assembly, and resolved instances of misassembly, some involving contigs <2 kb long, to dramatically improve the assembly from 75% to 95% complete.


Subject(s)
Chromosome Mapping , Chromosomes, Plant/genetics , Genes, Plant/genetics , Genome, Plant/genetics , Nanotechnology/instrumentation , Triticum/genetics , Chromosomes, Artificial, Bacterial , High-Throughput Nucleotide Sequencing , Molecular Sequence Data , Sequence Analysis, DNA
19.
PLoS One ; 8(1): e54101, 2013.
Article in English | MEDLINE | ID: mdl-23326580

ABSTRACT

Wheat and maize genes were hypothesized to be clustered into islands but the hypothesis was not statistically tested. The hypothesis is statistically tested here in four grass species differing in genome size, Brachypodium distachyon, Oryza sativa, Sorghum bicolor, and Aegilops tauschii. Density functions obtained under a model where gene locations follow a homogeneous Poisson process and thus are not clustered are compared with a model-free situation quantified through a non-parametric density estimate. A simple homogeneous Poisson model for gene locations is not rejected for the small O. sativa and B. distachyon genomes, indicating that genes are distributed largely uniformly in those species, but is rejected for the larger S. bicolor and Ae. tauschii genomes, providing evidence for clustering of genes into islands. It is proposed to call the gene islands "gene insulae" to distinguish them from other types of gene clustering that have been proposed. An average S. bicolor and Ae. tauschii insula is estimated to contain 3.7 and 3.9 genes with an average intergenic distance within an insula of 2.1 and 16.5 kb, respectively. Inter-insular distances are greater than 8 and 81 kb and average 15.1 and 205 kb, in S. bicolor and Ae. tauschii, respectively. A greater gene density observed in the distal regions of the Ae. tauschii chromosomes is shown to be primarily caused by shortening of inter-insular distances. The comparison of the four grass genomes suggests that gene locations are largely a function of a homogeneous Poisson process in small genomes. Nonrandom insertions of LTR retroelements during genome expansion creates gene insulae, which become less dense and further apart with the increase in genome size. High concordance in relative lengths of orthologous intergenic distances among the investigated genomes including the maize genome suggests functional constraints on gene distribution in the grass genomes.


Subject(s)
Genome, Plant , Insulator Elements/genetics , Poaceae/genetics , Retroelements/genetics , Terminal Repeat Sequences/genetics , Brachypodium/genetics , Humans , Oryza/genetics , Sequence Analysis, DNA , Sorghum/genetics
20.
BMC Genomics ; 13: 354, 2012 Jul 31.
Article in English | MEDLINE | ID: mdl-22849334

ABSTRACT

BACKGROUND: A genome-wide set of single nucleotide polymorphisms (SNPs) is a valuable resource in genetic research and breeding and is usually developed by re-sequencing a genome. If a genome sequence is not available, an alternative strategy must be used. We previously reported the development of a pipeline (AGSNP) for genome-wide SNP discovery in coding sequences and other single-copy DNA without a complete genome sequence in self-pollinating (autogamous) plants. Here we updated this pipeline for SNP discovery in outcrossing (allogamous) species and demonstrated its efficacy in SNP discovery in walnut (Juglans regia L.). RESULTS: The first step in the original implementation of the AGSNP pipeline was the construction of a reference sequence and the identification of single-copy sequences in it. To identify single-copy sequences, multiple genome equivalents of short SOLiD reads of another individual were mapped to shallow genome coverage of long Sanger or Roche 454 reads making up the reference sequence. The relative depth of SOLiD reads was used to filter out repeated sequences from single-copy sequences in the reference sequence. The second step was a search for SNPs between SOLiD reads and the reference sequence. Polymorphism within the mapped SOLiD reads would have precluded SNP discovery; hence both individuals had to be homozygous. The AGSNP pipeline was updated here for using SOLiD or other type of short reads of a heterozygous individual for these two principal steps. A total of 32.6X walnut genome equivalents of SOLiD reads of vegetatively propagated walnut scion cultivar 'Chandler' were mapped to 48,661 'Chandler' bacterial artificial chromosome (BAC) end sequences (BESs) produced by Sanger sequencing during the construction of a walnut physical map. A total of 22,799 putative SNPs were initially identified. A total of 6,000 Infinium II type SNPs evenly distributed along the walnut physical map were selected for the construction of an Infinium BeadChip, which was used to genotype a walnut mapping population having 'Chandler' as one of the parents. Genotyping results were used to adjust the filtering parameters of the updated AGSNP pipeline. With the adjusted filtering criteria, 69.6% of SNPs discovered with the updated pipeline were real and could be mapped on the walnut genetic map. A total of 13,439 SNPs were discovered by BES re-sequencing. BESs harboring SNPs were in 677 FPC contigs covering 98% of the physical map of the walnut genome. CONCLUSION: The updated AGSNP pipeline is a versatile SNP discovery tool for a high-throughput, genome-wide SNP discovery in both autogamous and allogamous species. With this pipeline, a large set of SNPs were identified in a single walnut cultivar.


Subject(s)
Algorithms , Chromosome Mapping/methods , Genome, Plant , Genotyping Techniques , Juglans/genetics , Polymorphism, Single Nucleotide , Chromosomes, Artificial, Bacterial , Expressed Sequence Tags , Genome-Wide Association Study , Open Reading Frames , Pollination/physiology , Sequence Analysis, DNA
SELECTION OF CITATIONS
SEARCH DETAIL