Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 12 de 12
Filter
Add more filters










Publication year range
1.
Plant Methods ; 14: 65, 2018.
Article in English | MEDLINE | ID: mdl-30083222

ABSTRACT

BACKGROUND: Our knowledge of natural genetic variation is increasing at an extremely rapid pace, affording an opportunity to come to a much richer understanding of how effects of specific genes are dependent on the genetic background. To achieve a systematic understanding of such GxG interactions, it is desirable to develop genome editing tools that can be rapidly deployed across many different genetic varieties. RESULTS: We present an efficient CRISPR/Cas9 toolbox of super module (SM) vectors. These vectors are based on a previously described fluorescence protein marker expressed in seeds allowing identification of transgene-free mutants. We have used this vector series to delete genomic regions ranging from 1.7 to 13 kb in different natural accessions of the wild plant Arabidopsis thaliana. Based on results from 53 pairs of sgRNAs targeting individual nucleotide binding site leucine-rich repeat (NLR) genes, we provide a comprehensive overview of obtaining heritable deletions. CONCLUSIONS: The SM series of CRISPR/Cas9 vectors enables the rapid generation of transgene-free, genome edited plants for a diversity of functional studies.

2.
Nucleic Acids Res ; 46(D1): D1150-D1156, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29059333

ABSTRACT

The abundance of high-quality genotype and phenotype data for the model organism Arabidopsis thaliana enables scientists to study the genetic architecture of many complex traits at an unprecedented level of detail using genome-wide association studies (GWAS). GWAS have been a great success in A. thaliana and many SNP-trait associations have been published. With the AraGWAS Catalog (https://aragwas.1001genomes.org) we provide a publicly available, manually curated and standardized GWAS catalog for all publicly available phenotypes from the central A. thaliana phenotype repository, AraPheno. All GWAS have been recomputed on the latest imputed genotype release of the 1001 Genomes Consortium using a standardized GWAS pipeline to ensure comparability between results. The catalog includes currently 167 phenotypes and more than 222 000 SNP-trait associations with P < 10-4, of which 3887 are significantly associated using permutation-based thresholds. The AraGWAS Catalog can be accessed via a modern web-interface and provides various features to easily access, download and visualize the results and summary statistics across GWAS.


Subject(s)
Arabidopsis/genetics , Databases, Genetic , Genome-Wide Association Study , Polymorphism, Single Nucleotide , User-Computer Interface
3.
Nucleic Acids Res ; 45(D1): D1054-D1059, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27924043

ABSTRACT

Natural genetic variation makes it possible to discover evolutionary changes that have been maintained in a population because they are advantageous. To understand genotype-phenotype relationships and to investigate trait architecture, the existence of both high-resolution genotypic and phenotypic data is necessary. Arabidopsis thaliana is a prime model for these purposes. This herb naturally occurs across much of the Eurasian continent and North America. Thus, it is exposed to a wide range of environmental factors and has been subject to natural selection under distinct conditions. Full genome sequencing data for more than 1000 different natural inbred lines are available, and this has encouraged the distributed generation of many types of phenotypic data. To leverage these data for meta analyses, AraPheno (https://arapheno.1001genomes.org) provide a central repository of population-scale phenotypes for A. thaliana inbred lines. AraPheno includes various features to easily access, download and visualize the phenotypic data. This will facilitate a comparative analysis of the many different types of phenotypic data, which is the base to further enhance our understanding of the genotype-phenotype map.


Subject(s)
Arabidopsis/genetics , Arabidopsis/metabolism , Databases, Genetic , Genetic Association Studies/methods , Genotype , Phenotype , Search Engine , Database Management Systems , Software , Web Browser
4.
PLoS Genet ; 11(1): e1004920, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25569172

ABSTRACT

There has been much excitement about the possibility that exposure to specific environments can induce an ecological memory in the form of whole-sale, genome-wide epigenetic changes that are maintained over many generations. In the model plant Arabidopsis thaliana, numerous heritable DNA methylation differences have been identified in greenhouse-grown isogenic lines, but it remains unknown how natural, highly variable environments affect the rate and spectrum of such changes. Here we present detailed methylome analyses in a geographically dispersed A. thaliana population that constitutes a collection of near-isogenic lines, diverged for at least a century from a common ancestor. Methylome variation largely reflected genetic distance, and was in many aspects similar to that of lines raised in uniform conditions. Thus, even when plants are grown in varying and diverse natural sites, genome-wide epigenetic variation accumulates mostly in a clock-like manner, and epigenetic divergence thus parallels the pattern of genome-wide DNA sequence divergence.


Subject(s)
DNA Methylation/genetics , Epigenesis, Genetic , Genetic Variation , Genome, Plant , Arabidopsis , DNA Transposable Elements/genetics , DNA, Plant/genetics , Plants, Genetically Modified/genetics
5.
Plant J ; 81(4): 597-610, 2015 Feb.
Article in English | MEDLINE | ID: mdl-25557441

ABSTRACT

MicroRNAs (miRNAs) are short RNAs involved in gene regulation through translational inhibition and transcript cleavage. After processing from imperfect fold-back structures, miRNAs are incorporated into RNA-induced silencing complexes (RISCs) before targeting transcripts with varying degrees of complementarity. Some miRNAs are evolutionarily deep-rooted, and sequence complementarity with their targets is maintained through purifying selection. Both Arabidopsis and Capsella belong to the tribe Camelineae in the Brassicaceae, with Capsella rubella serving as an outgroup to the genus Arabidopsis. The genome sequence of C. rubella has recently been released, which allows characterization of its miRNA complement in comparison with Arabidopsis thaliana and Arabidopsis lyrata. Through next-generation sequencing, we identify high-confidence miRNA candidates specific to the C. rubella lineage. Only a few lineage-specific miRNAs have been studied for evolutionary constraints, and there have been no systematic studies of miRNA target diversity within or divergence between closely related plant species. Therefore we contrast sequence variation in miRNAs and their targets within A. thaliana, and between A. thaliana, A. lyrata and C. rubella. We document a surprising amount of small-scale variation in miRNA-target pairs, where many miRNAs are predicted to have species-specific targets in addition to ones that are shared between species. Our results emphasize that the transitive nature of many miRNA-target pairs can be observed even on a relatively short evolutionary time-scale, with non-random occurrences of differences in miRNAs and their complements in the miRNA precursors, the miRNA* sequences.


Subject(s)
Arabidopsis/genetics , Capsella/genetics , Evolution, Molecular , MicroRNAs/genetics , Arabidopsis/metabolism , Capsella/metabolism , MicroRNAs/metabolism , Polymorphism, Genetic , RNA, Small Interfering/genetics , RNA, Small Interfering/metabolism , Synteny
6.
Plant Cell ; 25(8): 2848-63, 2013 Aug.
Article in English | MEDLINE | ID: mdl-23956262

ABSTRACT

Traditional forward genetic screens are limited in the identification of homologous genes with overlapping functions. Here, we report the analyses and assembly of genome-wide protein family definitions that comprise the largest estimate for the potentially redundant gene space in Arabidopsis thaliana. On this basis, a computational design of genome-wide family-specific artificial microRNAs (amiRNAs) was performed using high-performance computing resources. The amiRNA designs are searchable online (http://phantomdb.ucsd.edu). A computationally derived library of 22,000 amiRNAs was synthesized in 10 sublibraries of 1505 to 4082 amiRNAs, each targeting defined functional protein classes. For example, 2964 amiRNAs target annotated DNA and RNA binding protein families and 1777 target transporter proteins, and another sublibrary targets proteins of unknown function. To evaluate the potential of an amiRNA-based screen, we tested 122 amiRNAs targeting transcription factor, protein kinase, and protein phosphatase families. Several amiRNA lines showed morphological phenotypes, either comparable to known phenotypes of single and double/triple mutants or caused by overexpression of microRNAs. Moreover, novel morphological and abscisic acid-insensitive seed germination mutants were identified for amiRNAs targeting zinc finger homeodomain transcription factors and mitogen-activated protein kinase kinase kinases, respectively. These resources provide an approach for genome-wide genetic screens of the functionally redundant gene space in Arabidopsis.


Subject(s)
Arabidopsis/genetics , Genes, Plant/genetics , Genomic Library , Genomics/methods , MicroRNAs/genetics , Abscisic Acid/pharmacology , Arabidopsis/drug effects , Arabidopsis Proteins/genetics , Arabidopsis Proteins/metabolism , Flowers/drug effects , Flowers/growth & development , Genetic Testing , MicroRNAs/metabolism , Multigene Family , Mutation/genetics , Phenotype , Plant Leaves/drug effects , Plant Leaves/growth & development , Proteome/metabolism
7.
Bioinformatics ; 28(10): 1303-6, 2012 May 15.
Article in English | MEDLINE | ID: mdl-22451271

ABSTRACT

MOTIVATION: The sequencing of over a thousand natural strains of the model plant Arabidopsis thaliana is producing unparalleled information at the genetic level for plant researchers. To enable the rapid exploitation of these data for functional proteomics studies, we have created a resource for the visualization of protein information and proteomic datasets for sequenced natural strains of A. thaliana. RESULTS: The 1001 Proteomes portal can be used to visualize amino acid substitutions or non-synonymous single-nucleotide polymorphisms in individual proteins of A. thaliana based on the reference genome Col-0. We have used the available processed sequence information to analyze the conservation of known residues subject to protein phosphorylation among these natural strains. The substitution of amino acids in A. thaliana natural strains is heavily constrained and is likely a result of the conservation of functional attributes within proteins. At a practical level, we demonstrate that this information can be used to clarify ambiguously defined phosphorylation sites from phosphoproteomic studies. Protein sets of available natural variants are available for download to enable proteomic studies on these accessions. Together this information can be used to uncover the possible roles of specific amino acids in determining the structure and function of proteins in the model plant A. thaliana. An online portal to enable the community to exploit these data can be accessed at http://1001proteomes.masc-proteomics.org/


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis/genetics , Amino Acid Substitution , DNA, Plant , Databases, Protein , Phosphorylation , Polymorphism, Single Nucleotide , Protein Processing, Post-Translational , Proteome/genetics , Proteomics , Sequence Analysis, DNA
8.
Nat Genet ; 43(10): 956-63, 2011 Aug 28.
Article in English | MEDLINE | ID: mdl-21874002

ABSTRACT

The plant Arabidopsis thaliana occurs naturally in many different habitats throughout Eurasia. As a foundation for identifying genetic variation contributing to adaptation to diverse environments, a 1001 Genomes Project to sequence geographically diverse A. thaliana strains has been initiated. Here we present the first phase of this project, based on population-scale sequencing of 80 strains drawn from eight regions throughout the species' native range. We describe the majority of common small-scale polymorphisms as well as many larger insertions and deletions in the A. thaliana pan-genome, their effects on gene function, and the patterns of local and global linkage among these variants. The action of processes other than spontaneous mutation is identified by comparing the spectrum of mutations that have accumulated since A. thaliana diverged from its closest relative 10 million years ago with the spectrum observed in the laboratory. Recent species-wide selective sweeps are rare, and potentially deleterious mutations are more common in marginal populations.


Subject(s)
Arabidopsis/genetics , Genetics, Population , Genome, Plant , Sequence Analysis, DNA/methods , Alleles , Chromosome Mapping , Chromosomes, Plant , DNA, Plant/genetics , Genetic Loci , Geography , Linkage Disequilibrium , Mutation , Phenotype , Polymorphism, Single Nucleotide , Selection, Genetic
9.
Plant Physiol ; 157(2): 757-69, 2011 Oct.
Article in English | MEDLINE | ID: mdl-21810963

ABSTRACT

Plants, like animals, use several lines of defense against pathogen attack. Prominent among genes that confer disease resistance are those encoding nucleotide-binding site-leucine-rich repeat (NB-LRR) proteins. Likely due to selection pressures caused by pathogens, NB-LRR genes are the most variable gene family in plants, but there appear to be species-specific limits to the number of NB-LRR genes in a genome. Allelic diversity within an individual is also increased by obligatory outcrossing, which leads to genome-wide heterozygosity. In this study, we compared the NB-LRR gene complement of the selfer Arabidopsis thaliana and its outcrossing close relative Arabidopsis lyrata. We then complemented and contrasted the interspecific patterns with studies of NB-LRR diversity within A. thaliana. Three important insights are as follows: (1) that both species have similar numbers of NB-LRR genes; (2) that loci with single NB-LRR genes are less variable than tandem arrays; and (3) that presence-absence polymorphisms within A. thaliana are not strongly correlated with the presence or absence of orthologs in A. lyrata. Although A. thaliana individuals are mostly homozygous and thus potentially less likely to suffer from aberrant interaction of NB-LRR proteins with newly introduced alleles, the number of NB-LRR genes is similar to that in A. lyrata. In intraspecific and interspecific comparisons, NB-LRR genes are also more variable than receptor-like protein genes. Finally, in contrast to Drosophila, there is a clearly positive relationship between interspecific divergence and intraspecific polymorphisms.


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis Proteins/metabolism , Arabidopsis/genetics , Nucleotides/metabolism , Proteins/genetics , Proteins/metabolism , Binding Sites/genetics , Evolution, Molecular , Genes, Plant , Genome, Plant , Leucine-Rich Repeat Proteins , Multigene Family , Phylogeny , Polymorphism, Genetic
10.
Proc Natl Acad Sci U S A ; 108(25): 10249-54, 2011 Jun 21.
Article in English | MEDLINE | ID: mdl-21646520

ABSTRACT

We present whole-genome assemblies of four divergent Arabidopsis thaliana strains that complement the 125-Mb reference genome sequence released a decade ago. Using a newly developed reference-guided approach, we assembled large contigs from 9 to 42 Gb of Illumina short-read data from the Landsberg erecta (Ler-1), C24, Bur-0, and Kro-0 strains, which have been sequenced as part of the 1,001 Genomes Project for this species. Using alignments against the reference sequence, we first reduced the complexity of the de novo assembly and later integrated reads without similarity to the reference sequence. As an example, half of the noncentromeric C24 genome was covered by scaffolds that are longer than 260 kb, with a maximum of 2.2 Mb. Moreover, over 96% of the reference genome was covered by the reference-guided assembly, compared with only 87% with a complete de novo assembly. Comparisons with 2 Mb of dideoxy sequence reveal that the per-base error rate of the reference-guided assemblies was below 1 in 10,000. Our assemblies provide a detailed, genomewide picture of large-scale differences between A. thaliana individuals, most of which are difficult to access with alignment-consensus methods only. We demonstrate their practical relevance in studying the expression differences of polymorphic genes and show how the analysis of sRNA sequencing data can lead to erroneous conclusions if aligned against the reference genome alone. Genome assemblies, raw reads, and further information are accessible through http://1001genomes.org/projects/assemblies.html.


Subject(s)
Arabidopsis/genetics , Genome, Plant , Algorithms , Base Sequence , Polymorphism, Genetic , Sequence Alignment , Sequence Analysis, DNA
11.
PLoS Genet ; 6(3): e1000890, 2010 Mar 26.
Article in English | MEDLINE | ID: mdl-20361058

ABSTRACT

As Arabidopsis thaliana is increasingly employed in evolutionary and ecological studies, it is essential to understand patterns of natural genetic variation and the forces that shape them. Previous work focusing mostly on global and regional scales has demonstrated the importance of historical events such as long-distance migration and colonization. Far less is known about the role of contemporary factors or environmental heterogeneity in generating diversity patterns at local scales. We sampled 1,005 individuals from 77 closely spaced stands in diverse settings around Tübingen, Germany. A set of 436 SNP markers was used to characterize genome-wide patterns of relatedness and recombination. Neighboring genotypes often shared mosaic blocks of alternating marker identity and divergence. We detected recent outcrossing as well as stretches of residual heterozygosity in largely homozygous recombinants. As has been observed for several other selfing species, there was considerable heterogeneity among sites in diversity and outcrossing, with rural stands exhibiting greater diversity and heterozygosity than urban stands. Fine-scale spatial structure was evident as well. Within stands, spatial structure correlated negatively with observed heterozygosity, suggesting that the high homozygosity of natural A. thaliana may be partially attributable to nearest-neighbor mating of related individuals. The large number of markers and extensive local sampling employed here afforded unusual power to characterize local genetic patterns. Contemporary processes such as ongoing outcrossing play an important role in determining distribution of genetic diversity at this scale. Local "outcrossing hotspots" appear to reshuffle genetic information at surprising rates, while other stands contribute comparatively little. Our findings have important implications for sampling and interpreting diversity among A. thaliana accessions.


Subject(s)
Arabidopsis/genetics , Genetic Variation , Recombination, Genetic , Genotype , Hybridization, Genetic , Polymorphism, Single Nucleotide
12.
Bioinformatics ; 23(20): 2784-7, 2007 Oct 15.
Article in English | MEDLINE | ID: mdl-17785349

ABSTRACT

MOTIVATION: One challenging aspect of genotyping and association mapping projects is often the identification of markers that are informative between groups of individuals and to convert these into genotyping assays. RESULTS: The Multiple SNP Query Tool (MSQT) extracts SNP information from multiple sequence alignments, stores it in a database, provides a web interface to query the database and outputs SNP information in a format directly applicable for SNP-assay design. MSQT was applied to Arabidopsis thaliana sequence data to develop SNP genotyping assays that distinguish a recurrent parent (Col-0) from five other strains. SNPs with intermediate allele frequencies were also identified and developed into markers suitable for efficient genetic mapping among random pairs of wild strains. AVAILABILITY: The source code for MSQT is available at http://msqt.weigelworld.org, together with an online instance of MSQT containing data on 1214 sequenced fragments from 96 ecotypes (wild inbred strains) of the reference plant A. thaliana. All SNP genotyping assays are available in several formats for broad community use. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Databases, Genetic , Polymorphism, Single Nucleotide/genetics , Sequence Alignment/methods , Sequence Analysis, DNA/methods , Software , User-Computer Interface , Base Sequence , Chromosome Mapping/methods , Database Management Systems , Genetic Markers/genetics , Information Storage and Retrieval/methods , Molecular Sequence Data
SELECTION OF CITATIONS
SEARCH DETAIL
...