Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 15 de 15
Filter
Add more filters










Publication year range
1.
Sci Rep ; 10(1): 8364, 2020 05 20.
Article in English | MEDLINE | ID: mdl-32433569

ABSTRACT

Red clover (Trifolium pratense L.) is a highly adaptable forage crop for temperate livestock agriculture. Genetic variation can be identified, via molecular techniques, and used to assess diversity among populations that may otherwise be indistinguishable. Here we have used genotyping by sequencing (GBS) to determine the genetic variation and population structure in red clover natural populations from Europe and Asia, and varieties or synthetic populations. Cluster analysis differentiated the collection into four large regional groups: Asia, Iberia, UK, and Central Europe. The five varieties clustered with the geographical area from which they were derived. Two methods (BayeScan and Samßada) were used to search for outlier loci indicating signatures of selection. A total of 60 loci were identified by both methods, but no specific genomic region was highlighted. The rate of decay in linkage disequilibrium was fast, and no significant evidence of any bottlenecks was found. Phenotypic analysis showed that a more prostrate and spreading growth habit was predominantly found among populations from Iberia and the UK. A genome wide association study identified a single nucleotide polymorphism (SNP) located in a homologue of the VEG2 gene from pea, associated with flowering time. The identification of genetic variation within the natural populations is likely to be useful for enhancing the breeding of red clover in the future.


Subject(s)
Crops, Agricultural/genetics , Genome, Plant/genetics , Selection, Genetic , Trifolium/genetics , Asia , Chromosome Mapping , Cluster Analysis , Genome-Wide Association Study , Linkage Disequilibrium , Phylogeography , Plant Breeding , Polymorphism, Single Nucleotide , Spain , United Kingdom
2.
Sci Data ; 4: 170044, 2017 04 27.
Article in English | MEDLINE | ID: mdl-28448065

ABSTRACT

Barley (Hordeum vulgare L.) is a cereal grass mainly used as animal fodder and raw material for the malting industry. The map-based reference genome sequence of barley cv. 'Morex' was constructed by the International Barley Genome Sequencing Consortium (IBSC) using hierarchical shotgun sequencing. Here, we report the experimental and computational procedures to (i) sequence and assemble more than 80,000 bacterial artificial chromosome (BAC) clones along the minimum tiling path of a genome-wide physical map, (ii) find and validate overlaps between adjacent BACs, (iii) construct 4,265 non-redundant sequence scaffolds representing clusters of overlapping BACs, and (iv) order and orient these BAC clusters along the seven barley chromosomes using positional information provided by dense genetic maps, an optical map and chromosome conformation capture sequencing (Hi-C). Integrative access to these sequence and mapping resources is provided by the barley genome explorer (BARLEX).


Subject(s)
Genome, Plant , Hordeum/genetics , Chromosome Mapping , Sequence Analysis
3.
Int J Genomics ; 2017: 9272363, 2017.
Article in English | MEDLINE | ID: mdl-28265566

ABSTRACT

Next generation sequencing technologies have provided numerous opportunities for application in the study of whole plant genomes. In this study, we present the sequencing and bioinformatic analyses of five typical rice landraces including three indica and two japonica with potential blast resistance. A total of 688.4 million 100 bp paired-end reads have yielded approximately 30-fold coverage to compare with the Nipponbare reference genome. Among them, a small number of reads were mapped to both chromosomes and organellar genomes. Over two million and eight hundred thousand single nucleotide polymorphisms (SNPs) and insertions and deletions (InDels) in indica and japonica lines have been determined, which potentially have significant impacts on multiple transcripts of genes. SNP deserts, contiguous SNP-low regions, were found on chromosomes 1, 4, and 5 of all genomes of rice examined. Based on the distribution of SNPs per 100 kilobase pairs, the phylogenetic relationships among the landraces have been constructed. This is the first step towards revealing several salient features of rice genomes in Vietnam and providing significant information resources to further marker-assisted selection (MAS) in rice breeding programs.

4.
Proc Natl Acad Sci U S A ; 114(6): E913-E921, 2017 02 07.
Article in English | MEDLINE | ID: mdl-28096351

ABSTRACT

Comprehensive reverse genetic resources, which have been key to understanding gene function in diploid model organisms, are missing in many polyploid crops. Young polyploid species such as wheat, which was domesticated less than 10,000 y ago, have high levels of sequence identity among subgenomes that mask the effects of recessive alleles. Such redundancy reduces the probability of selection of favorable mutations during natural or human selection, but also allows wheat to tolerate high densities of induced mutations. Here we exploited this property to sequence and catalog more than 10 million mutations in the protein-coding regions of 2,735 mutant lines of tetraploid and hexaploid wheat. We detected, on average, 2,705 and 5,351 mutations per tetraploid and hexaploid line, respectively, which resulted in 35-40 mutations per kb in each population. With these mutation densities, we identified an average of 23-24 missense and truncation alleles per gene, with at least one truncation or deleterious missense mutation in more than 90% of the captured wheat genes per population. This public collection of mutant seed stocks and sequence data enables rapid identification of mutations in the different copies of the wheat genes, which can be combined to uncover previously hidden variation. Polyploidy is a central phenomenon in plant evolution, and many crop species have undergone recent genome duplication events. Therefore, the general strategy and methods developed herein can benefit other polyploid crops.


Subject(s)
Genome, Plant/genetics , Mutation , Polyploidy , Triticum/genetics , DNA Mutational Analysis/methods , Evolution, Molecular , Exome/genetics , Plant Breeding , Plant Proteins/genetics , Selection, Genetic
5.
Plant Genome ; 9(1)2016 03.
Article in English | MEDLINE | ID: mdl-27898761

ABSTRACT

The genome sequences of many important Triticeae species, including bread wheat ( L.) and barley ( L.), remained uncharacterized for a long time because their high repeat content, large sizes, and polyploidy. As a result of improvements in sequencing technologies and novel analyses strategies, several of these have recently been deciphered. These efforts have generated new insights into Triticeae biology and genome organization and have important implications for downstream usage by breeders, experimental biologists, and comparative genomicists. transPLANT () is an EU-funded project aimed at constructing hardware, software, and data infrastructure for genome-scale research in the life sciences. Since the Triticeae data are intrinsically complex, heterogenous, and distributed, the transPLANT consortium has undertaken efforts to develop common data formats and tools that enable the exchange and integration of data from distributed resources. Here we present an overview of the individual Triticeae genome resources hosted by transPLANT partners, introduce the objectives of transPLANT, and outline common developments and interfaces supporting integrated data access.


Subject(s)
Genome, Plant , Genomics/methods , Poaceae/genetics , Evolution, Molecular , Hordeum/genetics , Polyploidy , Triticum/genetics
6.
Article in English | MEDLINE | ID: mdl-27337980

ABSTRACT

The Ensembl gene annotation system has been used to annotate over 70 different vertebrate species across a wide range of genome projects. Furthermore, it generates the automatic alignment-based annotation for the human and mouse GENCODE gene sets. The system is based on the alignment of biological sequences, including cDNAs, proteins and RNA-seq reads, to the target genome in order to construct candidate transcript models. Careful assessment and filtering of these candidate transcripts ultimately leads to the final gene set, which is made available on the Ensembl website. Here, we describe the annotation process in detail.Database URL: http://www.ensembl.org/index.html.


Subject(s)
Databases, Nucleic Acid , Databases, Protein , Internet , Molecular Sequence Annotation/methods , Animals , Humans , Mice
7.
Otolaryngol Head Neck Surg ; 154(1): 138-43, 2016 Jan.
Article in English | MEDLINE | ID: mdl-26567047

ABSTRACT

OBJECTIVE: The aim of this study was to measure the effects of endotracheal intubation on innate immune response within the pig laryngeal mucosa. STUDY DESIGN: Prospective controlled basic science study. SETTING: The animal experiments and analyses were conducted at the University of Bristol. SAMPLES AND METHODS: Eighteen pigs, matched at the major histocompatibility complex (MHC), were used in the study. The pigs were divided into 9 pairs. One of each pair (9 pigs in total) was intubated with an endotracheal tube under general anesthesia for 90 minutes. Two days later, pinch biopsies were taken from the supraglottis (specifically the false cords) and subglottis of both pigs. The experiment was repeated 8 more times. Based on quantitative immunohistochemistry, percentage areas of positive staining for CD172a, CD163, MHC class II, CD14, and CD16 were calculated separately for the epithelium and lamina propria of each biopsy. RESULTS: Total areas of laryngeal mucosa (epithelium and lamina propria) expressing CD172a and coexpressing CD163 and CD172a were significantly reduced at 2 days following endotracheal intubation (P = .039 and P = .037, respectively). MHC class II expression and MHC class II coexpression with CD172a were similarly reduced following intubation (P = .003 and P = .005, respectively). In the supraglottis, MHC class II coexpression with CD16 and CD14 was also reduced following endotracheal intubation (P = .037). CONCLUSIONS: Our results indicate that endotracheal intubation reduces the number of innate immune cells within the upper airway mucosa. This may be an important first step in a cascade leading to chronic wound and scar formation causing airway stenosis.


Subject(s)
Immunity, Innate , Intubation, Intratracheal , Laryngeal Mucosa/immunology , Animals , Laryngeal Mucosa/pathology , Swine
8.
Sci Rep ; 5: 17394, 2015 Nov 30.
Article in English | MEDLINE | ID: mdl-26617401

ABSTRACT

Red clover (Trifolium pratense L.) is a globally significant forage legume in pastoral livestock farming systems. It is an attractive component of grassland farming, because of its high yield and protein content, nutritional value and ability to fix atmospheric nitrogen. Enhancing its role further in sustainable agriculture requires genetic improvement of persistency, disease resistance, and tolerance to grazing. To help address these challenges, we have assembled a chromosome-scale reference genome for red clover. We observed large blocks of conserved synteny with Medicago truncatula and estimated that the two species diverged ~23 million years ago. Among the 40,868 annotated genes, we identified gene clusters involved in biochemical pathways of importance for forage quality and livestock nutrition. Genotyping by sequencing of a synthetic population of 86 genotypes show that the number of markers required for genomics-based breeding approaches is tractable, making red clover a suitable candidate for association studies and genomic selection.


Subject(s)
Genome, Plant , Quantitative Trait, Heritable , Trifolium/genetics , Computational Biology/methods , Genes, Plant , Genomics/methods , Linkage Disequilibrium , Molecular Sequence Annotation , Multigene Family , Phenotype , Sequence Analysis, DNA
9.
Int J Genomics ; 2013: 857986, 2013.
Article in English | MEDLINE | ID: mdl-24328029

ABSTRACT

The study of microRNAs (miRNAs) in plants has gained significant attention in recent years due to their regulatory role during development and in response to biotic and abiotic stresses. Although cassava (Manihot esculenta Crantz) is tolerant to drought and other adverse conditions, most cassava miRNAs have been predicted using bioinformatics alone or through sequencing of plants challenged by biotic stress. Here, we use high-throughput sequencing and different bioinformatics methods to identify potential cassava miRNAs expressed in different tissues subject to heat and drought conditions. We identified 60 miRNAs conserved in other plant species and 821 potential cassava-specific miRNAs. We also predicted 134 and 1002 potential target genes for these two sets of sequences. Using real time PCR, we verified the condition-specific expression of 5 cassava small RNAs relative to a non-stress control. We also found, using publicly available expression data, a significantly lower expression of the predicted target genes of conserved and nonconserved miRNAs under drought stress compared to other cassava genes. Gene Ontology enrichment analysis along with condition specific expression of predicted miRNA targets, allowed us to identify several interesting miRNAs which may play a role in stress-induced posttranscriptional regulation in cassava and other plants.

10.
Plant J ; 76(3): 494-505, 2013 Nov.
Article in English | MEDLINE | ID: mdl-23889683

ABSTRACT

Advanced resources for genome-assisted research in barley (Hordeum vulgare) including a whole-genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole-genome resequencing and in silico variant detection. However such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA-coding exome reduces barley genomic complexity more than 50-fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in-solution hybridization-based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full-length cDNAs and de novo assembled RNA-Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA-coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping-by-sequencing and genetic diversity analyzes.


Subject(s)
Exome , Genome, Plant , Genomics/methods , Hordeum/genetics , Genomics/trends , Ploidies , Polymorphism, Single Nucleotide , Triticum/genetics
11.
Genome Biol ; 14(6): R66, 2013 Jun 25.
Article in English | MEDLINE | ID: mdl-23800085

ABSTRACT

BACKGROUND: The high level of identity among duplicated homoeologous genomes in tetraploid pasta wheat presents substantial challenges for de novo transcriptome assembly. To solve this problem, we develop a specialized bioinformatics workflow that optimizes transcriptome assembly and separation of merged homoeologs. To evaluate our strategy, we sequence and assemble the transcriptome of one of the diploid ancestors of pasta wheat, and compare both assemblies with a benchmark set of 13,472 full-length, non-redundant bread wheat cDNAs. RESULTS: A total of 489 million 100 bp paired-end reads from tetraploid wheat assemble in 140,118 contigs, including 96% of the benchmark cDNAs. We used a comparative genomics approach to annotate 66,633 open reading frames. The multiple k-mer assembly strategy increases the proportion of cDNAs assembled full-length in a single contig by 22% relative to the best single k-mer size. Homoeologs are separated using a post-assembly pipeline that includes polymorphism identification, phasing of SNPs, read sorting, and re-assembly of phased reads. Using a reference set of genes, we determine that 98.7% of SNPs analyzed are correctly separated by phasing. CONCLUSIONS: Our study shows that de novo transcriptome assembly of tetraploid wheat benefit from multiple k-mer assembly strategies more than diploid wheat. Our results also demonstrate that phasing approaches originally designed for heterozygous diploid organisms can be used to separate the close homoeologous genomes of tetraploid wheat. The predicted tetraploid wheat proteome and gene models provide a valuable tool for the wheat research community and for those interested in comparative genomic studies.


Subject(s)
Chromosomes, Plant/chemistry , Contig Mapping/methods , Genes, Plant , Genome, Plant , Transcriptome , Triticum/genetics , Base Sequence , Genetic Markers , Models, Genetic , Molecular Sequence Data , Open Reading Frames , Phylogeny , Polymorphism, Single Nucleotide , Pseudogenes , Tetraploidy , Triticum/classification
12.
BMC Plant Biol ; 11: 171, 2011 Nov 25.
Article in English | MEDLINE | ID: mdl-22118559

ABSTRACT

BACKGROUND: Common bean is an important legume crop with only a moderate number of short expressed sequence tags (ESTs) made with traditional methods. The goal of this research was to use full-length cDNA technology to develop ESTs that would overlap with the beginning of open reading frames and therefore be useful for gene annotation of genomic sequences. The library was also constructed to represent genes expressed under drought, low soil phosphorus and high soil aluminum toxicity. We also undertook comparisons of the full-length cDNA library to two previous non-full clone EST sets for common bean. RESULTS: Two full-length cDNA libraries were constructed: one for the drought tolerant Mesoamerican genotype BAT477 and the other one for the acid-soil tolerant Andean genotype G19833 which has been selected for genome sequencing. Plants were grown in three soil types using deep rooting cylinders subjected to drought and non-drought stress and tissues were collected from both roots and above ground parts. A total of 20,000 clones were selected robotically, half from each library. Then, nearly 10,000 clones from the G19833 library were sequenced with an average read length of 850 nucleotides. A total of 4,219 unigenes were identified consisting of 2,981 contigs and 1,238 singletons. These were functionally annotated with gene ontology terms and placed into KEGG pathways. Compared to other EST sequencing efforts in common bean, about half of the sequences were novel or represented the 5' ends of known genes. CONCLUSIONS: The present full-length cDNA libraries add to the technological toolbox available for common bean and our sequencing of these clones substantially increases the number of unique EST sequences available for the common bean genome. All of this should be useful for both functional gene annotation, analysis of splice site variants and intron/exon boundary determination by comparison to soybean genes or with common bean whole-genome sequences. In addition the library has a large number of transcription factors and will be interesting for discovery and validation of drought or abiotic stress related genes in common bean.


Subject(s)
Droughts , Expressed Sequence Tags , Gene Library , Phaseolus/genetics , Sequence Analysis, DNA/methods , DNA, Complementary/genetics , DNA, Plant/genetics , Gene Expression Regulation, Plant , Genotype
13.
Genome Res ; 19(7): 1316-23, 2009 Jul.
Article in English | MEDLINE | ID: mdl-19498102

ABSTRACT

Effective use of the human and mouse genomes requires reliable identification of genes and their products. Although multiple public resources provide annotation, different methods are used that can result in similar but not identical representation of genes, transcripts, and proteins. The collaborative consensus coding sequence (CCDS) project tracks identical protein annotations on the reference mouse and human genomes with a stable identifier (CCDS ID), and ensures that they are consistently represented on the NCBI, Ensembl, and UCSC Genome Browsers. Importantly, the project coordinates on manually reviewing inconsistent protein annotations between sites, as well as annotations for which new evidence suggests a revision is needed, to progressively converge on a complete protein-coding set for the human and mouse reference genomes, while maintaining a high standard of reliability and biological accuracy. To date, the project has identified 20,159 human and 17,707 mouse consensus coding regions from 17,052 human and 16,893 mouse genes. Three evaluation methods indicate that the entries in the CCDS set are highly likely to represent real proteins, more so than annotations from contributing groups not included in CCDS. The CCDS database thus centralizes the function of identifying well-supported, identically-annotated, protein-coding regions.


Subject(s)
Consensus Sequence , Genome , Open Reading Frames/genetics , Animals , Humans , Mice , Sequence Alignment
14.
BMC Bioinformatics ; 9: 115, 2008 Feb 25.
Article in English | MEDLINE | ID: mdl-18298812

ABSTRACT

BACKGROUND: Visualising the evolutionary history of a set of sequences is a challenge for molecular phylogenetics. One approach is to use undirected graphs, such as median networks, to visualise phylogenies where reticulate relationships such as recombination or homoplasy are displayed as cycles. Median networks contain binary representations of sequences as nodes, with edges connecting those sequences differing at one character; hypothetical ancestral nodes are invoked to generate a connected network which contains all most parsimonious trees. Quasi-median networks are a generalisation of median networks which are not restricted to binary data, although phylogenetic information contained within the multistate positions can be lost during the preprocessing of data. Where the history of a set of samples contain frequent homoplasies or recombination events quasi-median networks will have a complex topology. Graph reduction or pruning methods have been used to reduce network complexity but some of these methods are inapplicable to datasets in which recombination has occurred and others are procedurally complex and/or result in disconnected networks. RESULTS: We address the problems inherent in construction and reduction of quasi-median networks. We describe a novel method of generating quasi-median networks that uses all characters, both binary and multistate, without imposing an arbitrary ordering of the multistate partitions. We also describe a pruning mechanism which maintains at least one shortest path between observed sequences, displaying the underlying relations between all pairs of sequences while maintaining a connected graph. CONCLUSION: Application of this approach to 5S rDNA sequence data from sea beet produced a pruned network within which genetic isolation between populations by distance was evident, demonstrating the value of this approach for exploration of evolutionary relationships.


Subject(s)
Algorithms , Chromosome Mapping/methods , DNA Mutational Analysis/methods , Evolution, Molecular , RNA, Ribosomal, 5S/genetics , Sequence Alignment/methods , Sequence Analysis, DNA/methods , Base Sequence , Humans , Molecular Sequence Data
15.
Acta Otolaryngol ; 124(7): 839-46, 2004 Sep.
Article in English | MEDLINE | ID: mdl-15370570

ABSTRACT

OBJECTIVE: To develop a large animal model for studies of laryngeal abductor reinnervation. MATERIAL AND METHODS: Six minipigs underwent unilateral anastomosis of the phrenic nerve-abductor branch of the recurrent laryngeal nerve (RLN). Polyhydroxybutyrate (PHB) conduits were used for repair. At each of 30, 60 and 120 days, 2 animals underwent video laryngeal endoscopy (VLE) and were then killed. VLE was also performed in the 120-day pair at 60 days. Nerve-conduit-nerve-muscle samples were fixed for light and immunofluorescence (pan-neurofilaments, S-100) microscopy. Laryngeal muscles were harvested (myosin heavy chain analysis). RESULTS: VLE showed recovery of abductor function in 1 animal at 60 days and in 1 at 120 days. Haematoxylin-eosin staining demonstrated a complex inflammatory response. Eosinophil recruitment was observed. Stepwise regeneration and reorganization of the distal nerve between 30 and 120 days was observed with pan-NF staining. The mean minimum diameter in the reinnervated posterior crico-arytenoids tended to increase for up to 120 days. CONCLUSIONS: Anastomosis of the phrenic nerve-abductor branch of the RLN with a PHB conduit in a pig can result in functional and histological recovery within 2-4 months and appears to at least sustain abductor muscle fibre morphology. Recovery occurs despite a complex inflammatory response, which may be an essential part of healing rather than inhibitory.


Subject(s)
Laryngeal Muscles/innervation , Recurrent Laryngeal Nerve/surgery , Anastomosis, Surgical , Animals , Endoscopy , Female , Laryngeal Muscles/anatomy & histology , Laryngeal Muscles/cytology , Nerve Fibers, Myelinated/physiology , Phrenic Nerve/surgery , Pilot Projects , Recovery of Function , Recurrent Laryngeal Nerve/anatomy & histology , Recurrent Laryngeal Nerve/cytology , Regeneration/physiology , Respiratory Function Tests , Swine , Time Factors , Videotape Recording
SELECTION OF CITATIONS
SEARCH DETAIL
...