Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 17 de 17
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
2.
Genome Biol ; 15(11): 506, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25468217

RESUMO

BACKGROUND: The use of high throughput genome-sequencing technologies has uncovered a large extent of structural variation in eukaryotic genomes that makes important contributions to genomic diversity and phenotypic variation. When the genomes of different strains of a given organism are compared, whole genome resequencing data are typically aligned to an established reference sequence. However, when the reference differs in significant structural ways from the individuals under study, the analysis is often incomplete or inaccurate. RESULTS: Here, we use rice as a model to demonstrate how improvements in sequencing and assembly technology allow rapid and inexpensive de novo assembly of next generation sequence data into high-quality assemblies that can be directly compared using whole genome alignment to provide an unbiased assessment. Using this approach, we are able to accurately assess the "pan-genome" of three divergent rice varieties and document several megabases of each genome absent in the other two. CONCLUSIONS: Many of the genome-specific loci are annotated to contain genes, reflecting the potential for new biological properties that would be missed by standard reference-mapping approaches. We further provide a detailed analysis of several loci associated with agriculturally important traits, including the S5 hybrid sterility locus, the Sub1 submergence tolerance locus, the LRK gene cluster associated with improved yield, and the Pup1 cluster associated with phosphorus deficiency, illustrating the utility of our approach for biological discovery. All of the data and software are openly available to support further breeding and functional studies of rice and other species.


Assuntos
Variação Genética , Genoma de Planta , Oryza/genética , Locos de Características Quantitativas/genética , Cruzamento , Mapeamento Cromossômico , Sequenciamento de Nucleotídeos em Larga Escala , Fenótipo , Alinhamento de Sequência
3.
Nat Genet ; 44(7): 808-11, 2012 Jun 03.
Artigo em Inglês | MEDLINE | ID: mdl-22660546

RESUMO

Domestication and plant breeding are ongoing 10,000-year-old evolutionary experiments that have radically altered wild species to meet human needs. Maize has undergone a particularly striking transformation. Researchers have sought for decades to identify the genes underlying maize evolution, but these efforts have been limited in scope. Here, we report a comprehensive assessment of the evolution of modern maize based on the genome-wide resequencing of 75 wild, landrace and improved maize lines. We find evidence of recovery of diversity after domestication, likely introgression from wild relatives, and evidence for stronger selection during domestication than improvement. We identify a number of genes with stronger signals of selection than those previously shown to underlie major morphological changes. Finally, through transcriptome-wide analysis of gene expression, we find evidence both consistent with removal of cis-acting variation during maize domestication and improvement and suggestive of modern breeding having increased dominance in expression while targeting highly expressed genes.


Assuntos
Produtos Agrícolas/genética , Regulação da Expressão Gênica de Plantas , Genoma de Planta , Zea mays/genética , Cruzamento/métodos , Evolução Molecular , Metagenômica/métodos , Polimorfismo de Nucleotídeo Único , Seleção Genética , Transcriptoma/genética
4.
Nat Genet ; 44(7): 803-7, 2012 Jun 03.
Artigo em Inglês | MEDLINE | ID: mdl-22660545

RESUMO

Whereas breeders have exploited diversity in maize for yield improvements, there has been limited progress in using beneficial alleles in undomesticated varieties. Characterizing standing variation in this complex genome has been challenging, with only a small fraction of it described to date. Using a population genetics scoring model, we identified 55 million SNPs in 103 lines across pre-domestication and domesticated Zea mays varieties, including a representative from the sister genus Tripsacum. We find that structural variations are pervasive in the Z. mays genome and are enriched at loci associated with important traits. By investigating the drivers of genome size variation, we find that the larger Tripsacum genome can be explained by transposable element abundance rather than an allopolyploid origin. In contrast, intraspecies genome size variation seems to be controlled by chromosomal knob content. There is tremendous overlap in key gene content in maize and Tripsacum, suggesting that adaptations from Tripsacum (for example, perennialism and frost and drought tolerance) can likely be integrated into maize.


Assuntos
Genoma de Planta , Zea mays/genética , Alelos , Elementos de DNA Transponíveis , Tamanho do Genoma , Projeto HapMap , Polimorfismo de Nucleotídeo Único , Seleção Genética
6.
Proc Natl Acad Sci U S A ; 108(9): 3530-5, 2011 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-21245334

RESUMO

The grape is one of the earliest domesticated fruit crops and, since antiquity, it has been widely cultivated and prized for its fruit and wine. Here, we characterize genome-wide patterns of genetic variation in over 1,000 samples of the domesticated grape, Vitis vinifera subsp. vinifera, and its wild relative, V. vinifera subsp. sylvestris from the US Department of Agriculture grape germplasm collection. We find support for a Near East origin of vinifera and present evidence of introgression from local sylvestris as the grape moved into Europe. High levels of genetic diversity and rapid linkage disequilibrium (LD) decay have been maintained in vinifera, which is consistent with a weak domestication bottleneck followed by thousands of years of widespread vegetative propagation. The considerable genetic diversity within vinifera, however, is contained within a complex network of close pedigree relationships that has been generated by crosses among elite cultivars. We show that first-degree relationships are rare between wine and table grapes and among grapes from geographically distant regions. Our results suggest that although substantial genetic diversity has been maintained in the grape subsequent to domestication, there has been a limited exploration of this diversity. We propose that the adoption of vegetative propagation was a double-edged sword: Although it provided a benefit by ensuring true breeding cultivars, it also discouraged the generation of unique cultivars through crosses. The grape currently faces severe pathogen pressures, and the long-term sustainability of the grape and wine industries will rely on the exploitation of the grape's tremendous natural genetic diversity.


Assuntos
Produtos Agrícolas/genética , Produtos Agrícolas/história , Vitis/genética , Células Clonais , Haplótipos/genética , História Antiga , Desequilíbrio de Ligação/genética , Linhagem , Filogenia , Dinâmica Populacional , Análise de Componente Principal , Sementes/genética , Estados Unidos , United States Department of Agriculture
7.
PLoS One ; 5(1): e8219, 2010 Jan 13.
Artigo em Inglês | MEDLINE | ID: mdl-20084295

RESUMO

Next-generation sequencing technologies promise to dramatically accelerate the use of genetic information for crop improvement by facilitating the genetic mapping of agriculturally important phenotypes. The first step in optimizing the design of genetic mapping studies involves large-scale polymorphism discovery and a subsequent genome-wide assessment of the population structure and pattern of linkage disequilibrium (LD) in the species of interest. In the present study, we provide such an assessment for the grapevine (genus Vitis), the world's most economically important fruit crop. Reduced representation libraries (RRLs) from 17 grape DNA samples (10 cultivated V. vinifera and 7 wild Vitis species) were sequenced with sequencing-by-synthesis technology. We developed heuristic approaches for SNP calling, identified hundreds of thousands of SNPs and validated a subset of these SNPs on a 9K genotyping array. We demonstrate that the 9K SNP array provides sufficient resolution to distinguish among V. vinifera cultivars, between V. vinifera and wild Vitis species, and even among diverse wild Vitis species. We show that there is substantial sharing of polymorphism between V. vinifera and wild Vitis species and find that genetic relationships among V. vinifera cultivars agree well with their proposed geographic origins using principal components analysis (PCA). Levels of LD in the domesticated grapevine are low even at short ranges, but LD persists above background levels to 3 kb. While genotyping arrays are useful for assessing population structure and the decay of LD across large numbers of samples, we suggest that whole-genome sequencing will become the genotyping method of choice for genome-wide genetic mapping studies in high-diversity plant species. This study demonstrates that we can move quickly towards genome-wide studies of crop species using next-generation sequencing. Our study sets the stage for future work in other high diversity crop species, and provides a significant enhancement to current genetic resources available to the grapevine genetic community.


Assuntos
Genoma de Planta , Vitis/genética , Polimorfismo de Nucleotídeo Único
8.
Science ; 326(5956): 1115-7, 2009 Nov 20.
Artigo em Inglês | MEDLINE | ID: mdl-19965431

RESUMO

Maize is an important crop species of high genetic diversity. We identified and genotyped several million sequence polymorphisms among 27 diverse maize inbred lines and discovered that the genome was characterized by highly divergent haplotypes and showed 10- to 30-fold variation in recombination rates. Most chromosomes have pericentromeric regions with highly suppressed recombination that appear to have influenced the effectiveness of selection during maize inbred development and may be a major component of heterosis. We found hundreds of selective sweeps and highly differentiated regions that probably contain loci that are key to geographic adaptation. This survey of genetic diversity provides a foundation for uniting breeding efforts across the world and for dissecting complex traits through genome-wide association studies.


Assuntos
Mapeamento Cromossômico , Variação Genética , Genoma de Planta , Haplótipos , Recombinação Genética , Seleção Genética , Zea mays/genética , Cruzamento , Cromossomos de Plantas/genética , Evolução Molecular , Estudo de Associação Genômica Ampla , Heterozigoto , Vigor Híbrido , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA , Sorghum/genética
9.
PLoS Genet ; 5(11): e1000716, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19936050

RESUMO

MicroRNAs (miRNAs) are small, non-coding RNAs that play essential roles in plant growth, development, and stress response. We conducted a genome-wide survey of maize miRNA genes, characterizing their structure, expression, and evolution. Computational approaches based on homology and secondary structure modeling identified 150 high-confidence genes within 26 miRNA families. For 25 families, expression was verified by deep-sequencing of small RNA libraries that were prepared from an assortment of maize tissues. PCR-RACE amplification of 68 miRNA transcript precursors, representing 18 families conserved across several plant species, showed that splice variation and the use of alternative transcriptional start and stop sites is common within this class of genes. Comparison of sequence variation data from diverse maize inbred lines versus teosinte accessions suggest that the mature miRNAs are under strong purifying selection while the flanking sequences evolve equivalently to other genes. Since maize is derived from an ancient tetraploid, the effect of whole-genome duplication on miRNA evolution was examined. We found that, like protein-coding genes, duplicated miRNA genes underwent extensive gene-loss, with approximately 35% of ancestral sites retained as duplicate homoeologous miRNA genes. This number is higher than that observed with protein-coding genes. A search for putative miRNA targets indicated bias towards genes in regulatory and metabolic pathways. As maize is one of the principal models for plant growth and development, this study will serve as a foundation for future research into the functional roles of miRNA genes.


Assuntos
Genes de Plantas , MicroRNAs/genética , Zea mays/genética , Sequência de Bases , Sequência Conservada , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas , Variação Genética , Dados de Sequência Molecular , Família Multigênica , Nucleotídeos/genética , Fases de Leitura Aberta/genética , Especificidade de Órgãos/genética , Splicing de RNA/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Homologia de Sequência do Ácido Nucleico , Sorghum/genética , Sintenia/genética
10.
Immunogenetics ; 57(8): 559-65, 2005 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-16133449

RESUMO

The recent publication of the complete sequence of human chromosome 6 provides a platform from which to investigate genomic sequence variation. We report here a detailed linkage disequilibrium (LD) pattern map across the entire human chromosome 6p by using a set of 1152 single nucleotide polymorphisms (SNPs) in a population of 198 Singaporean Chinese, with 326 SNPs focused in the major histocompatibility complex (MHC) region. Our analysis shows some unexpectedly high segments of strong LD in a 10-Mb region that includes the extremely polymorphic and gene-rich MHC loci and many non-MHC genes. These include the telomeric peri-MHC region that harbors olfactory receptors, histones and zinc finger clusters, and the centromeric peri-MHC region that contains several unknown open reading frames. The data also help refine a human-mouse synteny break in the region between 28.6 and 29.4 Mb. The population-based LD map presented here will provide an essential resource for understanding the genomic sequence variation of chromosome 6p and LD mapping of disease genes of complex genetic traits.


Assuntos
Cromossomos Humanos Par 6 , Desequilíbrio de Ligação , Mapeamento Cromossômico , Humanos , Complexo Principal de Histocompatibilidade , Polimorfismo de Nucleotídeo Único , Recombinação Genética
11.
BMC Bioinformatics ; 5: 161, 2004 Oct 26.
Artigo em Inglês | MEDLINE | ID: mdl-15504241

RESUMO

BACKGROUND: Several in silico methods exist that were developed to predict protein interactions from the copious amount of genomic and proteomic data. One of these methods is Domain Fusion, which has proven to be effective in predicting functional links between proteins. RESULTS: Analyzing the structures of multi-domain single-chain peptides, we found that domain pairs located less than 30 residues apart on a chain are almost certain to share a physical interface. The majority of these interactions are also conserved across separate chains. We make use of this observation to improve domain fusion based protein interaction predictions, and demonstrate this by implementing it on a set of Saccharomyces cerevisiae proteins. CONCLUSION: We show that existing structural data supports the domain fusion hypothesis. Empirical information from structural data also enables us to refine and assess domain fusion based protein interaction predictions. These interactions can then be integrated with downstream biochemical and genetic assays to generate more reliable protein interaction data sets.


Assuntos
Mapeamento de Interação de Proteínas/métodos , Proteínas de Saccharomyces cerevisiae/química , Aminoácidos/genética , Sequência Conservada/genética , Bases de Dados de Proteínas , Peptídeos/química , Valor Preditivo dos Testes , Estrutura Terciária de Proteína/genética
12.
BMC Infect Dis ; 4: 34, 2004 Sep 09.
Artigo em Inglês | MEDLINE | ID: mdl-15357874

RESUMO

BACKGROUND: The molecular basis of severe acute respiratory syndrome (SARS) coronavirus (CoV) induced pathology is still largely unclear. Many SARS patients suffer respiratory distress brought on by interstitial infiltration and frequently show peripheral blood lymphopenia and occasional leucopenia. One possible cause of this could be interstitial inflammation, following a localized host response. In this study, we therefore examine the immune response of SARS-CoV in human peripheral blood mononuclear cells (PBMCs) over the first 24 hours. METHODS: PBMCs from normal healthy donors were inoculated in vitro with SARS-CoV and the viral replication kinetics was studied by real-time quantitative assays. SARS-CoV specific gene expression changes were examined by high-density oligonucleotide array analysis. RESULTS: We observed that SARS-CoV was capable of infecting and replicating in PBMCs and the kinetics of viral replication was variable among the donors. SARS-CoV antibody binding assays indicated that SARS specific antibodies inhibited SARS-CoV viral replication. Array data showed monocyte-macrophage cell activation, coagulation pathway upregulation and cytokine production together with lung trafficking chemokines such as IL8 and IL17, possibly activated through the TLR9 signaling pathway; that mimicked clinical features of the disease. CONCLUSIONS: The identification of human blood mononuclear cells as a direct target of SARS-CoV in the model system described here provides a new insight into disease pathology and a tool for investigating the host response and mechanisms of pathogenesis.


Assuntos
Leucócitos Mononucleares/virologia , Síndrome Respiratória Aguda Grave/virologia , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave/fisiologia , Replicação Viral/fisiologia , Animais , Chlorocebus aethiops , Regulação Viral da Expressão Gênica , Humanos , Cinética , Análise de Sequência com Séries de Oligonucleotídeos , Reação em Cadeia da Polimerase , RNA Viral/genética , RNA Viral/isolamento & purificação , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave/genética , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave/imunologia , Síndrome Respiratória Aguda Grave/imunologia , Células Vero
13.
Mol Biol Evol ; 21(6): 1146-51, 2004 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-15014147

RESUMO

With about 24,000 extant species, teleosts are the largest group of vertebrates. They constitute more than 99% of the ray-finned fishes (Actinopterygii) that diverged from the lobe-finned fish lineage (Sarcopterygii) about 450 MYA. Although the role of genome duplication in the evolution of vertebrates is now established, its role in structuring the teleost genomes has been controversial. At least two hypotheses have been proposed: a whole-genome duplication in an ancient ray-finned fish and independent gene duplications in different lineages. These hypotheses are, however, based on small data sets and lack adequate statistical and phylogenetic support. In this study, we have made a systematic comparison of the draft genome sequences of Fugu and humans to identify paralogous chromosomal regions ("paralogons") in the Fugu that arose in the ray-finned fish lineage ("fish-specific"). We identified duplicate genes in the Fugu by phylogenetic analyses of the Fugu, human, and invertebrate sequences. Our analyses provide evidence for 425 fish-specific duplicate genes in the Fugu and show that at least 6.6% of the genome is represented by fish-specific paralogons. We estimated the ages of Fugu duplicate genes and paralogons using the molecular clock. Remarkably, the ages of duplicate genes and paralogons are clustered, with a peak around 350 MYA. These data strongly suggest a whole-genome duplication event early during the evolution of ray-finned fishes, probably before the origin of teleosts.


Assuntos
Evolução Molecular , Duplicação Gênica , Genoma , Takifugu/genética , Animais , Caenorhabditis elegans/genética , Ciona intestinalis/genética , Análise por Conglomerados , Bases de Dados Genéticas , Drosophila melanogaster/genética , Humanos , Funções Verossimilhança , Modelos Genéticos
14.
Comp Funct Genomics ; 5(5): 403-18, 2004.
Artigo em Inglês | MEDLINE | ID: mdl-18629171

RESUMO

Studies on the zebrafish model have contributed to our understanding of several important developmental processes, especially those that can be easily studied in the embryo. However, our knowledge on late events such as gonad differentiation in the zebrafish is still limited. Here we provide an analysis on the gene sets expressed in the adult zebrafish testis and ovary in an attempt to identify genes with potential role in (zebra)fish gonad development and function. We produced 10,533 expressed sequence tags (ESTs) from zebrafish testis or ovary and downloaded an additional 23,642 gonad-derived sequences from the zebrafish EST database. We clustered these sequences together with over 13,000 kidney-derived zebrafish ESTs to study partial transcriptomes for these three organs. We searched for genes with gonad-specific expression by screening macroarrays containing at least 2600 unique cDNA inserts with testis-, ovary- and kidney-derived cDNA probes. Clones hybridizing to only one of the two gonad probes were selected, and subsequently screened with computational tools to identify 72 genes with potentially testis-specific and 97 genes with potentially ovary-specific expression, respectively. PCR-amplification confirmed gonad-specificity for 21 of the 45 clones tested (all without known function). Our study, which involves over 47,000 EST sequences and specialized cDNA arrays, is the first analysis of adult organ transcriptomes of zebrafish at such a scale. The study of genes expressed in adult zebrafish testis and ovary will provide useful information on regulation of gene expression in teleost gonads and might also contribute to our understanding of the development and differentiation of reproductive organs in vertebrates.

15.
Genome Res ; 13(8): 1904-15, 2003 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-12869579

RESUMO

We identify several challenges facing bioinformatics analysis today. Firstly, to fulfill the promise of comparative studies, bioinformatics analysis will need to accommodate different sources of data residing in a federation of databases that, in turn, come in different formats and modes of accessibility. Secondly, the tsunami of data to be handled will require robust systems that enable bioinformatics analysis to be carried out in a parallel fashion. Thirdly, the ever-evolving state of bioinformatics presents new algorithms and paradigms in conducting analysis. This means that any bioinformatics framework must be flexible and generic enough to accommodate such changes. In addition, we identify the need for introducing an explicit protocol-based approach to bioinformatics analysis that will lend rigorousness to the analysis. This makes it easier for experimentation and replication of results by external parties. Biopipe is designed in an effort to meet these goals. It aims to allow researchers to focus on protocol design. At the same time, it is designed to work over a compute farm and thus provides high-throughput performance. A common exchange format that encapsulates the entire protocol in terms of the analysis modules, parameters, and data versions has been developed to provide a powerful way in which to distribute and reproduce results. This will enable researchers to discuss and interpret the data better as the once implicit assumptions are now explicitly defined within the Biopipe framework.


Assuntos
Biologia Computacional/métodos , Software , Sequência de Aminoácidos , Animais , Bases de Dados de Proteínas , Proteínas de Drosophila/genética , Humanos , Filogenia , Proteínas/genética , Projetos de Pesquisa/tendências , Design de Software , Takifugu/genética
16.
Lancet ; 361(9371): 1779-85, 2003 May 24.
Artigo em Inglês | MEDLINE | ID: mdl-12781537

RESUMO

BACKGROUND: The cause of severe acute respiratory syndrome (SARS) has been identified as a new coronavirus. Whole genome sequence analysis of various isolates might provide an indication of potential strain differences of this new virus. Moreover, mutation analysis will help to develop effective vaccines. METHODS: We sequenced the entire SARS viral genome of cultured isolates from the index case (SIN2500) presenting in Singapore, from three primary contacts (SIN2774, SIN2748, and SIN2677), and one secondary contact (SIN2679). These sequences were compared with the isolates from Canada (TOR2), Hong Kong (CUHK-W1 and HKU39849), Hanoi (URBANI), Guangzhou (GZ01), and Beijing (BJ01, BJ02, BJ03, BJ04). FINDINGS: We identified 129 sequence variations among the 14 isolates, with 16 recurrent variant sequences. Common variant sequences at four loci define two distinct genotypes of the SARS virus. One genotype was linked with infections originating in Hotel M in Hong Kong, the second contained isolates from Hong Kong, Guangzhou, and Beijing with no association with Hotel M (p<0.0001). Moreover, other common sequence variants further distinguished the geographical origins of the isolates, especially between Singapore and Beijing. INTERPRETATION: Despite the recent onset of the SARS epidemic, genetic signatures are emerging that partition the worldwide SARS viral isolates into groups on the basis of contact source history and geography. These signatures can be used to trace sources of infection. In addition, a common variant associated with a non-conservative aminoacid change in the S1 region of the spike protein, suggests that immunological pressures might be starting to influence the evolution of the SARS virus in human populations.


Assuntos
Genoma Viral , Mutação , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave/genética , Sequência de Aminoácidos , Sequência de Bases , Humanos , Dados de Sequência Molecular , Fases de Leitura Aberta , Filogenia , Polimorfismo Genético , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave/classificação , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave/isolamento & purificação , Singapura
17.
Science ; 297(5585): 1301-10, 2002 Aug 23.
Artigo em Inglês | MEDLINE | ID: mdl-12142439

RESUMO

The compact genome of Fugu rubripes has been sequenced to over 95% coverage, and more than 80% of the assembly is in multigene-sized scaffolds. In this 365-megabase vertebrate genome, repetitive DNA accounts for less than one-sixth of the sequence, and gene loci occupy about one-third of the genome. As with the human genome, gene loci are not evenly distributed, but are clustered into sparse and dense regions. Some "giant" genes were observed that had average coding sequence sizes but were spread over genomic lengths significantly larger than those of their human orthologs. Although three-quarters of predicted human proteins have a strong match to Fugu, approximately a quarter of the human proteins had highly diverged from or had no pufferfish homologs, highlighting the extent of protein evolution in the 450 million years since teleosts and mammals diverged. Conserved linkages between Fugu and human genes indicate the preservation of chromosomal segments from the common vertebrate ancestor, but with considerable scrambling of gene order.


Assuntos
Genoma Humano , Genoma , Análise de Sequência de DNA , Takifugu/genética , Animais , Evolução Biológica , Biologia Computacional , Sequência Conservada , Elementos de DNA Transponíveis , Evolução Molecular , Éxons , Proteínas de Peixes/química , Proteínas de Peixes/genética , Duplicação Gênica , Ordem dos Genes , Genômica , Humanos , Íntrons , Mapeamento Físico do Cromossomo , Proteínas/química , Proteínas/genética , Proteoma , Sequências Repetitivas de Ácido Nucleico , Sintenia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...