Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 23
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Genetics ; 227(1)2024 05 07.
Artigo em Inglês | MEDLINE | ID: mdl-38457127

RESUMO

Since 1999, The Arabidopsis Information Resource (www.arabidopsis.org) has been curating data about the Arabidopsis thaliana genome. Its primary focus is integrating experimental gene function information from the peer-reviewed literature and codifying it as controlled vocabulary annotations. Our goal is to produce a "gold standard" functional annotation set that reflects the current state of knowledge about the Arabidopsis genome. At the same time, the resource serves as a nexus for community-based collaborations aimed at improving data quality, access, and reuse. For the past decade, our work has been made possible by subscriptions from our global user base. This update covers our ongoing biocuration work, some of our modernization efforts that contribute to the first major infrastructure overhaul since 2011, the introduction of JBrowse2, and the resource's role in community activities such as organizing the structural reannotation of the genome. For gene function assessment, we used gene ontology annotations as a metric to evaluate: (1) what is currently known about Arabidopsis gene function and (2) the set of "unknown" genes. Currently, 74% of the proteome has been annotated to at least one gene ontology term. Of those loci, half have experimental support for at least one of the following aspects: molecular function, biological process, or cellular component. Our work sheds light on the genes for which we have not yet identified any published experimental data and have no functional annotation. Drawing attention to these unknown genes highlights knowledge gaps and potential sources of novel discoveries.


Assuntos
Arabidopsis , Bases de Dados Genéticas , Anotação de Sequência Molecular , Arabidopsis/genética , Genoma de Planta , Ontologia Genética , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo
3.
Curr Protoc ; 2(10): e574, 2022 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-36200836

RESUMO

The Arabidopsis Information Resource (TAIR; http://arabidopsis.org) is a comprehensive web resource of Arabidopsis biology for plant scientists. TAIR curates and integrates information about genes, proteins, gene function, orthologs, gene expression, mutant phenotypes, biological materials such as clones and seed stocks, genetic markers, genetic and physical maps, genome organization, images of mutant plants, protein sub-cellular localizations, publications, and the research community. The various data types are extensively interconnected and can be accessed through a variety of web-based search and display tools. This article primarily focuses on some basic methods for searching, browsing, visualizing, and analyzing information about Arabidopsis genes and genomes. Additionally, we describe how members of the community can share data via JBrowse and the Generic Online Annotation Submission Tool (GOAT) in order to make their published research more accessible and visible. © 2022 Wiley Periodicals LLC. Basic Protocol 1: TAIR homepage, sitemap, and navigation Basic Protocol 2: Finding comprehensive information about Arabidopsis genes Basic Protocol 3: Using the Arabidopsis genome browser: JBrowse Basic Protocol 4: Using the Gene Ontology annotations for gene discovery and gene function analysis Basic Protocol 5: Using gene lists to download bulk datasets Basic Protocol 6: Using TAIR's analysis tools to find short sequences and motifs Basic Protocol 7: Using the TAIR generic online annotation tool (GOAT) to submit functional annotations for Arabidopsis (or any other species) genes Basic Protocol 8: Using PhyloGenes to visualize gene families and predict functions Basic Protocol 9: Using TAIR to browse Arabidopsis literature Basic Protocol 10: Using the synteny viewer to find and display syntenic regions from Arabidopsis and other plant species.


Assuntos
Proteínas de Arabidopsis , Arabidopsis , Animais , Arabidopsis/genética , Proteínas de Arabidopsis/genética , Bases de Dados Genéticas , Marcadores Genéticos , Genoma de Planta/genética , Cabras/genética
4.
Plant Direct ; 4(12): e00293, 2020 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-33392435

RESUMO

We aim to enable the accurate and efficient transfer of knowledge about gene function gained from Arabidopsis thaliana and other model organisms to other plant species. This knowledge transfer is frequently challenging in plants due to duplications of individual genes and whole genomes in plant lineages. Such duplications result in complex evolutionary relationships between related genes, which may have similar sequences but highly divergent functions. In such cases, functional inference requires more than a simple sequence similarity calculation. We have developed an online resource, PhyloGenes (phylogenes.org), that displays precomputed phylogenetic trees for plant gene families along with experimentally validated function information for individual genes within the families. A total of 40 plant genomes and 10 non-plant model organisms are represented in over 8,000 gene families. Evolutionary events such as speciation and duplication are clearly labeled on gene trees to distinguish orthologs from paralogs. Nearly 6,000 families have at least one member with an experimentally supported annotation to a Gene Ontology (GO) molecular function or biological process term. By displaying experimentally validated gene functions associated to individual genes within a tree, PhyloGenes enables functional inference for genes of uncharacterized function, based on their evolutionary relationships to experimentally studied genes, in a visually traceable manner. For the many families containing genes that have evolved to perform different functions, PhyloGenes facilitates the use of evolutionary history to determine the most likely function of genes that have not been experimentally characterized. Future work will enrich the resource by incorporating additional gene function datasets such as plant gene expression atlas data.

5.
Database (Oxford) ; 20192019 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-30715275

RESUMO

High-throughput studies constitute an essential and valued source of information for researchers. However, high-throughput experimental workflows are often complex, with multiple data sets that may contain large numbers of false positives. The representation of high-throughput data in the Gene Ontology (GO) therefore presents a challenging annotation problem, when the overarching goal of GO curation is to provide the most precise view of a gene's role in biology. To address this, representatives from annotation teams within the GO Consortium reviewed high-throughput data annotation practices. We present an annotation framework for high-throughput studies that will facilitate good standards in GO curation and, through the use of new high-throughput evidence codes, increase the visibility of these annotations to the research community.


Assuntos
Bases de Dados Genéticas , Ontologia Genética , Genômica/métodos , Anotação de Sequência Molecular/métodos , Animais , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Análise de Sequência de DNA
6.
Database (Oxford) ; 20182018 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-30239679

RESUMO

The future of agricultural research depends on data. The sheer volume of agricultural biological data being produced today makes excellent data management essential. Governmental agencies, publishers and science funders require data management plans for publicly funded research. Furthermore, the value of data increases exponentially when they are properly stored, described, integrated and shared, so that they can be easily utilized in future analyses. AgBioData (https://www.agbiodata.org) is a consortium of people working at agricultural biological databases, data archives and knowledgbases who strive to identify common issues in database development, curation and management, with the goal of creating database products that are more Findable, Accessible, Interoperable and Reusable. We strive to promote authentic, detailed, accurate and explicit communication between all parties involved in scientific data. As a step toward this goal, we present the current state of biocuration, ontologies, metadata and persistence, database platforms, programmatic (machine) access to data, communication and sustainability with regard to data curation. Each section describes challenges and opportunities for these topics, along with recommendations and best practices.


Assuntos
Agricultura , Bases de Dados Genéticas , Genômica , Cruzamento , Ontologia Genética , Metadados , Inquéritos e Questionários
8.
Curr Protoc Bioinformatics ; 60: 1.11.1-1.11.45, 2017 12 08.
Artigo em Inglês | MEDLINE | ID: mdl-29220077

RESUMO

The Arabidopsis Information Resource (TAIR; http://arabidopsis.org) is a comprehensive Web resource of Arabidopsis biology for plant scientists. TAIR curates and integrates information about genes, proteins, gene function, orthologs, gene expression, mutant phenotypes, biological materials such as clones and seed stocks, genetic markers, genetic and physical maps, genome organization, images of mutant plants, protein sub-cellular localizations, publications, and the research community. The various data types are extensively interconnected and can be accessed through a variety of Web-based search and display tools. This unit primarily focuses on some basic methods for searching, browsing, visualizing, and analyzing information about Arabidopsis genes and genome. Additionally, we describe how members of the community can share data using TAIR's Online Annotation Submission Tool (TOAST), in order to make their published research more accessible and visible. © 2017 by John Wiley & Sons, Inc.


Assuntos
Arabidopsis/genética , Bases de Dados Genéticas , Genes de Plantas , Proteínas de Arabidopsis/genética , Biologia Computacional , DNA de Plantas/genética , Mineração de Dados , Bases de Dados Genéticas/estatística & dados numéricos , Ontologia Genética , Genoma de Planta , Internet , Anotação de Sequência Molecular
9.
Artigo em Inglês | MEDLINE | ID: mdl-26989150

RESUMO

Databases and data repositories provide essential functions for the research community by integrating, curating, archiving and otherwise packaging data to facilitate discovery and reuse. Despite their importance, funding for maintenance of these resources is increasingly hard to obtain. Fueled by a desire to find long term, sustainable solutions to database funding, staff from the Arabidopsis Information Resource (TAIR), founded the nonprofit organization, Phoenix Bioinformatics, using TAIR as a test case for user-based funding. Subscription-based funding has been proposed as an alternative to grant funding but its application has been very limited within the nonprofit sector. Our testing of this model indicates that it is a viable option, at least for some databases, and that it is possible to strike a balance that maximizes access while still incentivizing subscriptions. One year after transitioning to subscription support, TAIR is self-sustaining and Phoenix is poised to expand and support additional resources that wish to incorporate user-based funding strategies. Database URL: www.arabidopsis.org.


Assuntos
Arabidopsis/genética , Curadoria de Dados/economia , Modelos Teóricos , Apoio à Pesquisa como Assunto/economia , Bases de Dados Genéticas , Software
10.
Genesis ; 53(8): 474-85, 2015 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-26201819

RESUMO

The Arabidopsis Information Resource (TAIR) is a continuously updated, online database of genetic and molecular biology data for the model plant Arabidopsis thaliana that provides a global research community with centralized access to data for over 30,000 Arabidopsis genes. TAIR's biocurators systematically extract, organize, and interconnect experimental data from the literature along with computational predictions, community submissions, and high throughput datasets to present a high quality and comprehensive picture of Arabidopsis gene function. TAIR provides tools for data visualization and analysis, and enables ordering of seed and DNA stocks, protein chips, and other experimental resources. TAIR actively engages with its users who contribute expertise and data that augments the work of the curatorial staff. TAIR's focus in an extensive and evolving ecosystem of online resources for plant biology is on the critically important role of extracting experimentally based research findings from the literature and making that information computationally accessible. In response to the loss of government grant funding, the TAIR team founded a nonprofit entity, Phoenix Bioinformatics, with the aim of developing sustainable funding models for biological databases, using TAIR as a test case. Phoenix has successfully transitioned TAIR to subscription-based funding while still keeping its data relatively open and accessible.


Assuntos
Arabidopsis/genética , Curadoria de Dados/métodos , Curadoria de Dados/normas , Bases de Dados Genéticas/normas , Genoma de Planta , Alelos , Proteínas de Arabidopsis/genética , Estudos de Associação Genética
11.
Methods Mol Biol ; 1062: 65-96, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-24057361

RESUMO

The volume of Arabidopsis information has increased enormously in recent years as a result of the sequencing of the reference genome and other large-scale functional genomics projects. Much of the data is stored in public databases, where data are organized, analyzed, and made freely accessible to the research community. These databases are resources that researchers can utilize for making predictions and developing testable hypotheses. The methods in this chapter describe ways to access and utilize Arabidopsis data and genomic resources found in databases and stock centers.


Assuntos
Arabidopsis/genética , Bases de Dados Genéticas , Sequência de Aminoácidos , Arabidopsis/metabolismo , Proteínas de Arabidopsis/química , Proteínas de Arabidopsis/genética , Mineração de Dados , Ontologia Genética , Genes de Plantas , Redes e Vias Metabólicas , Anotação de Sequência Molecular , Conformação Proteica , Sementes/genética
12.
Curr Protoc Bioinformatics ; Chapter 1: 1.11.1-1.11.51, 2010 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-20521243

RESUMO

The Arabidopsis Information Resource (TAIR; http://arabidopsis.org) is a comprehensive Web resource of Arabidopsis biology for plant scientists. TAIR curates and integrates information about genes, proteins, gene function, gene expression, mutant phenotypes, biological materials such as clones and seed stocks, genetic markers, genetic and physical maps, biochemical pathways, genome organization, images of mutant plants, protein sub-cellular localizations, publications, and the research community. The various data types are extensively interconnected and can be accessed through a variety of Web-based search and display tools. This unit primarily focuses on some basic methods for searching, browsing, visualizing, and analyzing information about Arabidopsis genes and describes several new tools such as a new TAIR genome browser (GBrowse), and the TAIR synteny viewer (GBrowse_syn). We also describe how to use AraCyc for mining plant metabolic pathways.


Assuntos
Arabidopsis/genética , Biologia Computacional , Bases de Dados Genéticas , Proteínas de Arabidopsis/genética , Mineração de Dados , Genes de Plantas
13.
Nucleic Acids Res ; 36(Database issue): D449-54, 2008 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18194960

RESUMO

The Plant Ontology Consortium (POC, http://www.plantontology.org) is a collaborative effort among model plant genome databases and plant researchers that aims to create, maintain and facilitate the use of a controlled vocabulary (ontology) for plants. The ontology allows users to ascribe attributes of plant structure (anatomy and morphology) and developmental stages to data types, such as genes and phenotypes, to provide a semantic framework to make meaningful cross-species and database comparisons. The POC builds upon groundbreaking work by the Gene Ontology Consortium (GOC) by adopting and extending the GOC's principles, existing software and database structure. Over the past year, POC has added hundreds of ontology terms to associate with thousands of genes and gene products from Arabidopsis, rice and maize, which are available through a newly updated web-based browser (http://www.plantontology.org/amigo/go.cgi) for viewing, searching and querying. The Consortium has also implemented new functionalities to facilitate the application of PO in genomic research and updated the website to keep the contents current. In this report, we present a brief description of resources available from the website, changes to the interfaces, data updates, community activities and future enhancement.


Assuntos
Bases de Dados Genéticas , Genoma de Planta , Desenvolvimento Vegetal , Plantas/anatomia & histologia , Vocabulário Controlado , Genes de Plantas , Internet , Plantas/genética , Interface Usuário-Computador
14.
Plant Physiol ; 143(2): 587-99, 2007 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-17142475

RESUMO

Formal description of plant phenotypes and standardized annotation of gene expression and protein localization data require uniform terminology that accurately describes plant anatomy and morphology. This facilitates cross species comparative studies and quantitative comparison of phenotypes and expression patterns. A major drawback is variable terminology that is used to describe plant anatomy and morphology in publications and genomic databases for different species. The same terms are sometimes applied to different plant structures in different taxonomic groups. Conversely, similar structures are named by their species-specific terms. To address this problem, we created the Plant Structure Ontology (PSO), the first generic ontological representation of anatomy and morphology of a flowering plant. The PSO is intended for a broad plant research community, including bench scientists, curators in genomic databases, and bioinformaticians. The initial releases of the PSO integrated existing ontologies for Arabidopsis (Arabidopsis thaliana), maize (Zea mays), and rice (Oryza sativa); more recent versions of the ontology encompass terms relevant to Fabaceae, Solanaceae, additional cereal crops, and poplar (Populus spp.). Databases such as The Arabidopsis Information Resource, Nottingham Arabidopsis Stock Centre, Gramene, MaizeGDB, and SOL Genomics Network are using the PSO to describe expression patterns of genes and phenotypes of mutants and natural variants and are regularly contributing new annotations to the Plant Ontology database. The PSO is also used in specialized public databases, such as BRENDA, GENEVESTIGATOR, NASCArrays, and others. Over 10,000 gene annotations and phenotype descriptions from participating databases can be queried and retrieved using the Plant Ontology browser. The PSO, as well as contributed gene associations, can be obtained at www.plantontology.org.


Assuntos
Magnoliopsida/anatomia & histologia , Estruturas Vegetais/anatomia & histologia , Estruturas Vegetais/classificação , Terminologia como Assunto , Regulação da Expressão Gênica de Plantas , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Interface Usuário-Computador
15.
Plant Physiol ; 142(2): 414-28, 2006 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-16905665

RESUMO

Plant growth stages are identified as distinct morphological landmarks in a continuous developmental process. The terms describing these developmental stages record the morphological appearance of the plant at a specific point in its life cycle. The widely differing morphology of plant species consequently gave rise to heterogeneous vocabularies describing growth and development. Each species or family specific community developed distinct terminologies for describing whole-plant growth stages. This semantic heterogeneity made it impossible to use growth stage description contained within plant biology databases to make meaningful computational comparisons. The Plant Ontology Consortium (http://www.plantontology.org) was founded to develop standard ontologies describing plant anatomical as well as growth and developmental stages that can be used for annotation of gene expression patterns and phenotypes of all flowering plants. In this article, we describe the development of a generic whole-plant growth stage ontology that describes the spatiotemporal stages of plant growth as a set of landmark events that progress from germination to senescence. This ontology represents a synthesis and integration of terms and concepts from a variety of species-specific vocabularies previously used for describing phenotypes and genomic information. It provides a common platform for annotating gene function and gene expression in relation to the developmental trajectory of a plant described at the organismal level. As proof of concept the Plant Ontology Consortium used the plant ontology growth stage ontology to annotate genes and phenotypes in plants with initial emphasis on those represented in The Arabidopsis Information Resource, Gramene database, and MaizeGDB.


Assuntos
Arabidopsis/crescimento & desenvolvimento , Botânica/métodos , Oryza/crescimento & desenvolvimento , Terminologia como Assunto , Zea mays/crescimento & desenvolvimento , Germinação , Folhas de Planta , Brotos de Planta , Reprodução , Software
16.
Methods Mol Biol ; 323: 187-211, 2006.
Artigo em Inglês | MEDLINE | ID: mdl-16739578

RESUMO

The volume of Arabidopsis information has increased enormously in recent years as a result of the sequencing of the genome and other large-scale genomic projects. Much of the data are stored in public databases, where data are organized, analyzed, and made freely accessible to the research community. These databases are resources that researchers can utilize for making predictions and developing testable hypotheses. The methods in this chapter describe ways to access and utilize Arabidopsis data and genomic resources found in databases.


Assuntos
Arabidopsis/genética , Bases de Dados Genéticas , Regulação da Expressão Gênica de Plantas , Acesso à Informação , Proteínas de Arabidopsis/genética , Cromossomos de Plantas/genética , Biologia Computacional/métodos , Etiquetas de Sequências Expressas , Genes de Plantas , Técnicas Genéticas , Genoma de Planta , Análise de Sequência com Séries de Oligonucleotídeos , Análise de Sequência de DNA , Software
17.
Curr Protoc Bioinformatics ; Chapter 1: Unit 1.11, 2005 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-18428741

RESUMO

The Arabidopsis Information Resource (TAIR; http://www.arabidopsis.org) is a comprehensive Web resource of Arabidopsis biology for plant scientists. TAIR curates and integrates information about genes, proteins, gene expression, mutant phenotypes, biological materials such as DNA and seed stocks, genetic markers, genetic and physical maps, biochemical pathways, genome organization, images of mutant plants and protein sub-cellular localizations, publications, and the research community Data in TAIR are extensively interconnected and can be accessed through a variety of Web-based search and display tools. This unit primarily focuses on some basic methods for searching, browsing, visualizing, and analyzing information about Arabidopsis genes. Gene expression data from microarrays is a recent addition to the database and methods for accessing these data are also described. Two pattern identification programs are described for mining TAIR's unique Arabidopsis sequence data sets. We also describe how to use AraCyc for mining plant metabolic pathways.


Assuntos
Proteínas de Arabidopsis/química , Proteínas de Arabidopsis/fisiologia , Arabidopsis/fisiologia , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Factuais , Armazenamento e Recuperação da Informação/métodos , Interface Usuário-Computador , Gráficos por Computador , Apresentação de Dados
18.
Comp Funct Genomics ; 6(7-8): 388-97, 2005.
Artigo em Inglês | MEDLINE | ID: mdl-18629207

RESUMO

The Plant Ontology Consortium (POC) (www.plantontology.org) is a collaborative effort among several plant databases and experts in plant systematics, botany and genomics. A primary goal of the POC is to develop simple yet robust and extensible controlled vocabularies that accurately reflect the biology of plant structures and developmental stages. These provide a network of vocabularies linked by relationships (ontology) to facilitate queries that cut across datasets within a database or between multiple databases. The current version of the ontology integrates diverse vocabularies used to describe Arabidopsis, maize and rice (Oryza sp.) anatomy, morphology and growth stages. Using the ontology browser, over 3500 gene annotations from three species-specific databases, The Arabidopsis Information Resource (TAIR) for Arabidopsis, Gramene for rice and MaizeGDB for maize, can now be queried and retrieved.

19.
Plant Physiol ; 135(2): 745-55, 2004 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-15173566

RESUMO

Controlled vocabularies are increasingly used by databases to describe genes and gene products because they facilitate identification of similar genes within an organism or among different organisms. One of The Arabidopsis Information Resource's goals is to associate all Arabidopsis genes with terms developed by the Gene Ontology Consortium that describe the molecular function, biological process, and subcellular location of a gene product. We have also developed terms describing Arabidopsis anatomy and developmental stages and use these to annotate published gene expression data. As of March 2004, we used computational and manual annotation methods to make 85,666 annotations representing 26,624 unique loci. We focus on associating genes to controlled vocabulary terms based on experimental data from the literature and use The Arabidopsis Information Resource-developed PubSearch software to facilitate this process. Each annotation is tagged with a combination of evidence codes, evidence descriptions, and references that provide a robust means to assess data quality. Annotation of all Arabidopsis genes will allow quantitative comparisons between sets of genes derived from sources such as microarray experiments. The Arabidopsis annotation data will also facilitate annotation of newly sequenced plant genomes by using sequence similarity to transfer annotations to homologous genes. In addition, complete and up-to-date annotations will make unknown genes easy to identify and target for experimentation. Here, we describe the process of Arabidopsis functional annotation using a variety of data sources and illustrate several ways in which this information can be accessed and used to infer knowledge about Arabidopsis and other plant species.


Assuntos
Arabidopsis/genética , Genoma de Planta , Vocabulário Controlado , Arabidopsis/crescimento & desenvolvimento , Bases de Dados Factuais , Regulação da Expressão Gênica no Desenvolvimento , Regulação da Expressão Gênica de Plantas
20.
Genetics ; 166(4): 1601-9, 2004 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-15126383

RESUMO

The National Science Foundation's recent mandate that all Principal Investigators address the broader impacts of their research has prompted an unprecedented number of scientists to seek opportunities to participate in precollege education and outreach. To help interested geneticists avoid duplicating efforts and make use of existing resources, we examined several precollege genetics, genomics, and biotechnology education efforts and noted the elements that contributed to their success, indicated by program expansion, participant satisfaction, or participant learning. Identifying a specific audience and their needs and resources, involving K-12 teachers in program development, and evaluating program efforts are integral to program success. We highlighted a few innovative programs to illustrate these findings. Challenges that may compromise further development and dissemination of these programs include absence of reward systems for participation in outreach as well as lack of training for scientists doing outreach. Several programs and institutions are tackling these issues in ways that will help sustain outreach efforts while allowing them to be modified to meet the changing needs of their participants, including scientists, teachers, and students. Most importantly, resources and personnel are available to facilitate greater and deeper involvement of scientists in precollege and public education.


Assuntos
Relações Comunidade-Instituição/tendências , Genética/educação , Pesquisadores/educação , Instituições Acadêmicas , Biotecnologia/educação , Genômica/educação , Estados Unidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...