Search | VHL Regional Portal

Computational Metabolomics Operations at BioCyc.org.

Karp, Peter D; Billington, Richard; Holland, Timothy A; Kothari, Anamika; Krummenacker, Markus; Weaver, Daniel; Latendresse, Mario; Paley, Suzanne.

Metabolites ; 5(2): 291-310, 2015 May 22.

Article in English | MEDLINE | ID: mdl-26011592

ABSTRACT

BioCyc.org is a genome and metabolic pathway web portal covering 5500 organisms, including Homo sapiens, Arabidopsis thaliana, Saccharomyces cerevisiae and Escherichia coli. These organism-specific databases have undergone variable degrees of curation. The EcoCyc (Escherichia coli Encyclopedia) database is the most highly curated; its contents have been derived from 27,000 publications. The MetaCyc (Metabolic Encyclopedia) database within BioCyc is a "universal" metabolic database that describes pathways, reactions, enzymes and metabolites from all domains of life. Metabolic pathways provide an organizing framework for analyzing metabolomics data, and the BioCyc website provides computational operations for metabolomics data that include metabolite search and translation of metabolite identifiers across multiple metabolite databases. The site allows researchers to store and manipulate metabolite lists using a facility called SmartTables, which supports metabolite enrichment analysis. That analysis operation identifies metabolite sets that are statistically over-represented for the substrates of specific metabolic pathways. BioCyc also enables visualization of metabolomics data on individual pathway diagrams and on the organism-specific metabolic map diagrams that are available for every BioCyc organism. Most of these operations are available both interactively and as programmatic web services.

Response to the case of short-term international development work : comment on "global health case: questioning our contributions" by Kelly Anderson.

Holland, Alyson V F; Holland, Timothy A.

J Bioeth Inq ; 12(1): 155-6, 2015 Mar.

Article in English | MEDLINE | ID: mdl-25630593

Subject(s)

Global Health , International Cooperation , Humans

The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases.

Caspi, Ron; Altman, Tomer; Billington, Richard; Dreher, Kate; Foerster, Hartmut; Fulcher, Carol A; Holland, Timothy A; Keseler, Ingrid M; Kothari, Anamika; Kubo, Aya; Krummenacker, Markus; Latendresse, Mario; Mueller, Lukas A; Ong, Quang; Paley, Suzanne; Subhraveti, Pallavi; Weaver, Daniel S; Weerasinghe, Deepika; Zhang, Peifen; Karp, Peter D.

Nucleic Acids Res ; 42(Database issue): D459-71, 2014 Jan.

Article in English | MEDLINE | ID: mdl-24225315

ABSTRACT

The MetaCyc database (MetaCyc.org) is a comprehensive and freely accessible database describing metabolic pathways and enzymes from all domains of life. MetaCyc pathways are experimentally determined, mostly small-molecule metabolic pathways and are curated from the primary scientific literature. MetaCyc contains >2100 pathways derived from >37,000 publications, and is the largest curated collection of metabolic pathways currently available. BioCyc (BioCyc.org) is a collection of >3000 organism-specific Pathway/Genome Databases (PGDBs), each containing the full genome and predicted metabolic network of one organism, including metabolites, enzymes, reactions, metabolic pathways, predicted operons, transport systems and pathway-hole fillers. Additions to BioCyc over the past 2 years include YeastCyc, a PGDB for Saccharomyces cerevisiae, and 891 new genomes from the Human Microbiome Project. The BioCyc Web site offers a variety of tools for querying and analysis of PGDBs, including Omics Viewers and tools for comparative analysis. New developments include atom mappings in reactions, a new representation of glycan degradation pathways, improved compound structure display, better coverage of enzyme kinetic data, enhancements of the Web Groups functionality, improvements to the Omics viewers, a new representation of the Enzyme Commission system and, for the desktop version of the software, the ability to save display states.

Subject(s)

Databases, Chemical , Enzymes/metabolism , Metabolic Networks and Pathways , Enzymes/chemistry , Enzymes/classification , Gene Ontology , Genome , Internet , Kinetics , Metabolic Networks and Pathways/genetics , Polysaccharides/metabolism , Software

PortEco: a resource for exploring bacterial biology through high-throughput data and analysis tools.

Hu, James C; Sherlock, Gavin; Siegele, Deborah A; Aleksander, Suzanne A; Ball, Catherine A; Demeter, Janos; Gouni, Sushanth; Holland, Timothy A; Karp, Peter D; Lewis, John E; Liles, Nathan M; McIntosh, Brenley K; Mi, Huaiyu; Muruganujan, Anushya; Wymore, Farrell; Thomas, Paul D; Altman, Tomer.

Nucleic Acids Res ; 42(Database issue): D677-84, 2014 Jan.

Article in English | MEDLINE | ID: mdl-24285306

ABSTRACT

PortEco (http://porteco.org) aims to collect, curate and provide data and analysis tools to support basic biological research in Escherichia coli (and eventually other bacterial systems). PortEco is implemented as a 'virtual' model organism database that provides a single unified interface to the user, while integrating information from a variety of sources. The main focus of PortEco is to enable broad use of the growing number of high-throughput experiments available for E. coli, and to leverage community annotation through the EcoliWiki and GONUTS systems. Currently, PortEco includes curated data from hundreds of genome-wide RNA expression studies, from high-throughput phenotyping of single-gene knockouts under hundreds of annotated conditions, from chromatin immunoprecipitation experiments for tens of different DNA-binding factors and from ribosome profiling experiments that yield insights into protein expression. Conditions have been annotated with a consistent vocabulary, and data have been consistently normalized to enable users to find, compare and interpret relevant experiments. PortEco includes tools for data analysis, including clustering, enrichment analysis and exploration via genome browsers. PortEco search and data analysis tools are extensively linked to the curated gene, metabolic pathway and regulation content at its sister site, EcoCyc.

Subject(s)

Databases, Genetic , Escherichia coli/genetics , Alleles , DNA-Binding Proteins/metabolism , Escherichia coli/metabolism , Escherichia coli Proteins/metabolism , Genes, Bacterial , Genome, Bacterial , High-Throughput Nucleotide Sequencing , Internet , Phenotype , RNA, Messenger/metabolism , Ribosomes/metabolism , Software

Groups: knowledge spreadsheets for symbolic biocomputing.

Travers, Michael; Paley, Suzanne M; Shrager, Jeff; Holland, Timothy A; Karp, Peter D.

Database (Oxford) ; 2013: bat061, 2013.

Article in English | MEDLINE | ID: mdl-24037025

ABSTRACT

Knowledge spreadsheets (KSs) are a visual tool for interactive data analysis and exploration. They differ from traditional spreadsheets in that rather than being oriented toward numeric data, they work with symbolic knowledge representation structures and provide operations that take into account the semantics of the application domain. 'Groups' is an implementation of KSs within the Pathway Tools system. Groups allows Pathway Tools users to define a group of objects (e.g. groups of genes or metabolites) from a Pathway/Genome Database. Groups can be transformed (e.g. by transforming a metabolite group to the group of pathways in which those metabolites are substrates); combined through set operations; analysed (e.g. through enrichment analysis); and visualized (e.g. by painting onto a metabolic map diagram). Users of the Pathway Tools-based BioCyc.org website have made extensive use of Groups, and an informal survey of Groups users suggests that Groups has achieved the goal of allowing biologists themselves to perform some data manipulations that previously would have required the assistance of a programmer. Database URL: BioCyc.org.

Subject(s)

Computational Biology/methods , Databases as Topic , Knowledge Bases , Software , Databases, Genetic , Escherichia coli/genetics , Genes, Bacterial/genetics , Humans , Knowledge , Metabolic Networks and Pathways , Metabolome , Transcription Factors/genetics , User-Computer Interface

Partitioning protein structures into domains: why is it so difficult?

Holland, Timothy A; Veretnik, Stella; Shindyalov, Ilya N; Bourne, Philip E.

J Mol Biol ; 361(3): 562-90, 2006 Aug 18.

Article in English | MEDLINE | ID: mdl-16863650

ABSTRACT

This analysis takes an in-depth look into the difficulties encountered by automatic methods for domain decomposition from three-dimensional structure. The analysis involves a multi-faceted set of criteria including the integrity of secondary structure elements, the tendency toward fragmentation of domains, domain boundary consistency and topology. The strength of the analysis comes from the use of a new comprehensive benchmark dataset, which is based on consensus among experts (CATH, SCOP and AUTHORS of the 3D structures) and covers 30 distinct architectures and 211 distinct topologies as defined by CATH. Furthermore, over 66% of the structures are multi-domain proteins; each domain combination occurring once per dataset. The performance of four automatic domain assignment methods, DomainParser, NCBI, PDP and PUU, is carefully analyzed using this broad spectrum of topology combinations and knowledge of rules and assumptions built into each algorithm. We conclude that it is practically impossible for an automatic method to achieve the level of performance of human experts. However, we propose specific improvements to automatic methods as well as broadening the concept of a structural domain. Such work is prerequisite for establishing improved approaches to domain recognition. (The benchmark dataset is available from http://pdomains.sdsc.edu).

Subject(s)

Computer Simulation , Models, Molecular , Protein Structure, Secondary , Protein Structure, Tertiary , Computational Biology

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL