Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 33
Filter
Add more filters










Publication year range
1.
J Chem Inf Model ; 63(17): 5484-5495, 2023 09 11.
Article in English | MEDLINE | ID: mdl-37635298

ABSTRACT

Computer-assisted synthetic planning has seen major advancements that stem from the availability of large reaction databases and artificial intelligence methodologies. SynRoute is a new retrosynthetic planning software tool that uses a relatively small number of general reaction templates, currently 263, along with a literature-based reaction database to find short, practical synthetic routes for target compounds. For each reaction template, a machine learning classifier is trained using data from the Pistachio reaction database to predict whether new computer-generated reactions based on the template are likely to work experimentally in the laboratory. This reaction generation methodology is used together with a vectorized Dijkstra-like search of top-scoring routes organized by synthetic strategies for easy browsing by a synthetic chemist. SynRoute was able to find routes for an average of 83% of compounds based on selection of random subsets of drug-like compounds from the ChEMBL database. Laboratory evaluation of 12 routes produced by SynRoute, to synthesize compounds not from the previous random subsets, demonstrated the ability to produce feasible overall synthetic strategies for all compounds evaluated.


Subject(s)
Artificial Intelligence , Software , Databases, Factual , Machine Learning
2.
Methods Mol Biol ; 2349: 259-289, 2022.
Article in English | MEDLINE | ID: mdl-34718999

ABSTRACT

The MetaFlux software supports creating, executing, and solving quantitative metabolic flux models using flux balance analysis (FBA). MetaFlux offers four modes of operation: (1) solving mode executes an FBA model for an individual organism or for an organism community, (2) gene knockout mode executes an FBA model with one or many gene knockouts, (3) development mode assists the user in creating and improving FBA models, and (4) flux variability analysis mode generates a report of the robustness of an FBA model. MetaFlux also solves dynamic FBA (dFBA) for both individual organisms and communities of organisms. MetaFlux can be used in two different environments: on your local computer, which requires the installation of the Pathway Tools software, or through the web, which does not require installation of Pathway Tools. On your local computer, MetaFlux offers all four modes of operation, whereas the web environment provides only the solving mode.Several visualization tools are available to analyze model solutions. The Cellular Overview tool graphically shows the reaction fluxes on an organism's metabolic map once a model is solved. The Omics Dashboard provides a hierarchical approach to visualizing reaction fluxes, organized by metabolic subsystems. For a community of organisms, plotting of accumulated biomasses and metabolites can be performed using the Gnuplot tool.In this chapter, we present eight methods using MetaFlux. Five solving mode methods illustrate execution of models for individual organisms and for organism communities. One method illustrates the gene knockout mode. Two methods for the development mode illustrate steps for developing new metabolic models.


Subject(s)
Metabolic Networks and Pathways , Models, Biological , Software , Algorithms , Biomass , Gene Knockout Techniques , Metabolic Flux Analysis
3.
Brief Bioinform ; 22(1): 109-126, 2021 01 18.
Article in English | MEDLINE | ID: mdl-31813964

ABSTRACT

MOTIVATION: Biological systems function through dynamic interactions among genes and their products, regulatory circuits and metabolic networks. Our development of the Pathway Tools software was motivated by the need to construct biological knowledge resources that combine these many types of data, and that enable users to find and comprehend data of interest as quickly as possible through query and visualization tools. Further, we sought to support the development of metabolic flux models from pathway databases, and to use pathway information to leverage the interpretation of high-throughput data sets. RESULTS: In the past 4 years we have enhanced the already extensive Pathway Tools software in several respects. It can now support metabolic-model execution through the Web, it provides a more accurate gap filler for metabolic models; it supports development of models for organism communities distributed across a spatial grid; and model results may be visualized graphically. Pathway Tools supports several new omics-data analysis tools including the Omics Dashboard, multi-pathway diagrams called pathway collages, a pathway-covering algorithm for metabolomics data analysis and an algorithm for generating mechanistic explanations of multi-omics data. We have also improved the core pathway/genome databases management capabilities of the software, providing new multi-organism search tools for organism communities, improved graphics rendering, faster performance and re-designed gene and metabolite pages. AVAILABILITY: The software is free for academic use; a fee is required for commercial use. See http://pathwaytools.com. CONTACT: pkarp@ai.sri.com. SUPPLEMENTARY INFORMATION: Supplementary data are available at Briefings in Bioinformatics online.


Subject(s)
Genomics/methods , Metabolomics/methods , Software/standards , Systems Biology/methods , Animals , Humans
4.
Microbiome ; 7(1): 89, 2019 06 07.
Article in English | MEDLINE | ID: mdl-31174602

ABSTRACT

BACKGROUND: Microbiomes are complex aggregates of organisms, each of which has its own extensive metabolic network. A variety of metabolites are exchanged between the microbes. The challenge we address is understanding the overall metabolic capabilities of a microbiome: through what series of metabolic transformations can a microbiome convert a starting compound to an ending compound? RESULTS: We developed an efficient software tool to search for metabolic routes that include metabolic reactions from multiple organisms. The metabolic network for each organism is obtained from BioCyc, where the network was inferred from the annotated genome. The tool searches for optimal metabolic routes that minimize the number of reactions in each route, maximize the number of atoms conserved between the starting and ending compounds, and minimize the number of organism switches. The tool pre-computes the reaction sets found in each organism from BioCyc to facilitate fast computation of the reactions defined in a researcher-specified organism set. The generated routes are depicted graphically, and for each reaction in a route, the tool lists the organisms that can catalyze that reaction. We present solutions for three route-finding problems in the human gut microbiome: (1) production of indoxyl sulfate, (2) production of trimethylamine N-oxide (TMAO), and (3) synthesis and degradation of autoinducers. The optimal routes computed by our multi-organism route-search (MORS) tool for indoxyl sulfate and TMAO were the same as routes reported in the literature. CONCLUSIONS: Our tool quickly found plausible routes for the discussed multi-organism route-finding problems. The routes shed light on how diverse organisms cooperate to perform multi-step metabolic transformations. Our tool enables scientists to consider multiple alternative routes and identifies the organisms responsible for each reaction.


Subject(s)
Computational Biology/methods , Gastrointestinal Microbiome , Metabolic Networks and Pathways , Software , Databases, Genetic , Humans , Indican/biosynthesis , Metagenome , Methylamines/metabolism
5.
Metabolites ; 9(5)2019 May 02.
Article in English | MEDLINE | ID: mdl-31052521

ABSTRACT

Interpreting changes in metabolite abundance in response to experimental treatments or disease states remains a major challenge in metabolomics. Pathway Covering is a new algorithm that takes a list of metabolites (compounds) and determines a minimum-cost set of metabolic pathways in an organism that includes (covers) all the metabolites in the list. We used five functions for assigning costs to pathways, including assigning a constant for all pathways, which yields a solution with the smallest pathway count; two methods that penalize large pathways; one that prefers pathways based on the pathway's assigned function, and one that loosely corresponds to metabolic flux. The pathway covering set computed by the algorithm can be displayed as a multi-pathway diagram ("pathway collage") that highlights the covered metabolites. We investigated the pathway covering algorithm by using several datasets from the Metabolomics Workbench. The algorithm is best applied to a list of metabolites with significant statistics and fold-changes with a specified direction of change for each metabolite. The pathway covering algorithm is now available within the Pathway Tools software and BioCyc website.

6.
Front Microbiol ; 10: 208, 2019.
Article in English | MEDLINE | ID: mdl-30853946

ABSTRACT

Microbial genome web portals have a broad range of capabilities that address a number of information-finding and analysis needs for scientists. This article compares the capabilities of the major microbial genome web portals to aid researchers in determining which portal(s) are best suited to their needs. We assessed both the bioinformatics tools and the data content of BioCyc, KEGG, Ensembl Bacteria, KBase, IMG, and PATRIC. For each portal, our assessment compared and tallied the available capabilities. The strengths of BioCyc include its genomic and metabolic tools, multi-search capabilities, table-based analysis tools, regulatory network tools and data, omics data analysis tools, breadth of data content, and large amount of curated data. The strengths of KEGG include its genomic and metabolic tools. The strengths of Ensembl Bacteria include its genomic tools and large number of genomes. The strengths of KBase include its genomic tools and metabolic models. The strengths of IMG include its genomic tools, multi-search capabilities, large number of genomes, table-based analysis tools, and breadth of data content. The strengths of PATRIC include its large number of genomes, table-based analysis tools, metabolic models, and breadth of data content.

7.
Brief Bioinform ; 20(4): 1085-1093, 2019 07 19.
Article in English | MEDLINE | ID: mdl-29447345

ABSTRACT

BioCyc.org is a microbial genome Web portal that combines thousands of genomes with additional information inferred by computer programs, imported from other databases and curated from the biomedical literature by biologist curators. BioCyc also provides an extensive range of query tools, visualization services and analysis software. Recent advances in BioCyc include an expansion in the content of BioCyc in terms of both the number of genomes and the types of information available for each genome; an expansion in the amount of curated content within BioCyc; and new developments in the BioCyc software tools including redesigned gene/protein pages and metabolite pages; new search tools; a new sequence-alignment tool; a new tool for visualizing groups of related metabolic pathways; and a facility called SmartTables, which enables biologists to perform analyses that previously would have required a programmer's assistance.


Subject(s)
Genome, Microbial , Metabolic Networks and Pathways , Software , Computational Biology , Databases, Genetic , Escherichia coli/genetics , Escherichia coli/metabolism , Genomics , Internet , Models, Biological , Search Engine
8.
EcoSal Plus ; 8(1)2018 11.
Article in English | MEDLINE | ID: mdl-30406744

ABSTRACT

EcoCyc is a bioinformatics database available at EcoCyc.org that describes the genome and the biochemical machinery of Escherichia coli K-12 MG1655. The long-term goal of the project is to describe the complete molecular catalog of the E. coli cell, as well as the functions of each of its molecular parts, to facilitate a system-level understanding of E. coli. EcoCyc is an electronic reference source for E. coli biologists and for biologists who work with related microorganisms. The database includes information pages on each E. coli gene product, metabolite, reaction, operon, and metabolic pathway. The database also includes information on E. coli gene essentiality and on nutrient conditions that do or do not support the growth of E. coli. The website and downloadable software contain tools for analysis of high-throughput data sets. In addition, a steady-state metabolic flux model is generated from each new version of EcoCyc and can be executed via EcoCyc.org. The model can predict metabolic flux rates, nutrient uptake rates, and growth rates for different gene knockouts and nutrient conditions. This review outlines the data content of EcoCyc and of the procedures by which this content is generated.


Subject(s)
Databases, Genetic , Escherichia coli K12/genetics , Genome, Bacterial , Software , Computational Biology , Escherichia coli Proteins/genetics , Escherichia coli Proteins/metabolism , Gene Expression Regulation, Bacterial , Internet , Metabolic Flux Analysis , Metabolic Networks and Pathways/genetics , User-Computer Interface
9.
BMC Syst Biol ; 12(1): 73, 2018 06 19.
Article in English | MEDLINE | ID: mdl-29914471

ABSTRACT

BACKGROUND: Reaction gap filling is a computational technique for proposing the addition of reactions to genome-scale metabolic models to permit those models to run correctly. Gap filling completes what are otherwise incomplete models that lack fully connected metabolic networks. The models are incomplete because they are derived from annotated genomes in which not all enzymes have been identified. Here we compare the results of applying an automated likelihood-based gap filler within the Pathway Tools software with the results of manually gap filling the same metabolic model. Both gap-filling exercises were applied to the same genome-derived qualitative metabolic reconstruction for Bifidobacterium longum subsp. longum JCM 1217, and to the same modeling conditions - anaerobic growth under four nutrients producing 53 biomass metabolites. RESULTS: The solution computed by the gap-filling program GenDev contained 12 reactions, but closer examination showed that solution was not minimal; two of the twelve reactions can be removed to yield a set of ten reactions that enable model growth. The manually curated solution contained 13 reactions, eight of which were shared with the 12-reaction computed solution. Thus, GenDev achieved recall of 61.5% and precision of 66.6%. These results suggest that although computational gap fillers are populating metabolic models with significant numbers of correct reactions, automatically gap-filled metabolic models also contain significant numbers of incorrect reactions. CONCLUSIONS: Our conclusion is that manual curation of gap-filler results is needed to obtain high-accuracy models. Many of the differences between the manual and automatic solutions resulted from using expert biological knowledge to direct the choice of reactions within the curated solution, such as reactions specific to the anaerobic lifestyle of B. longum.


Subject(s)
Metabolism , Models, Biological , Algorithms , Automation , Bifidobacterium longum/metabolism , Biomass
10.
BMC Bioinformatics ; 19(1): 53, 2018 02 14.
Article in English | MEDLINE | ID: mdl-29444634

ABSTRACT

BACKGROUND: Completion of genome-scale flux-balance models using computational reaction gap-filling is a widely used approach, but its accuracy is not well known. RESULTS: We report on computational experiments of reaction gap filling in which we generated degraded versions of the EcoCyc-20.0-GEM model by randomly removing flux-carrying reactions from a growing model. We gap-filled the degraded models and compared the resulting gap-filled models with the original model. Gap-filling was performed by the Pathway Tools MetaFlux software using its General Development Mode (GenDev) and its Fast Development Mode (FastDev). We explored 12 GenDev variants including two linear solvers (SCIP and CPLEX) for solving the Mixed Integer Linear Programming (MILP) problems for gap filling; three different sets of linear constraints were applied; and two MILP methods were implemented. We compared these 13 variants according to accuracy, speed, and amount of information returned to the user. CONCLUSIONS: We observed large variation among the performance of the 13 gap-filling variants. Although no variant was best in all dimensions, we found one variant that was fast, accurate, and returned more information to the user. Some gap-filling variants were inaccurate, producing solutions that were non-minimum or invalid (did not enable model growth). The best GenDev variant showed a best average precision of 87% and a best average recall of 61%. FastDev showed an average precision of 71% and an average recall of 59%. Thus, using the most accurate variant, approximately 13% of the gap-filled reactions were incorrect (were not the reactions removed from the model), and 39% of gap-filled reactions were not found, suggesting that curation is still an important aspect of metabolic-model development.


Subject(s)
Metabolic Flux Analysis/methods , Models, Biological , Algorithms , Genomics , Programming, Linear , Software
11.
Nucleic Acids Res ; 46(D1): D633-D639, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29059334

ABSTRACT

MetaCyc (https://MetaCyc.org) is a comprehensive reference database of metabolic pathways and enzymes from all domains of life. It contains more than 2570 pathways derived from >54 000 publications, making it the largest curated collection of metabolic pathways. The data in MetaCyc is strictly evidence-based and richly curated, resulting in an encyclopedic reference tool for metabolism. MetaCyc is also used as a knowledge base for generating thousands of organism-specific Pathway/Genome Databases (PGDBs), which are available in the BioCyc (https://BioCyc.org) and other PGDB collections. This article provides an update on the developments in MetaCyc during the past two years, including the expansion of data and addition of new features.


Subject(s)
Databases, Factual , Enzymes/metabolism , Metabolic Networks and Pathways , Animals , Archaea/metabolism , Bacteria/metabolism , Data Curation , Databases, Chemical , Databases, Protein , Humans , Internet , Phylogeny , Plants/metabolism , Software , Species Specificity
12.
Nucleic Acids Res ; 45(D1): D543-D550, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27899573

ABSTRACT

EcoCyc (EcoCyc.org) is a freely accessible, comprehensive database that collects and summarizes experimental data for Escherichia coli K-12, the best-studied bacterial model organism. New experimental discoveries about gene products, their function and regulation, new metabolic pathways, enzymes and cofactors are regularly added to EcoCyc. New SmartTable tools allow users to browse collections of related EcoCyc content. SmartTables can also serve as repositories for user- or curator-generated lists. EcoCyc now supports running and modifying E. coli metabolic models directly on the EcoCyc website.


Subject(s)
Computational Biology/methods , Databases, Genetic , Escherichia coli K12/genetics , Escherichia coli K12/metabolism , Energy Metabolism , Escherichia coli Proteins/genetics , Escherichia coli Proteins/metabolism , Gene Expression Regulation, Bacterial , Metabolic Networks and Pathways , Signal Transduction , Software , Transcription Factors/metabolism , Web Browser
13.
Nucleic Acids Res ; 44(D1): D471-80, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26527732

ABSTRACT

The MetaCyc database (MetaCyc.org) is a freely accessible comprehensive database describing metabolic pathways and enzymes from all domains of life. The majority of MetaCyc pathways are small-molecule metabolic pathways that have been experimentally determined. MetaCyc contains more than 2400 pathways derived from >46,000 publications, and is the largest curated collection of metabolic pathways. BioCyc (BioCyc.org) is a collection of 5700 organism-specific Pathway/Genome Databases (PGDBs), each containing the full genome and predicted metabolic network of one organism, including metabolites, enzymes, reactions, metabolic pathways, predicted operons, transport systems, and pathway-hole fillers. The BioCyc website offers a variety of tools for querying and analyzing PGDBs, including Omics Viewers and tools for comparative analysis. This article provides an update of new developments in MetaCyc and BioCyc during the last two years, including addition of Gibbs free energy values for compounds and reactions; redesign of the primary gene/protein page; addition of a tool for creating diagrams containing multiple linked pathways; several new search capabilities, including searching for genes based on sequence patterns, searching for databases based on an organism's phenotypes, and a cross-organism search; and a metabolite identifier translation service.


Subject(s)
Databases, Chemical , Enzymes/metabolism , Metabolic Networks and Pathways , Databases, Genetic , Electron Transport , Genome , Internet , Metabolic Networks and Pathways/genetics , Software
14.
Brief Bioinform ; 17(5): 877-90, 2016 09.
Article in English | MEDLINE | ID: mdl-26454094

ABSTRACT

Pathway Tools is a bioinformatics software environment with a broad set of capabilities. The software provides genome-informatics tools such as a genome browser, sequence alignments, a genome-variant analyzer and comparative-genomics operations. It offers metabolic-informatics tools, such as metabolic reconstruction, quantitative metabolic modeling, prediction of reaction atom mappings and metabolic route search. Pathway Tools also provides regulatory-informatics tools, such as the ability to represent and visualize a wide range of regulatory interactions. This article outlines the advances in Pathway Tools in the past 5 years. Major additions include components for metabolic modeling, metabolic route search, computation of atom mappings and estimation of compound Gibbs free energies of formation; addition of editors for signaling pathways, for genome sequences and for cellular architecture; storage of gene essentiality data and phenotype data; display of multiple alignments, and of signaling and electron-transport pathways; and development of Python and web-services application programming interfaces. Scientists around the world have created more than 9800 Pathway/Genome Databases by using Pathway Tools, many of which are curated databases for important model organisms.


Subject(s)
Genome , Computational Biology , Genomics , Internet , Metabolic Networks and Pathways , Software Design , Systems Biology
15.
Metabolites ; 5(2): 291-310, 2015 May 22.
Article in English | MEDLINE | ID: mdl-26011592

ABSTRACT

BioCyc.org is a genome and metabolic pathway web portal covering 5500 organisms, including Homo sapiens, Arabidopsis thaliana, Saccharomyces cerevisiae and Escherichia coli. These organism-specific databases have undergone variable degrees of curation. The EcoCyc (Escherichia coli Encyclopedia) database is the most highly curated; its contents have been derived from 27,000 publications. The MetaCyc (Metabolic Encyclopedia) database within BioCyc is a "universal" metabolic database that describes pathways, reactions, enzymes and metabolites from all domains of life. Metabolic pathways provide an organizing framework for analyzing metabolomics data, and the BioCyc website provides computational operations for metabolomics data that include metabolite search and translation of metabolite identifiers across multiple metabolite databases. The site allows researchers to store and manipulate metabolite lists using a facility called SmartTables, which supports metabolite enrichment analysis. That analysis operation identifies metabolite sets that are statistically over-represented for the substrates of specific metabolic pathways. BioCyc also enables visualization of metabolomics data on individual pathway diagrams and on the organism-specific metabolic map diagrams that are available for every BioCyc organism. Most of these operations are available both interactively and as programmatic web services.

16.
BMC Bioinformatics ; 15: 225, 2014 Jun 28.
Article in English | MEDLINE | ID: mdl-24972703

ABSTRACT

BACKGROUND: Flux Balance Analysis (FBA) is a genome-scale computational technique for modeling the steady-state fluxes of an organism's reaction network. When the organism's reaction network needs to be completed to obtain growth using FBA, without relying on the genome, the completion process is called reaction gap-filling. Currently, computational techniques used to gap-fill a reaction network compute the minimum set of reactions using Mixed-Integer Linear Programming (MILP). Depending on the number of candidate reactions used to complete the model, MILP can be computationally demanding. RESULTS: We present a computational technique, called FastGapFilling, that efficiently completes a reaction network by using only Linear Programming, not MILP. FastGapFilling creates a linear program with all candidate reactions, an objective function based on their weighted fluxes, and a variable weight on the biomass reaction: no integer variable is used. A binary search is performed by modifying the weight applied to the flux of the biomass reaction, and solving each corresponding linear program, to try reducing the number of candidate reactions to add to the network to generate a working model. We show that this method has proved effective on a series of incomplete E. coli and yeast models with, in some cases, a three orders of magnitude execution speedup compared with MILP. We have implemented FastGapFilling in MetaFlux as part of Pathway Tools (version 17.5), which is freely available to academic users, and for a fee to commercial users. Download from: biocyc.org/download.shtml. CONCLUSIONS: The computational technique presented is very efficient allowing interactive completion of reaction networks of FBA models. Computational techniques based on MILP cannot offer such fast and interactive completion.


Subject(s)
Genomics/methods , Metabolic Flux Analysis/methods , Metabolic Networks and Pathways , Algorithms , Escherichia coli/cytology , Escherichia coli/metabolism , Models, Biological , Yeasts/cytology , Yeasts/metabolism
17.
Bioinformatics ; 30(14): 2043-50, 2014 Jul 15.
Article in English | MEDLINE | ID: mdl-24642060

ABSTRACT

MOTIVATION: A key computational problem in metabolic engineering is finding efficient metabolic routes from a source to a target compound in genome-scale reaction networks, potentially considering the addition of new reactions. Efficiency can be based on many factors, such as route lengths, atoms conserved and the number of new reactions, and the new enzymes to catalyze them, added to the route. Fast algorithms are needed to systematically search these large genome-scale reaction networks. RESULTS: We present the algorithm used in the new RouteSearch tool within the Pathway Tools software. This algorithm is based on a general Branch-and-Bound search and involves constructing a network of atom mappings to facilitate efficient searching. As far as we know, it is the first published algorithm that finds guaranteed optimal routes where atom conservation is part of the optimality criteria. RouteSearch includes a graphical user interface that speeds user understanding of its search results. We evaluated the algorithm on five example metabolic-engineering problems from the literature; for one problem the published solution was equivalent to the optimal route found by RouteSearch; for the remaining four problems, RouteSearch found the published solution as one of its best-scored solutions. These problems were each solved in less than 5 s of computational time. AVAILABILITY AND IMPLEMENTATION: RouteSearch is accessible at BioCyc.org by using the menu command Metabolism --> Metabolic RouteSearch and by downloading Pathway Tools. Pathway Tools software is freely available to academic users, and for a fee to commercial users. Download from: http://biocyc.org/download.shtml.


Subject(s)
Algorithms , Metabolic Engineering/methods , Metabolic Networks and Pathways , Genome , Metabolic Networks and Pathways/genetics , Software
18.
Nucleic Acids Res ; 42(Database issue): D459-71, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24225315

ABSTRACT

The MetaCyc database (MetaCyc.org) is a comprehensive and freely accessible database describing metabolic pathways and enzymes from all domains of life. MetaCyc pathways are experimentally determined, mostly small-molecule metabolic pathways and are curated from the primary scientific literature. MetaCyc contains >2100 pathways derived from >37,000 publications, and is the largest curated collection of metabolic pathways currently available. BioCyc (BioCyc.org) is a collection of >3000 organism-specific Pathway/Genome Databases (PGDBs), each containing the full genome and predicted metabolic network of one organism, including metabolites, enzymes, reactions, metabolic pathways, predicted operons, transport systems and pathway-hole fillers. Additions to BioCyc over the past 2 years include YeastCyc, a PGDB for Saccharomyces cerevisiae, and 891 new genomes from the Human Microbiome Project. The BioCyc Web site offers a variety of tools for querying and analysis of PGDBs, including Omics Viewers and tools for comparative analysis. New developments include atom mappings in reactions, a new representation of glycan degradation pathways, improved compound structure display, better coverage of enzyme kinetic data, enhancements of the Web Groups functionality, improvements to the Omics viewers, a new representation of the Enzyme Commission system and, for the desktop version of the software, the ability to save display states.


Subject(s)
Databases, Chemical , Enzymes/metabolism , Metabolic Networks and Pathways , Enzymes/chemistry , Enzymes/classification , Gene Ontology , Genome , Internet , Kinetics , Metabolic Networks and Pathways/genetics , Polysaccharides/metabolism , Software
19.
Nucleic Acids Res ; 41(Database issue): D605-12, 2013 Jan.
Article in English | MEDLINE | ID: mdl-23143106

ABSTRACT

EcoCyc (http://EcoCyc.org) is a model organism database built on the genome sequence of Escherichia coli K-12 MG1655. Expert manual curation of the functions of individual E. coli gene products in EcoCyc has been based on information found in the experimental literature for E. coli K-12-derived strains. Updates to EcoCyc content continue to improve the comprehensive picture of E. coli biology. The utility of EcoCyc is enhanced by new tools available on the EcoCyc web site, and the development of EcoCyc as a teaching tool is increasing the impact of the knowledge collected in EcoCyc.


Subject(s)
Databases, Genetic , Escherichia coli K12/genetics , Binding Sites , Escherichia coli K12/metabolism , Escherichia coli Proteins/classification , Escherichia coli Proteins/metabolism , Gene Expression Regulation, Bacterial , Internet , Membrane Transport Proteins/classification , Membrane Transport Proteins/metabolism , Models, Genetic , Molecular Sequence Annotation , Phenotype , Position-Specific Scoring Matrices , Promoter Regions, Genetic , Systems Biology , Transcription Factors/metabolism , Transcription, Genetic
20.
J Chem Inf Model ; 52(11): 2970-82, 2012 Nov 26.
Article in English | MEDLINE | ID: mdl-22963657

ABSTRACT

The complete atom mapping of a chemical reaction is a bijection of the reactant atoms to the product atoms that specifies the terminus of each reactant atom. Atom mapping of biochemical reactions is useful for many applications of systems biology, in particular for metabolic engineering where synthesizing new biochemical pathways has to take into account for the number of carbon atoms from a source compound that are conserved in the synthesis of a target compound. Rapid, accurate computation of the atom mapping(s) of a biochemical reaction remains elusive despite significant work on this topic. In particular, past researchers did not validate the accuracy of mapping algorithms. We introduce a new method for computing atom mappings called the minimum weighted edit-distance (MWED) metric. The metric is based on bond propensity to react and computes biochemically valid atom mappings for a large percentage of biochemical reactions. MWED models can be formulated efficiently as Mixed-Integer Linear Programs (MILPs). We have demonstrated this approach on 7501 reactions of the MetaCyc database for which 87% of the models could be solved in less than 10 s. For 2.1% of the reactions, we found multiple optimal atom mappings. We show that the error rate is 0.9% (22 reactions) by comparing these atom mappings to 2446 atom mappings of the manually curated Kyoto Encyclopedia of Genes and Genomes (KEGG) RPAIR database. To our knowledge, our computational atom-mapping approach is the most accurate and among the fastest published to date. The atom-mapping data will be available in the MetaCyc database later in 2012; the atom-mapping software will be available within the Pathway Tools software later in 2012.


Subject(s)
Algorithms , Carbon/chemistry , Metabolic Networks and Pathways , Software , Carbon/metabolism , Databases, Chemical , Kinetics , Metabolic Engineering , Models, Chemical , Stereoisomerism , Systems Biology , Thermodynamics
SELECTION OF CITATIONS
SEARCH DETAIL
...