Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 43
Filter
Add more filters










Publication year range
1.
Bioinformatics ; 39(2)2023 02 03.
Article in English | MEDLINE | ID: mdl-36759942

ABSTRACT

MOTIVATION: Knowledge graphs (KGs) are being adopted in industry, commerce and academia. Biomedical KG presents a challenge due to the complexity, size and heterogeneity of the underlying information. RESULTS: In this work, we present the Scalable Precision Medicine Open Knowledge Engine (SPOKE), a biomedical KG connecting millions of concepts via semantically meaningful relationships. SPOKE contains 27 million nodes of 21 different types and 53 million edges of 55 types downloaded from 41 databases. The graph is built on the framework of 11 ontologies that maintain its structure, enable mappings and facilitate navigation. SPOKE is built weekly by python scripts which download each resource, check for integrity and completeness, and then create a 'parent table' of nodes and edges. Graph queries are translated by a REST API and users can submit searches directly via an API or a graphical user interface. Conclusions/Significance: SPOKE enables the integration of seemingly disparate information to support precision medicine efforts. AVAILABILITY AND IMPLEMENTATION: The SPOKE neighborhood explorer is available at https://spoke.rbvi.ucsf.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Pattern Recognition, Automated , Precision Medicine , Databases, Factual
2.
Protein Sci ; 30(1): 70-82, 2021 01.
Article in English | MEDLINE | ID: mdl-32881101

ABSTRACT

UCSF ChimeraX is the next-generation interactive visualization program from the Resource for Biocomputing, Visualization, and Informatics (RBVI), following UCSF Chimera. ChimeraX brings (a) significant performance and graphics enhancements; (b) new implementations of Chimera's most highly used tools, many with further improvements; (c) several entirely new analysis features; (d) support for new areas such as virtual reality, light-sheet microscopy, and medical imaging data; (e) major ease-of-use advances, including toolbars with icons to perform actions with a single click, basic "undo" capabilities, and more logical and consistent commands; and (f) an app store for researchers to contribute new tools. ChimeraX includes full user documentation and is free for noncommercial use, with downloads available for Windows, Linux, and macOS from https://www.rbvi.ucsf.edu/chimerax.


Subject(s)
Computer Graphics , Imaging, Three-Dimensional , Models, Molecular , Software
3.
Protein Sci ; 27(1): 14-25, 2018 01.
Article in English | MEDLINE | ID: mdl-28710774

ABSTRACT

UCSF ChimeraX is next-generation software for the visualization and analysis of molecular structures, density maps, 3D microscopy, and associated data. It addresses challenges in the size, scope, and disparate types of data attendant with cutting-edge experimental methods, while providing advanced options for high-quality rendering (interactive ambient occlusion, reliable molecular surface calculations, etc.) and professional approaches to software design and distribution. This article highlights some specific advances in the areas of visualization and usability, performance, and extensibility. ChimeraX is free for noncommercial use and is available from http://www.rbvi.ucsf.edu/chimerax/ for Windows, Mac, and Linux.


Subject(s)
Imaging, Three-Dimensional , Software , Molecular Structure
5.
Database (Oxford) ; 2017(1)2017 01 01.
Article in English | MEDLINE | ID: mdl-28365730

ABSTRACT

With ever-increasing amounts of sequence data available in both the primary literature and sequence repositories, there is a bottleneck in annotating molecular function to a sequence. This article describes the biocuration process and methods used in the structure-function linkage database (SFLD) to help address some of the challenges. We discuss how the hierarchy within the SFLD allows us to infer detailed functional properties for functionally diverse enzyme superfamilies in which all members are homologous, conserve an aspect of their chemical function and have associated conserved structural features that enable the chemistry. Also presented is the Enzyme Structure-Function Ontology (ESFO), which has been designed to capture the relationships between enzyme sequence, structure and function that underlie the SFLD and is used to guide the biocuration processes within the SFLD. Database URL: http://sfld.rbvi.ucsf.edu/.


Subject(s)
Databases, Protein , Enzymes/chemistry , Enzymes/genetics , Gene Ontology , Molecular Sequence Annotation , Structural Homology, Protein , Structure-Activity Relationship
6.
Bioinformatics ; 31(9): 1484-6, 2015 May 01.
Article in English | MEDLINE | ID: mdl-25540183

ABSTRACT

MOTIVATION: Contact maps are a convenient method for the structural biologists to identify structural features through two-dimensional simplification. Binary (yes/no) contact maps with a single cutoff distance can be generalized to show continuous distance ranges. We have developed a UCSF Chimera tool, RRDistMaps, to compute such generalized maps in order to analyze pairwise variations in intramolecular contacts. An interactive utility, RRDistMaps, visualizes conformational changes, both local (e.g. binding-site residues) and global (e.g. hinge motion), between unbound and bound proteins through distance patterns. Users can target residue pairs in RRDistMaps for further navigation in Chimera. The interface contains the unique features of identifying long-range residue motion and aligning sequences to simultaneously compare distance maps. AVAILABILITY AND IMPLEMENTATION: RRDistMaps was developed as part of UCSF Chimera release 1.10, which is freely available at http://rbvi.ucsf.edu/chimera/download.html, and operates on Linux, Windows, and Mac OS. CONTACT: conrad@cgl.ucsf.edu.


Subject(s)
Protein Conformation , Software , Binding Sites , Models, Molecular , Protein Binding , Proteins/chemistry
7.
Nucleic Acids Res ; 42(Web Server issue): W478-84, 2014 Jul.
Article in English | MEDLINE | ID: mdl-24861624

ABSTRACT

Integrating access to web services with desktop applications allows for an expanded set of application features, including performing computationally intensive tasks and convenient searches of databases. We describe how we have enhanced UCSF Chimera (http://www.rbvi.ucsf.edu/chimera/), a program for the interactive visualization and analysis of molecular structures and related data, through the addition of several web services (http://www.rbvi.ucsf.edu/chimera/docs/webservices.html). By streamlining access to web services, including the entire job submission, monitoring and retrieval process, Chimera makes it simpler for users to focus on their science projects rather than data manipulation. Chimera uses Opal, a toolkit for wrapping scientific applications as web services, to provide scalable and transparent access to several popular software packages. We illustrate Chimera's use of web services with an example workflow that interleaves use of these services with interactive manipulation of molecular sequences and structures, and we provide an example Python program to demonstrate how easily Opal-based web services can be accessed from within an application. Web server availability: http://webservices.rbvi.ucsf.edu/opal2/dashboard?command=serviceList.


Subject(s)
Molecular Structure , Software , Internet , Models, Molecular
8.
Nucleic Acids Res ; 42(Database issue): D521-30, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24271399

ABSTRACT

The Structure-Function Linkage Database (SFLD, http://sfld.rbvi.ucsf.edu/) is a manually curated classification resource describing structure-function relationships for functionally diverse enzyme superfamilies. Members of such superfamilies are diverse in their overall reactions yet share a common ancestor and some conserved active site features associated with conserved functional attributes such as a partial reaction. Thus, despite their different functions, members of these superfamilies 'look alike', making them easy to misannotate. To address this complexity and enable rational transfer of functional features to unknowns only for those members for which we have sufficient functional information, we subdivide superfamily members into subgroups using sequence information, and lastly into families, sets of enzymes known to catalyze the same reaction using the same mechanistic strategy. Browsing and searching options in the SFLD provide access to all of these levels. The SFLD offers manually curated as well as automatically classified superfamily sets, both accompanied by search and download options for all hierarchical levels. Additional information includes multiple sequence alignments, tab-separated files of functional and other attributes, and sequence similarity networks. The latter provide a new and intuitively powerful way to visualize functional trends mapped to the context of sequence similarity.


Subject(s)
Databases, Protein , Enzymes/chemistry , Enzymes/classification , Enzymes/metabolism , Internet , Molecular Sequence Annotation , Sequence Alignment , Structure-Activity Relationship
9.
J Struct Biol ; 179(3): 269-78, 2012 Sep.
Article in English | MEDLINE | ID: mdl-21963794

ABSTRACT

Structural modeling of macromolecular complexes greatly benefits from interactive visualization capabilities. Here we present the integration of several modeling tools into UCSF Chimera. These include comparative modeling by MODELLER, simultaneous fitting of multiple components into electron microscopy density maps by IMP MultiFit, computing of small-angle X-ray scattering profiles and fitting of the corresponding experimental profile by IMP FoXS, and assessment of amino acid sidechain conformations based on rotamer probabilities and local interactions by Chimera.


Subject(s)
Computer Simulation , Models, Molecular , Software , Amino Acid Sequence , Animals , Cattle , Escherichia coli Proteins/chemistry , Heat-Shock Proteins/chemistry , Macromolecular Substances/chemistry , Molecular Sequence Data , Protein Conformation , Protein Subunits/chemistry , Scattering, Small Angle , Structural Homology, Protein , X-Ray Diffraction
10.
Nucleic Acids Res ; 39(Database issue): D465-74, 2011 Jan.
Article in English | MEDLINE | ID: mdl-21097780

ABSTRACT

ModBase (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by ModPipe, an automated modeling pipeline that relies primarily on Modeller for fold assignment, sequence-structure alignment, model building and model assessment (http://salilab.org/modeller/). ModBase currently contains 10,355,444 reliable models for domains in 2,421,920 unique protein sequences. ModBase allows users to update comparative models on demand, and request modeling of additional sequences through an interface to the ModWeb modeling server (http://salilab.org/modweb). ModBase models are available through the ModBase interface as well as the Protein Model Portal (http://www.proteinmodelportal.org/). Recently developed associated resources include the SALIGN server for multiple sequence and structure alignment (http://salilab.org/salign), the ModEval server for predicting the accuracy of protein structure models (http://salilab.org/modeval), the PCSS server for predicting which peptides bind to a given protein (http://salilab.org/pcss) and the FoXS server for calculating and fitting Small Angle X-ray Scattering profiles (http://salilab.org/foxs).


Subject(s)
Databases, Protein , Models, Molecular , Protein Structure, Tertiary , Bacterial Proteins/chemistry , Computer Graphics , Peptides/chemistry , Protein Interaction Mapping , Proteins/chemistry , Scattering, Small Angle , Sequence Alignment , Software , Structural Homology, Protein , User-Computer Interface , X-Ray Diffraction
11.
J Pharmacol Exp Ther ; 325(3): 859-68, 2008 Jun.
Article in English | MEDLINE | ID: mdl-18364470

ABSTRACT

ABCC4 encodes multidrug resistance protein 4 (MRP4), a member of the ATP-binding cassette family of membrane transporters involved in the efflux of endogenous and xenobiotic molecules. The aims of this study were to identify single nucleotide polymorphisms of ABCC4 and to functionally characterize selected nonsynonymous variants. Resequencing was performed in a large ethnically diverse population. Ten nonsynonymous variants were selected for analysis of transport function based on allele frequencies and evolutionary conservation. The reference and variant MRP4 cDNAs were constructed by site-directed mutagenesis and transiently transfected into human embryonic kidney cells (HEK 293T). The function of MRP4 variants was compared by measuring the intracellular accumulation of two antiviral agents, azidothymidine (AZT) and adefovir (PMEA). A total of 98 variants were identified in the coding and flanking intronic regions of ABCC4. Of these, 43 variants are in the coding region, and 22 are nonsynonymous. In a functional screen of ten variants, there was no evidence for a complete loss of function allele. However, two variants (G187W and G487E) showed a significantly reduced function compared to reference with both substrates, as evidenced by higher intracellular accumulation of AZT and PMEA compared to the reference MRP4 (43 and 69% increase in accumulation for G187W compared with the reference MRP4, with AZT and PMEA, respectively). The G187W variant also showed decreased expression following transient transfection of HEK 293T cells. Further studies are required to assess the clinical significance of this altered function and expression and to evaluate substrate specificity of this functional change.


Subject(s)
Adenine/analogs & derivatives , Antiviral Agents/metabolism , Multidrug Resistance-Associated Proteins/genetics , Multidrug Resistance-Associated Proteins/metabolism , Organophosphonates/metabolism , Polymorphism, Single Nucleotide , Zidovudine/metabolism , Adenine/metabolism , Base Sequence , California , Cell Line , Ethnicity/genetics , Haplotypes , Humans , Molecular Sequence Data , Sequence Alignment , Sequence Analysis, DNA , White People/genetics
12.
Pharmacogenet Genomics ; 17(9): 773-82, 2007 Sep.
Article in English | MEDLINE | ID: mdl-17700366

ABSTRACT

BACKGROUND: OCTN1 is a multispecific transporter of organic cations and zwitterions, including several clinically important drugs as well as the antioxidant ergothioneine. OCTN1 is highly expressed in the kidney, where it is thought to aid in active secretion of organic cations, and may facilitate the active reabsorption of ergothioneine. Genetic variation in OCTN1 may help to explain interindividual variability in the pharmacokinetics of many cationic or zwitterionic drugs. METHODS: We screened for human genetic variants in the OCTN1 coding region by direct sequencing in a large sample (n=270) of ethnically diverse healthy volunteers. RESULTS: Six protein sequence-altering variants were identified, including five-amino-acid substitutions and one nonsense mutation. Two of the variants, T306I and L503F, were polymorphic, occurring at frequencies of 37 and 19%, respectively, in the total sample. Allele frequencies are varied by ethnicity. In biochemical assays, two of the variants (D165G and R282X) resulted in complete loss of transport function, and one variant (M205I) caused a reduction in activity to approximately 50% of the reference sequence protein. One variant, L503F, showed altered substrate specificity; this variant occurred at particularly high allele frequency (42%) in the European-American participants in our sample. Subcellular localization and ergothioneine inhibition kinetics were similar among the common amino-acid sequence variants of OCTN1. CONCLUSIONS: The common OCTN1-L503F variant may explain a significant amount of population variation in the pharmacokinetics of OCTN1 substrate drugs. The rare loss-of-function variants provide a rational tool for studying the importance of ergothioneine in humans in vivo.


Subject(s)
Organic Cation Transport Proteins/genetics , Amino Acid Substitution , Cell Line , Codon, Nonsense , Green Fluorescent Proteins/genetics , Green Fluorescent Proteins/metabolism , Humans , Models, Molecular , Organic Cation Transport Proteins/chemistry , Organic Cation Transport Proteins/metabolism , Pharmacogenetics , Polymorphism, Genetic , Protein Structure, Secondary , RNA, Messenger/genetics , RNA, Messenger/metabolism , Recombinant Fusion Proteins/genetics , Recombinant Fusion Proteins/metabolism , Symporters , Transfection
13.
Bioinformatics ; 23(17): 2345-7, 2007 Sep 01.
Article in English | MEDLINE | ID: mdl-17623706

ABSTRACT

structureViz is a Cytoscape plug-in that links the visualization of biological networks provided by Cytoscape with the visualization and analysis of macromolecular structures and sequences provided by UCSF Chimera. When combined with Cytoscape and Chimera, structureViz provides the first tool that links these two critical aspects of computational analysis in a straightforward manner. structureViz includes commands to open structures in Chimera and align them using Chimera's sequence-structure analysis tools. When a structure is opened, structureViz provides an alternative interface to Chimera: the Cytoscape Molecular Structure Navigator. This interface uses a tree-based paradigm to allow users to select and affect the display of models, chains and residues, mostly through the use of context menus.


Subject(s)
Computer Graphics , Protein Interaction Mapping/methods , Proteome/chemistry , Proteome/metabolism , Signal Transduction/physiology , Software , User-Computer Interface , Computer Simulation , Models, Biological , Sequence Analysis, Protein/methods , Systems Integration
14.
J Struct Biol ; 157(1): 281-7, 2007 Jan.
Article in English | MEDLINE | ID: mdl-16963278

ABSTRACT

We describe methods for interactive visualization and analysis of density maps available in the UCSF Chimera molecular modeling package. The methods enable segmentation, fitting, coarse modeling, measuring and coloring of density maps for elucidating structures of large molecular assemblies such as virus particles, ribosomes, microtubules, and chromosomes. The methods are suitable for density maps with resolutions in the range spanned by electron microscope single particle reconstructions and tomography. All of the tools described are simple, robust and interactive, involving computations taking only seconds. An advantage of the UCSF Chimera package is its integration of a large collection of interactive methods. Interactive tools are sufficient for performing simple analyses and also serve to prepare input for and examine results from more complex, specialized, and algorithmic non-interactive analysis software. While both interactive and non-interactive analyses are useful, we discuss only interactive methods here.


Subject(s)
Computer Graphics , Image Enhancement/methods , Imaging, Three-Dimensional/methods , Software , Bacteriophage T4/chemistry , Computational Biology , Microscopy, Electron/methods , Models, Molecular , Molecular Conformation , Myosins/chemistry
15.
BMC Genomics ; 7: 236, 2006 Sep 18.
Article in English | MEDLINE | ID: mdl-16982004

ABSTRACT

BACKGROUND: Gene knockouts in a model organism such as mouse provide a valuable resource for the study of basic biology and human disease. Determining which gene has been inactivated by an untargeted gene trapping event poses a challenging annotation problem because gene trap sequence tags, which represent sequence near the vector insertion site of a trapped gene, are typically short and often contain unresolved residues. To understand better the localization of these sequences on the mouse genome, we compared stand-alone versions of the alignment programs BLAT, SSAHA, and MegaBLAST. A set of 3,369 sequence tags was aligned to build 34 of the mouse genome using default parameters for each algorithm. Known genome coordinates for the cognate set of full-length genes (1,659 sequences) were used to evaluate localization results. RESULTS: In general, all three programs performed well in terms of localizing sequences to a general region of the genome, with only relatively subtle errors identified for a small proportion of the sequence tags. However, large differences in performance were noted with regard to correctly identifying exon boundaries. BLAT correctly identified the vast majority of exon boundaries, while SSAHA and MegaBLAST missed the majority of exon boundaries. SSAHA consistently reported the fewest false positives and is the fastest algorithm. MegaBLAST was comparable to BLAT in speed, but was the most susceptible to localizing sequence tags incorrectly to pseudogenes. CONCLUSION: The differences in performance for sequence tags and full-length reference sequences were surprisingly small. Characteristic variations in localization results for each program were noted that affect the localization of sequence at exon boundaries, in particular.


Subject(s)
Algorithms , Computational Biology/methods , Genome/genetics , Sequence Alignment/methods , Animals , Cell Cycle Proteins/genetics , Chromatin Assembly Factor-1 , Chromosomal Proteins, Non-Histone/genetics , Chromosome Mapping/methods , DNA-Binding Proteins/genetics , Exons/genetics , Mice , Nuclear Proteins/genetics , Nucleotides/genetics , Pseudogenes/genetics , Reproducibility of Results , Time Factors
16.
BMC Bioinformatics ; 7: 339, 2006 Jul 12.
Article in English | MEDLINE | ID: mdl-16836757

ABSTRACT

BACKGROUND: Comparing related structures and viewing the structures in the context of sequence alignments are important tasks in protein structure-function research. While many programs exist for individual aspects of such work, there is a need for interactive visualization tools that: (a) provide a deep integration of sequence and structure, far beyond mapping where a sequence region falls in the structure and vice versa; (b) facilitate changing data of one type based on the other (for example, using only sequence-conserved residues to match structures, or adjusting a sequence alignment based on spatial fit); (c) can be used with a researcher's own data, including arbitrary sequence alignments and annotations, closely or distantly related sets of proteins, etc.; and (d) interoperate with each other and with a full complement of molecular graphics features. We describe enhancements to UCSF Chimera to achieve these goals. RESULTS: The molecular graphics program UCSF Chimera includes a suite of tools for interactive analyses of sequences and structures. Structures automatically associate with sequences in imported alignments, allowing many kinds of crosstalk. A novel method is provided to superimpose structures in the absence of a pre-existing sequence alignment. The method uses both sequence and secondary structure, and can match even structures with very low sequence identity. Another tool constructs structure-based sequence alignments from superpositions of two or more proteins. Chimera is designed to be extensible, and mechanisms for incorporating user-specific data without Chimera code development are also provided. CONCLUSION: The tools described here apply to many problems involving comparison and analysis of protein structures and their sequences. Chimera includes complete documentation and is intended for use by a wide range of scientists, not just those in the computational disciplines. UCSF Chimera is free for non-commercial use and is available for Microsoft Windows, Apple Mac OS X, Linux, and other platforms from http://www.cgl.ucsf.edu/chimera.


Subject(s)
Databases, Protein , Models, Chemical , Proteins/chemistry , Sequence Alignment/methods , Sequence Analysis, Protein/methods , Software , User-Computer Interface , Computer Graphics , Computer Simulation , Database Management Systems , Models, Molecular , Pattern Recognition, Automated/methods , Protein Conformation , Proteins/classification , Proteins/genetics , Proteins/ultrastructure , Structure-Activity Relationship , Systems Integration
17.
J Pharmacol Exp Ther ; 318(2): 521-9, 2006 Aug.
Article in English | MEDLINE | ID: mdl-16702441

ABSTRACT

Methotrexate (MTX) is used in patients with malignant and autoimmune diseases. This drug is primarily excreted unchanged in the urine, and its net excretion occurs via active secretory and reabsorptive processes. We characterized the interaction of MTX with human organic-anion transporting polypeptide transporter (OATP) 1A2, which is expressed in tissues important for MTX disposition and toxicity, such as the intestine, kidney, liver, and endothelial cells of the blood-brain barrier. In Xenopus laevis oocytes expressing OATP1A2, the uptake of the model substrate, estrone-3-sulfate (ES), was enhanced 30-fold compared with uninjected oocytes. MTX uptake in oocytes expressing OATP1A2 was saturable (Km = 457 +/- 118 microM; Vmax = 17.5 +/- 4.9 pmol/oocyte/60 min) and sensitive to extracellular pH. That is, acidic pHs stimulated MTX uptake by as much as 7-fold. Seven novel protein-altering variants were identified in 270 ethnically diverse DNA samples. Four protein-altering variants in OATP1A2 exhibited altered transport of ES and/or MTX. The common variant, protein reference sequence (p.) Ile13Thr, was hyperfunctional for ES and MTX and showed a 2-fold increase in the V(max) for ES. The common variant, p. Glu172Asp, exhibited reduced maximal transport capacity for ES and MTX. p. Arg168Cys was hypofunctional, and p. Asn277DEL was nonfunctional. Because of its expression on the apical membrane of the distal tubule and in tissues relevant to MTX disposition and toxicity, these findings suggest that OATP1A2 may play a role in active tubular reabsorption of MTX and in MTX-induced toxicities. Furthermore, genetic variation in OATP1A2 may contribute to variation in MTX disposition and response.


Subject(s)
Folic Acid Antagonists/pharmacology , Methotrexate/pharmacology , Organic Anion Transporters/drug effects , Organic Anion Transporters/genetics , Alleles , Amino Acid Sequence , Animals , Biological Transport, Active/drug effects , Folic Acid Antagonists/pharmacokinetics , Genetic Variation , Humans , Hydrogen-Ion Concentration , Methotrexate/analogs & derivatives , Methotrexate/pharmacokinetics , Molecular Sequence Data , Oocytes/metabolism , Xenopus laevis
SELECTION OF CITATIONS
SEARCH DETAIL
...