Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 22
Filter
Add more filters










Publication year range
1.
Yakugaku Zasshi ; 144(5): 539-543, 2024.
Article in Japanese | MEDLINE | ID: mdl-38692930

ABSTRACT

Researchers collect data and use various methods to organize it. Ensuring the reliability and reproducibility of data is crucial, and collaboration across different research fields is on the rise. However, when there is geographical distance, sharing data becomes a challenging task. Therefore, there is a need for the development of a mechanism for sharing data on the web. We have developed an integrated database to facilitate the sharing and management of research data, particularly focusing on small molecules. The integrated database serves as a platform for centralizing data related to small molecules, including their chemical structures, wet lab experimental data, simulation data, and more. It has been constructed as a web application, offering features such as library management for small molecules, registration and viewing of wet lab experiment results, generation of initial conformations for simulations, and data visualization. This enables researchers to efficiently share their research data and collaborate seamlessly, whether within their research group or via cloud-based access that allows project and team members to connect from anywhere. This integrated database plays a critical role in connecting wet lab experiments and simulations, enabling researchers to cross-reference and analyze experimental data comprehensively. It serves as an essential tool to advance research and foster idea generation.


Subject(s)
Databases, Factual , Computer Simulation , Information Dissemination , Internet , Reproducibility of Results , Small Molecule Libraries
2.
J Biol Chem ; 300(2): 105624, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38176651

ABSTRACT

The glycosylation of proteins and lipids is known to be closely related to the mechanisms of various diseases such as influenza, cancer, and muscular dystrophy. Therefore, it has become clear that the analysis of post-translational modifications of proteins, including glycosylation, is important to accurately understand the functions of each protein molecule and the interactions among them. In order to conduct large-scale analyses more efficiently, it is essential to promote the accumulation, sharing, and reuse of experimental and analytical data in accordance with the FAIR (Findability, Accessibility, Interoperability, and Re-usability) data principles. However, a FAIR data repository for storing and sharing glycoconjugate information, including glycopeptides and glycoproteins, in a standardized format did not exist. Therefore, we have developed GlyComb (https://glycomb.glycosmos.org) as a new standardized data repository for glycoconjugate data. Currently, GlyComb can assign a unique identifier to a set of glycosylation information associated with a specific peptide sequence or UniProt ID. By standardizing glycoconjugate data via GlyComb identifiers and coordinating with existing web resources such as GlyTouCan and GlycoPOST, a comprehensive system for data submission and data sharing among researchers can be established. Here we introduce how GlyComb is able to integrate the variety of glycoconjugate data already registered in existing data repositories to obtain a better understanding of the available glycopeptides and glycoproteins, and their glycosylation patterns. We also explain how this system can serve as a foundation for a better understanding of glycan function.


Subject(s)
Databases, Chemical , Glycomics , Proteomics , Glycopeptides/metabolism , Glycoproteins/metabolism , Glycosylation , Polysaccharides/metabolism , Databases, Genetic
3.
Glycobiology ; 33(6): 454-463, 2023 06 21.
Article in English | MEDLINE | ID: mdl-37129482

ABSTRACT

The GlyCosmos Glycoscience Portal (https://glycosmos.org) and PubChem (https://pubchem.ncbi.nlm.nih.gov/) are major portals for glycoscience and chemistry, respectively. GlyCosmos is a portal for glycan-related repositories, including GlyTouCan, GlycoPOST, and UniCarb-DR, as well as for glycan-related data resources that have been integrated from a variety of 'omics databases. Glycogenes, glycoproteins, lectins, pathways, and disease information related to glycans are accessible from GlyCosmos. PubChem, on the other hand, is a chemistry-based portal at the National Center for Biotechnology Information. PubChem provides information not only on chemicals, but also genes, proteins, pathways, as well as patents, bioassays, and more, from hundreds of data resources from around the world. In this work, these 2 portals have made substantial efforts to integrate their complementary data to allow users to cross between these 2 domains. In addition to glycan structures, key information, such as glycan-related genes, relevant diseases, glycoproteins, and pathways, was integrated and cross-linked with one another. The interfaces were designed to enable users to easily find, access, download, and reuse data of interest across these resources. Use cases are described illustrating and highlighting the type of content that can be investigated. In total, these integrations provide life science researchers improved awareness and enhanced access to glycan-related information.


Subject(s)
Databases, Chemical , Polysaccharides , Glycosylation , Workflow , Informatics , Polysaccharides/chemistry , Glycoconjugates/chemistry
5.
Glycobiology ; 32(8): 646-650, 2022 07 13.
Article in English | MEDLINE | ID: mdl-35452093

ABSTRACT

High-performance liquid chromatography (HPLC) elution data provide a useful tool for quantitative glycosylation profiling, discriminating isomeric oligosaccharides. The web application Glycoanalysis by the Three Axes of MS and Chromatography (GALAXY), which is based on the three-dimensional HPLC map of N-linked oligosaccharides with pyridyl-2-amination developed by Dr. Noriko Takahashi, has been extensively used for N-glycosylation profiling at molecular, cellular, and tissue levels. Herein, we describe the updated GALAXY as version 3, which includes new HPLC data including those of glucuronylated and sulfated glycans, an improved graphical user interface using modern technologies, and linked to glycan information in GlyTouCan and the GlyCosmos Portal. This liaison will facilitate glycomic analyses of human and other organisms in conjunction with multiomics data.


Subject(s)
Oligosaccharides , Polysaccharides , Chromatography, High Pressure Liquid/methods , Glycosylation , Humans , Oligosaccharides/chemistry , Polysaccharides/chemistry
7.
Molecules ; 26(23)2021 Nov 25.
Article in English | MEDLINE | ID: mdl-34885724

ABSTRACT

In life science fields, database integration is progressing and contributing to collaboration between different research fields, including the glycosciences. The integration of glycan databases has greatly progressed collaboration worldwide with the development of the international glycan structure repository, GlyTouCan. This trend has increased the need for a tool by which researchers in various fields can easily search glycan structures from integrated databases. We have developed a web-based glycan structure search tool, SugarDrawer, which supports the depiction of glycans including ambiguity, such as glycan fragments which contain underdetermined linkages, and a database search for glycans drawn on the canvas. This tool provides an easy editing feature for various glycan structures in just a few steps using template structures and pop-up windows which allow users to select specific information for each structure element. This tool has a unique feature for selecting possible attachment sites, which is defined in the Symbol Nomenclature for Glycans (SNFG). In addition, this tool can input and output glycans in WURCS and GlycoCT formats, which are the most commonly-used text formats for glycan structures.


Subject(s)
Databases, Factual , Internet , Polysaccharides/genetics , Software , Biological Science Disciplines , Humans , Polysaccharides/chemistry , Polysaccharides/classification , Polysaccharides/ultrastructure
8.
Glycobiology ; 31(7): 741-750, 2021 08 07.
Article in English | MEDLINE | ID: mdl-33677548

ABSTRACT

Recent years have seen great advances in the development of glycoproteomics protocols and methods resulting in a sustainable increase in the reporting proteins, their attached glycans and glycosylation sites. However, only very few of these reports find their way into databases or data repositories. One of the major reasons is the absence of digital standard to represent glycoproteins and the challenging annotations with glycans. Depending on the experimental method, such a standard must be able to represent glycans as complete structures or as compositions, store not just single glycans but also represent glycoforms on a specific glycosylation side, deal with partially missing site information if no site mapping was performed, and store abundances or ratios of glycans within a glycoform of a specific site. To support the above, we have developed the GlycoConjugate Ontology (GlycoCoO) as a standard semantic framework to describe and represent glycoproteomics data. GlycoCoO can be used to represent glycoproteomics data in triplestores and can serve as a basis for data exchange formats. The ontology, database providers and supporting documentation are available online (https://github.com/glycoinfo/GlycoCoO).


Subject(s)
Glycoproteins , Polysaccharides , Glycoproteins/metabolism , Glycosylation , Polysaccharides/metabolism
9.
Structure ; 29(4): 393-400.e1, 2021 04 01.
Article in English | MEDLINE | ID: mdl-33657417

ABSTRACT

The Worldwide Protein Data Bank (wwPDB) has provided validation reports based on recommendations from community Validation Task Forces for structures in the PDB since 2013. To further enhance validation of small molecules as recommended from the 2016 Ligand Validation Workshop, wwPDB, Global Phasing Ltd., and the Noguchi Institute, recently formed a public/private partnership to incorporate some of their software tools into the wwPDB validation package. Augmented wwPDB validation report features include: two-dimensional (2D) diagrams of small-molecule ligands and carbohydrates, highlighting geometric validation outcomes; 2D topological diagrams of oligosaccharides present in branched entities generated using 2D Symbol Nomenclature for Glycan representation; and views of 3D electron density maps for ligands and carbohydrates, illustrating the goodness-of-fit between the atomic structure and experimental data (X-ray crystallographic structures only). These improvements will impact confidence in ligand conformation and ligand-macromolecular interactions that will aid in understanding biochemical function and contribute to small-molecule drug discovery.


Subject(s)
Carbohydrates/chemistry , Databases, Protein/standards , Molecular Docking Simulation/methods , Proteomics/methods , Small Molecule Libraries/chemistry , Cheminformatics/methods , Databases, Chemical/standards , Humans , Ligands , Protein Binding , Proteome/chemistry , Proteome/metabolism
10.
Nucleic Acids Res ; 49(D1): D1529-D1533, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33125071

ABSTRACT

Glycans serve important roles in signaling events and cell-cell communication, and they are recognized by lectins, viruses and bacteria, playing a variety of roles in many biological processes. However, there was no system to organize the plethora of glycan-related data in the literature. Thus GlyTouCan (https://glytoucan.org) was developed as the international glycan repository, allowing researchers to assign accession numbers to glycans. This also aided in the integration of glycan data across various databases. GlyTouCan assigns accession numbers to glycans which are defined as sets of monosaccharides, which may or may not be characterized with linkage information. GlyTouCan was developed to be able to recognize any level of ambiguity in glycans and uniquely assign accession numbers to each of them, regardless of the input text format. In this manuscript, we describe the latest update to GlyTouCan in version 3.0, its usage, and plans for future development.


Subject(s)
Computational Biology/methods , Databases, Factual , Polysaccharides/classification , Software , Humans , International Cooperation , Internet , Polysaccharides/analysis , Polysaccharides/chemistry , Terminology as Topic
12.
Bioinformatics ; 35(14): 2434-2440, 2019 07 15.
Article in English | MEDLINE | ID: mdl-30535258

ABSTRACT

MOTIVATION: Glycans are biomolecules that take an important role in the biological processes of living organisms. They form diverse, complicated structures such as branched and cyclic forms. Web3 Unique Representation of Carbohydrate Structures (WURCS) was proposed as a new linear notation for uniquely representing glycans during the GlyTouCan project. WURCS defines rules for complex glycan structures that other text formats did not support, and so it is possible to represent a wide variety glycans. However, WURCS uses a complicated nomenclature, so it is not human-readable. Therefore, we aimed to support the interpretation of WURCS by converting WURCS to the most basic and widely used format IUPAC. RESULTS: In this study, we developed GlycanFormatConverter and succeeded in converting WURCS to the three kinds of IUPAC formats (IUPAC-Extended, IUPAC-Condensed and IUPAC-Short). Furthermore, we have implemented functionality to import IUPAC-Extended, KEGG Chemical Function (KCF) and LinearCode formats and to export WURCS. We have thoroughly tested our GlycanFormatConverter and were able to show that it was possible to convert all the glycans registered in the GlyTouCan repository, with exceptions owing only to the limitations of the original format. The source code for this conversion tool has been released as an open source tool. AVAILABILITY AND IMPLEMENTATION: https://github.com/glycoinfo/GlycanFormatConverter.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Software , Polysaccharides
13.
Glycobiology ; 27(10): 915-919, 2017 10 01.
Article in English | MEDLINE | ID: mdl-28922742

ABSTRACT

Rapid and continued growth in the generation of glycomic data has revealed the need for enhanced development of basic infrastructure for presenting and interpreting these datasets in a manner that engages the broader biomedical research community. Early in their growth, the genomic and proteomic fields implemented mechanisms for assigning unique gene and protein identifiers that were essential for organizing data presentation and for enhancing bioinformatic approaches to extracting knowledge. Similar unique identifiers are currently absent from glycomic data. In order to facilitate continued growth and expanded accessibility of glycomic data, the authors strongly encourage the glycomics community to coordinate the submission of their glycan structures to the GlyTouCan Repository and to make use of GlyTouCan identifiers in their communications and publications. The authors also deeply encourage journals to recommend a submission workflow in which submitted publications utilize GlyTouCan identifiers as a standard reference for explicitly describing glycan structures cited in manuscripts.


Subject(s)
Databases, Chemical , Glycomics/methods , Polysaccharides/chemistry , Glycomics/standards , Polysaccharides/classification
14.
Carbohydr Res ; 445: 104-116, 2017 Jun 05.
Article in English | MEDLINE | ID: mdl-28525772

ABSTRACT

GlyTouCan version 1.0 was released in 2015 as the international glycan structure repository, and a new sequence format called WURCS (Web3 Unique Representation of Carbohydrate Structures) was proposed during the early stages of the GlyTouCan project. GlyTouCan uses WURCS as its base representation for glycans because existing formats were insufficient in their flexibility to represent any and all glycans universally. Therefore, in order to obtain WURCS strings for existing or new glycan structures, conversion tools or glycan structure editors that can export WURCS became necessary. GlycanBuilder was an obvious choice to extend due to its wide usage by the community. However, GlycanBuilder was limited because it was originally developed to support mammalian glycans. It also did not support the newly proposed monosaccharide symbol standard called Symbol Nomenclature for Glycans (SNFG). Therefore in this work, we implemented a new version of GlycanBuilder to greatly increase its usability. The glycan rendering system was refactored so that cyclic glycans, nested repeating units, monosaccharide compositions and cross-linked glycan structures can be represented. Both import and export utilities for WURCS were also implemented and SNFG symbols were incorporated to allow glycans to be exported as graphics using the latest glycan symbol nomenclature. This new version of GlycanBuilder called "GlycanBuilder2", is able to support a wide variety of ambiguous glycans, including structures containing monosaccharides from bacteria and plants. These glycans can also be displayed using the new SNFG symbols. This tool can aid researchers in communicating about the complex, diverse, and ambiguous structures of glycans more rapidly. Moreover, the new GlycanBuilder can now easily output WURCS sequences from glycans drawn on the canvas. Most importantly, because GlyTouCan employs WURCS as the basic format for registration and searching of glycan information, a wider variety of glycans can now be readily registered and queried in GlyTouCan.


Subject(s)
Glycomics/methods , Polysaccharides/chemistry , Carbohydrate Sequence , Computer Graphics
15.
J Chem Inf Model ; 57(4): 632-637, 2017 04 24.
Article in English | MEDLINE | ID: mdl-28263066

ABSTRACT

Accurate representation of structural ambiguity is important for storing carbohydrate structures containing varying levels of ambiguity in the literature and databases. Although many representations for carbohydrates have been developed in the past, a generalized but discrete representation format did not exist. We had previously developed the Web3 Unique Representation of Carbohydrate Structures (WURCS) in an attempt to define a generalizable and unique linear representation for carbohydrate structures. However, it lacked sufficient rules to uniquely describe ambiguous structures. In this work, we updated WURCS to handle such ambiguous monosaccharide structures. In particular, to handle structural ambiguity around (potential) carbonyl groups incidental to the carbohydrate analysis, we defined a representation of backbone carbons containing atomic-level ambiguity. As a result, we show that WURCS 2.0 can represent a wider variety of carbohydrate structures containing ambiguous monosaccharides, such as those whose ring closure is undefined or whose anomeric information is only known. This new format provides a representation of carbohydrates that was not possible before, and it is currently being used by the International Glycan Structure Repository GlyTouCan.


Subject(s)
Carbohydrates/chemistry , Computational Biology/methods , Monosaccharides/analysis , Stereoisomerism
16.
Nucleic Acids Res ; 44(D1): D1237-42, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26476458

ABSTRACT

Glycans are known as the third major class of biopolymers, next to DNA and proteins. They cover the surfaces of many cells, serving as the 'face' of cells, whereby other biomolecules and viruses interact. The structure of glycans, however, differs greatly from DNA and proteins in that they are branched, as opposed to linear sequences of amino acids or nucleotides. Therefore, the storage of glycan information in databases, let alone their curation, has been a difficult problem. This has caused many duplicated efforts when integration is attempted between different databases, making an international repository for glycan structures, where unique accession numbers are assigned to every identified glycan structure, necessary. As such, an international team of developers and glycobiologists have collaborated to develop this repository, called GlyTouCan and is available at http://glytoucan.org/, to provide a centralized resource for depositing glycan structures, compositions and topologies, and to retrieve accession numbers for each of these registered entries. This will thus enable researchers to reference glycan structures simply by accession number, as opposed to by chemical structure, which has been a burden to integrate glycomics databases in the past.


Subject(s)
Databases, Chemical , Polysaccharides/chemistry , Molecular Structure
17.
Bioinformatics ; 31(6): 919-25, 2015 Mar 15.
Article in English | MEDLINE | ID: mdl-25388145

ABSTRACT

MOTIVATION: Over the last decades several glycomics-based bioinformatics resources and databases have been created and released to the public. Unfortunately, there is no common standard in the representation of the stored information or a common machine-readable interface allowing bioinformatics groups to easily extract and cross-reference the stored information. RESULTS: An international group of bioinformatics experts in the field of glycomics have worked together to create a standard Resource Description Framework (RDF) representation for glycomics data, focused on glycan sequences and related biological source, publications and experimental data. This RDF standard is defined by the GlycoRDF ontology and will be used by database providers to generate common machine-readable exports of the data stored in their databases. AVAILABILITY AND IMPLEMENTATION: The ontology, supporting documentation and source code used by database providers to generate standardized RDF are available online (http://www.glycoinfo.org/GlycoRDF/).


Subject(s)
Computational Biology/methods , Database Management Systems/standards , Databases, Factual/standards , Glycomics/methods , Information Storage and Retrieval/methods , Polysaccharides/chemistry , Software , Documentation , Gene Ontology , Humans
18.
J Chem Inf Model ; 54(6): 1558-66, 2014 Jun 23.
Article in English | MEDLINE | ID: mdl-24897372

ABSTRACT

In recent years, the Semantic Web has become the focus of life science database development as a means to link life science data in an effective and efficient manner. In order for carbohydrate data to be applied to this new technology, there are two requirements for carbohydrate data representations: (1) a linear notation which can be used as a URI (Uniform Resource Identifier) if needed and (2) a unique notation such that any published glycan structure can be represented distinctively. This latter requirement includes the possible representation of nonstandard monosaccharide units as a part of the glycan structure, as well as compositions, repeating units, and ambiguous structures where linkages/linkage positions are unidentified. Therefore, we have developed the Web3 Unique Representation of Carbohydrate Structures (WURCS) as a new linear notation for representing carbohydrates for the Semantic Web.


Subject(s)
Carbohydrates/chemistry , Databases, Chemical , Carbohydrate Sequence , Internet , Models, Molecular , Molecular Sequence Data , Software
19.
J Biomed Semantics ; 4(1): 39, 2013 Nov 26.
Article in English | MEDLINE | ID: mdl-24280648

ABSTRACT

BACKGROUND: Glycoscience is a research field focusing on complex carbohydrates (otherwise known as glycans)a, which can, for example, serve as "switches" that toggle between different functions of a glycoprotein or glycolipid. Due to the advancement of glycomics technologies that are used to characterize glycan structures, many glycomics databases are now publicly available and provide useful information for glycoscience research. However, these databases have almost no link to other life science databases. RESULTS: In order to implement support for the Semantic Web most efficiently for glycomics research, the developers of major glycomics databases agreed on a minimal standard for representing glycan structure and annotation information using RDF (Resource Description Framework). Moreover, all of the participants implemented this standard prototype and generated preliminary RDF versions of their data. To test the utility of the converted data, all of the data sets were uploaded into a Virtuoso triple store, and several SPARQL queries were tested as "proofs-of-concept" to illustrate the utility of the Semantic Web in querying across databases which were originally difficult to implement. CONCLUSIONS: We were able to successfully retrieve information by linking UniCarbKB, GlycomeDB and JCGGDB in a single SPARQL query to obtain our target information. We also tested queries linking UniProt with GlycoEpitope as well as lectin data with GlycomeDB through PDB. As a result, we have been able to link proteomics data with glycomics data through the implementation of Semantic Web technologies, allowing for more flexible queries across these domains.

SELECTION OF CITATIONS
SEARCH DETAIL
...