Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 13 de 13
Filter
Add more filters










Publication year range
1.
Methods Mol Biol ; 2836: 77-96, 2024.
Article in English | MEDLINE | ID: mdl-38995537

ABSTRACT

Glycosylation is a unique posttranslational modification that dynamically shapes the surface of cells. Glycans attached to proteins or lipids in a cell or tissue are studied as a whole and collectively designated as a glycome. UniCarb-DB is a glycomic spectral library of tandem mass spectrometry (MS/MS) fragment data. The current version of the database consists of over 1500 entries and over 1000 unique structures. Each entry contains parent ion information with associated MS/MS spectra, metadata about the original publication, experimental conditions, and biological origin. Each structure is also associated with the GlyTouCan glycan structure repository allowing easy access to other glycomic resources. The database can be directly utilized by mass spectrometry (MS) experimentalists through the conversion of data generated by MS into structural information. Flexible online search tools along with a downloadable version of the database are easily incorporated in either commercial or open-access MS software. This chapter highlights UniCarb-DB online search tool to browse differences of isomeric structures between spectra, a peak matching search between user-generated MS/MS spectra and spectra stored in UniCarb-DB and more advanced MS tools for combined quantitative and qualitative glycomics.


Subject(s)
Glycomics , Polysaccharides , Software , Tandem Mass Spectrometry , Tandem Mass Spectrometry/methods , Glycomics/methods , Polysaccharides/chemistry , Polysaccharides/analysis , Databases, Factual , Glycosylation , Humans
3.
Glycobiology ; 31(7): 741-750, 2021 08 07.
Article in English | MEDLINE | ID: mdl-33677548

ABSTRACT

Recent years have seen great advances in the development of glycoproteomics protocols and methods resulting in a sustainable increase in the reporting proteins, their attached glycans and glycosylation sites. However, only very few of these reports find their way into databases or data repositories. One of the major reasons is the absence of digital standard to represent glycoproteins and the challenging annotations with glycans. Depending on the experimental method, such a standard must be able to represent glycans as complete structures or as compositions, store not just single glycans but also represent glycoforms on a specific glycosylation side, deal with partially missing site information if no site mapping was performed, and store abundances or ratios of glycans within a glycoform of a specific site. To support the above, we have developed the GlycoConjugate Ontology (GlycoCoO) as a standard semantic framework to describe and represent glycoproteomics data. GlycoCoO can be used to represent glycoproteomics data in triplestores and can serve as a basis for data exchange formats. The ontology, database providers and supporting documentation are available online (https://github.com/glycoinfo/GlycoCoO).


Subject(s)
Glycoproteins , Polysaccharides , Glycoproteins/metabolism , Glycosylation , Polysaccharides/metabolism
4.
Nucleic Acids Res ; 49(D1): D1529-D1533, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33125071

ABSTRACT

Glycans serve important roles in signaling events and cell-cell communication, and they are recognized by lectins, viruses and bacteria, playing a variety of roles in many biological processes. However, there was no system to organize the plethora of glycan-related data in the literature. Thus GlyTouCan (https://glytoucan.org) was developed as the international glycan repository, allowing researchers to assign accession numbers to glycans. This also aided in the integration of glycan data across various databases. GlyTouCan assigns accession numbers to glycans which are defined as sets of monosaccharides, which may or may not be characterized with linkage information. GlyTouCan was developed to be able to recognize any level of ambiguity in glycans and uniquely assign accession numbers to each of them, regardless of the input text format. In this manuscript, we describe the latest update to GlyTouCan in version 3.0, its usage, and plans for future development.


Subject(s)
Computational Biology/methods , Databases, Factual , Polysaccharides/classification , Software , Humans , International Cooperation , Internet , Polysaccharides/analysis , Polysaccharides/chemistry , Terminology as Topic
6.
Nat Commun ; 10(1): 3275, 2019 07 22.
Article in English | MEDLINE | ID: mdl-31332201

ABSTRACT

The mass spectrometry (MS)-based analysis of free polysaccharides and glycans released from proteins, lipids and proteoglycans increasingly relies on databases and software. Here, we review progress in the bioinformatics analysis of protein-released N- and O-linked glycans (N- and O-glycomics) and propose an e-infrastructure to overcome current deficits in data and experimental transparency. This workflow enables the standardized submission of MS-based glycomics information into the public repository UniCarb-DR. It implements the MIRAGE (Minimum Requirement for A Glycomics Experiment) reporting guidelines, storage of unprocessed MS data in the GlycoPOST repository and glycan structure registration using the GlyTouCan registry, thereby supporting the development and extension of a glycan structure knowledgebase.


Subject(s)
Computational Biology/methods , Glycomics/methods , Glycoproteins/metabolism , Polysaccharides/metabolism , Animals , Computational Biology/standards , Databases, Factual/standards , Databases, Factual/statistics & numerical data , Humans , Mass Spectrometry/methods , Reference Standards
7.
Carbohydr Res ; 464: 44-56, 2018 Jul 15.
Article in English | MEDLINE | ID: mdl-29859376

ABSTRACT

Glycan-binding protein (GBP) interaction experiments, such as glycan microarrays, are often used to understand glycan recognition patterns. However, oftentimes the interpretation of glycan array experimental data makes it difficult to identify discrete GBP binding patterns due to their ambiguity. It is known that lectins, for example, are non-specific in their binding affinities; the same lectin can bind to different monosaccharides or even different glycan structures. In bioinformatics, several tools to mine the data generated from these sorts of experiments have been developed. These tools take a library of predefined motifs, which are commonly-found glycan patterns such as sialyl-Lewis X, and attempt to identify the motif(s) that are specific to the GBP being analyzed. In our previous work, as opposed to using predefined motifs, we developed the Multiple Carbohydrate Alignment with Weights (MCAW) tool to visualize the state of the glycans being recognized by the GBP under analysis. We previously reported on the effectiveness of our tool and algorithm by analyzing several glycan array datasets from the Consortium of Functional Glycomics (CFG). In this work, we report on our analysis of 1081 data sets which we collected from the CFG, the results of which we have made publicly and freely available as a database called MCAW-DB. We introduce this database, its usage and describe several analysis results. We show how MCAW-DB can be used to analyze glycan-binding patterns of GBPs amidst their ambiguity. For example, the visualization of glycan-binding patterns in MCAW-DB show how they correlate with the concentrations of the samples used in the array experiments. Using MCAW-DB, the patterns of glycans found to bind to various GBP-glycan binding proteins are visualized, indicating the binding "environment" of the glycans. Thus, the ambiguity of glycan recognition is numerically represented, along with the patterns of monosaccharides surrounding the binding region. The profiles in MCAW-DB could potentially be used as predictors of affinity of unknown or novel glycans to particular GBPs by comparing how well they match the existing profiles for those GBPs. Moreover, as the glycan profiles of diseased tissues become available, glycan alignments could also be used to identify glycan biomarkers unique to that tissue. Databases of these alignments may be of great use for drug discovery.


Subject(s)
Databases, Factual , Glycomics , Polysaccharides/metabolism , Animals , Binding Sites , Carbohydrate Sequence , Influenza A Virus, H1N1 Subtype/metabolism , Microarray Analysis , Polysaccharides/chemistry , Receptors, Cell Surface/metabolism
8.
Glycobiology ; 27(10): 915-919, 2017 10 01.
Article in English | MEDLINE | ID: mdl-28922742

ABSTRACT

Rapid and continued growth in the generation of glycomic data has revealed the need for enhanced development of basic infrastructure for presenting and interpreting these datasets in a manner that engages the broader biomedical research community. Early in their growth, the genomic and proteomic fields implemented mechanisms for assigning unique gene and protein identifiers that were essential for organizing data presentation and for enhancing bioinformatic approaches to extracting knowledge. Similar unique identifiers are currently absent from glycomic data. In order to facilitate continued growth and expanded accessibility of glycomic data, the authors strongly encourage the glycomics community to coordinate the submission of their glycan structures to the GlyTouCan Repository and to make use of GlyTouCan identifiers in their communications and publications. The authors also deeply encourage journals to recommend a submission workflow in which submitted publications utilize GlyTouCan identifiers as a standard reference for explicitly describing glycan structures cited in manuscripts.


Subject(s)
Databases, Chemical , Glycomics/methods , Polysaccharides/chemistry , Glycomics/standards , Polysaccharides/classification
9.
Carbohydr Res ; 445: 104-116, 2017 Jun 05.
Article in English | MEDLINE | ID: mdl-28525772

ABSTRACT

GlyTouCan version 1.0 was released in 2015 as the international glycan structure repository, and a new sequence format called WURCS (Web3 Unique Representation of Carbohydrate Structures) was proposed during the early stages of the GlyTouCan project. GlyTouCan uses WURCS as its base representation for glycans because existing formats were insufficient in their flexibility to represent any and all glycans universally. Therefore, in order to obtain WURCS strings for existing or new glycan structures, conversion tools or glycan structure editors that can export WURCS became necessary. GlycanBuilder was an obvious choice to extend due to its wide usage by the community. However, GlycanBuilder was limited because it was originally developed to support mammalian glycans. It also did not support the newly proposed monosaccharide symbol standard called Symbol Nomenclature for Glycans (SNFG). Therefore in this work, we implemented a new version of GlycanBuilder to greatly increase its usability. The glycan rendering system was refactored so that cyclic glycans, nested repeating units, monosaccharide compositions and cross-linked glycan structures can be represented. Both import and export utilities for WURCS were also implemented and SNFG symbols were incorporated to allow glycans to be exported as graphics using the latest glycan symbol nomenclature. This new version of GlycanBuilder called "GlycanBuilder2", is able to support a wide variety of ambiguous glycans, including structures containing monosaccharides from bacteria and plants. These glycans can also be displayed using the new SNFG symbols. This tool can aid researchers in communicating about the complex, diverse, and ambiguous structures of glycans more rapidly. Moreover, the new GlycanBuilder can now easily output WURCS sequences from glycans drawn on the canvas. Most importantly, because GlyTouCan employs WURCS as the basic format for registration and searching of glycan information, a wider variety of glycans can now be readily registered and queried in GlyTouCan.


Subject(s)
Glycomics/methods , Polysaccharides/chemistry , Carbohydrate Sequence , Computer Graphics
10.
Nucleic Acids Res ; 44(D1): D1237-42, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26476458

ABSTRACT

Glycans are known as the third major class of biopolymers, next to DNA and proteins. They cover the surfaces of many cells, serving as the 'face' of cells, whereby other biomolecules and viruses interact. The structure of glycans, however, differs greatly from DNA and proteins in that they are branched, as opposed to linear sequences of amino acids or nucleotides. Therefore, the storage of glycan information in databases, let alone their curation, has been a difficult problem. This has caused many duplicated efforts when integration is attempted between different databases, making an international repository for glycan structures, where unique accession numbers are assigned to every identified glycan structure, necessary. As such, an international team of developers and glycobiologists have collaborated to develop this repository, called GlyTouCan and is available at http://glytoucan.org/, to provide a centralized resource for depositing glycan structures, compositions and topologies, and to retrieve accession numbers for each of these registered entries. This will thus enable researchers to reference glycan structures simply by accession number, as opposed to by chemical structure, which has been a burden to integrate glycomics databases in the past.


Subject(s)
Databases, Chemical , Polysaccharides/chemistry , Molecular Structure
11.
Bioinformatics ; 31(6): 919-25, 2015 Mar 15.
Article in English | MEDLINE | ID: mdl-25388145

ABSTRACT

MOTIVATION: Over the last decades several glycomics-based bioinformatics resources and databases have been created and released to the public. Unfortunately, there is no common standard in the representation of the stored information or a common machine-readable interface allowing bioinformatics groups to easily extract and cross-reference the stored information. RESULTS: An international group of bioinformatics experts in the field of glycomics have worked together to create a standard Resource Description Framework (RDF) representation for glycomics data, focused on glycan sequences and related biological source, publications and experimental data. This RDF standard is defined by the GlycoRDF ontology and will be used by database providers to generate common machine-readable exports of the data stored in their databases. AVAILABILITY AND IMPLEMENTATION: The ontology, supporting documentation and source code used by database providers to generate standardized RDF are available online (http://www.glycoinfo.org/GlycoRDF/).


Subject(s)
Computational Biology/methods , Database Management Systems/standards , Databases, Factual/standards , Glycomics/methods , Information Storage and Retrieval/methods , Polysaccharides/chemistry , Software , Documentation , Gene Ontology , Humans
12.
J Biomed Semantics ; 4(1): 39, 2013 Nov 26.
Article in English | MEDLINE | ID: mdl-24280648

ABSTRACT

BACKGROUND: Glycoscience is a research field focusing on complex carbohydrates (otherwise known as glycans)a, which can, for example, serve as "switches" that toggle between different functions of a glycoprotein or glycolipid. Due to the advancement of glycomics technologies that are used to characterize glycan structures, many glycomics databases are now publicly available and provide useful information for glycoscience research. However, these databases have almost no link to other life science databases. RESULTS: In order to implement support for the Semantic Web most efficiently for glycomics research, the developers of major glycomics databases agreed on a minimal standard for representing glycan structure and annotation information using RDF (Resource Description Framework). Moreover, all of the participants implemented this standard prototype and generated preliminary RDF versions of their data. To test the utility of the converted data, all of the data sets were uploaded into a Virtuoso triple store, and several SPARQL queries were tested as "proofs-of-concept" to illustrate the utility of the Semantic Web in querying across databases which were originally difficult to implement. CONCLUSIONS: We were able to successfully retrieve information by linking UniCarbKB, GlycomeDB and JCGGDB in a single SPARQL query to obtain our target information. We also tested queries linking UniProt with GlycoEpitope as well as lectin data with GlycomeDB through PDB. As a result, we have been able to link proteomics data with glycomics data through the implementation of Semantic Web technologies, allowing for more flexible queries across these domains.

13.
OMICS ; 14(4): 475-86, 2010 Aug.
Article in English | MEDLINE | ID: mdl-20726803

ABSTRACT

In the bioinformatics field, many computer algorithmic and data mining technologies have been developed for gene prediction, protein-protein interaction analysis, sequence analysis, and protein folding predictions, to name a few. This kind of research has branched off from the genomics field, creating the transcriptomics, proteomics, metabolomics, and glycomics research areas in the postgenomic age. In the glycomics field, given the complexity of glycan structures with their branches of monosaccharides in various conformations, new data mining and algorithmic methods have been developed in an attempt to gain a better understanding of glycans. However, these methods have not all been implemented as tools such that the glycobiology community may utilize them in their research. Thus, we have developed RINGS (Resource for INformatics of Glycomes at Soka) as a freely available Web resource for glycobiologists to analyze their data using the latest data mining and algorithmic techniques. It provides a number of tools including a 2D glycan drawing and querying interface called DrawRINGS, a Glycan Pathway Predictor (GPP) tool for dynamically computing the N-glycan biosynthesis pathway from a given glycan structure, and data mining tools Glycan Miner Tool and Profile PSTMM. These tools and other utilities provided by RINGS will be described. The URL for RINGS is http://rings.t.soka.ac.jp/.


Subject(s)
Data Mining , Glycomics/methods , Informatics/methods , Internet , Software , Algorithms , Carbohydrate Conformation , Carbohydrate Sequence , Databases, Factual , Molecular Sequence Data
SELECTION OF CITATIONS
SEARCH DETAIL
...