Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 23
Filtrar
Mais filtros











Base de dados
Intervalo de ano de publicação
1.
Anal Bioanal Chem ; 2024 Aug 30.
Artigo em Inglês | MEDLINE | ID: mdl-39212696

RESUMO

Integration of glycan-related databases between different research fields is essential in glycoscience. It requires knowledge across the breadth of science because most glycans exist as glycoconjugates. On the other hand, especially between chemistry and biology, glycan data has not been easy to integrate due to the huge variety of glycan structure representations. We have developed WURCS (Web 3.0 Unique Representation of Carbohydrate Structures) as a notation for representing all glycan structures uniquely for the purpose of integrating data across scientific data resources. While the integration of glycan data in the field of biology has been greatly advanced, in the field of chemistry, progress has been hampered due to the lack of appropriate rules to extract sugars from chemical structures. Thus, we developed a unique algorithm to determine the range of structures allowed to be considered as sugars from the structural formulae of compounds, and we developed software to extract sugars in WURCS format according to this algorithm. In this manuscript, we show that our algorithm can extract sugars from glycoconjugate molecules represented at the molecular level and can distinguish them from other biomolecules, such as amino acids, nucleic acids, and lipids. Available as software, MolWURCS is freely available and downloadable ( https://gitlab.com/glycoinfo/molwurcs ).

2.
Yakugaku Zasshi ; 144(5): 539-543, 2024.
Artigo em Japonês | MEDLINE | ID: mdl-38692930

RESUMO

Researchers collect data and use various methods to organize it. Ensuring the reliability and reproducibility of data is crucial, and collaboration across different research fields is on the rise. However, when there is geographical distance, sharing data becomes a challenging task. Therefore, there is a need for the development of a mechanism for sharing data on the web. We have developed an integrated database to facilitate the sharing and management of research data, particularly focusing on small molecules. The integrated database serves as a platform for centralizing data related to small molecules, including their chemical structures, wet lab experimental data, simulation data, and more. It has been constructed as a web application, offering features such as library management for small molecules, registration and viewing of wet lab experiment results, generation of initial conformations for simulations, and data visualization. This enables researchers to efficiently share their research data and collaborate seamlessly, whether within their research group or via cloud-based access that allows project and team members to connect from anywhere. This integrated database plays a critical role in connecting wet lab experiments and simulations, enabling researchers to cross-reference and analyze experimental data comprehensively. It serves as an essential tool to advance research and foster idea generation.


Assuntos
Bases de Dados Factuais , Simulação por Computador , Disseminação de Informação , Internet , Reprodutibilidade dos Testes , Bibliotecas de Moléculas Pequenas
3.
J Biol Chem ; 300(2): 105624, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38176651

RESUMO

The glycosylation of proteins and lipids is known to be closely related to the mechanisms of various diseases such as influenza, cancer, and muscular dystrophy. Therefore, it has become clear that the analysis of post-translational modifications of proteins, including glycosylation, is important to accurately understand the functions of each protein molecule and the interactions among them. In order to conduct large-scale analyses more efficiently, it is essential to promote the accumulation, sharing, and reuse of experimental and analytical data in accordance with the FAIR (Findability, Accessibility, Interoperability, and Re-usability) data principles. However, a FAIR data repository for storing and sharing glycoconjugate information, including glycopeptides and glycoproteins, in a standardized format did not exist. Therefore, we have developed GlyComb (https://glycomb.glycosmos.org) as a new standardized data repository for glycoconjugate data. Currently, GlyComb can assign a unique identifier to a set of glycosylation information associated with a specific peptide sequence or UniProt ID. By standardizing glycoconjugate data via GlyComb identifiers and coordinating with existing web resources such as GlyTouCan and GlycoPOST, a comprehensive system for data submission and data sharing among researchers can be established. Here we introduce how GlyComb is able to integrate the variety of glycoconjugate data already registered in existing data repositories to obtain a better understanding of the available glycopeptides and glycoproteins, and their glycosylation patterns. We also explain how this system can serve as a foundation for a better understanding of glycan function.


Assuntos
Bases de Dados de Compostos Químicos , Glicômica , Proteômica , Glicopeptídeos/metabolismo , Glicoproteínas/metabolismo , Glicosilação , Polissacarídeos/metabolismo , Bases de Dados Genéticas
4.
Glycobiology ; 33(6): 454-463, 2023 06 21.
Artigo em Inglês | MEDLINE | ID: mdl-37129482

RESUMO

The GlyCosmos Glycoscience Portal (https://glycosmos.org) and PubChem (https://pubchem.ncbi.nlm.nih.gov/) are major portals for glycoscience and chemistry, respectively. GlyCosmos is a portal for glycan-related repositories, including GlyTouCan, GlycoPOST, and UniCarb-DR, as well as for glycan-related data resources that have been integrated from a variety of 'omics databases. Glycogenes, glycoproteins, lectins, pathways, and disease information related to glycans are accessible from GlyCosmos. PubChem, on the other hand, is a chemistry-based portal at the National Center for Biotechnology Information. PubChem provides information not only on chemicals, but also genes, proteins, pathways, as well as patents, bioassays, and more, from hundreds of data resources from around the world. In this work, these 2 portals have made substantial efforts to integrate their complementary data to allow users to cross between these 2 domains. In addition to glycan structures, key information, such as glycan-related genes, relevant diseases, glycoproteins, and pathways, was integrated and cross-linked with one another. The interfaces were designed to enable users to easily find, access, download, and reuse data of interest across these resources. Use cases are described illustrating and highlighting the type of content that can be investigated. In total, these integrations provide life science researchers improved awareness and enhanced access to glycan-related information.


Assuntos
Bases de Dados de Compostos Químicos , Polissacarídeos , Glicosilação , Fluxo de Trabalho , Informática , Polissacarídeos/química , Glicoconjugados/química
6.
Glycobiology ; 32(8): 646-650, 2022 07 13.
Artigo em Inglês | MEDLINE | ID: mdl-35452093

RESUMO

High-performance liquid chromatography (HPLC) elution data provide a useful tool for quantitative glycosylation profiling, discriminating isomeric oligosaccharides. The web application Glycoanalysis by the Three Axes of MS and Chromatography (GALAXY), which is based on the three-dimensional HPLC map of N-linked oligosaccharides with pyridyl-2-amination developed by Dr. Noriko Takahashi, has been extensively used for N-glycosylation profiling at molecular, cellular, and tissue levels. Herein, we describe the updated GALAXY as version 3, which includes new HPLC data including those of glucuronylated and sulfated glycans, an improved graphical user interface using modern technologies, and linked to glycan information in GlyTouCan and the GlyCosmos Portal. This liaison will facilitate glycomic analyses of human and other organisms in conjunction with multiomics data.


Assuntos
Oligossacarídeos , Polissacarídeos , Cromatografia Líquida de Alta Pressão/métodos , Glicosilação , Humanos , Oligossacarídeos/química , Polissacarídeos/química
8.
Molecules ; 26(23)2021 Nov 25.
Artigo em Inglês | MEDLINE | ID: mdl-34885724

RESUMO

In life science fields, database integration is progressing and contributing to collaboration between different research fields, including the glycosciences. The integration of glycan databases has greatly progressed collaboration worldwide with the development of the international glycan structure repository, GlyTouCan. This trend has increased the need for a tool by which researchers in various fields can easily search glycan structures from integrated databases. We have developed a web-based glycan structure search tool, SugarDrawer, which supports the depiction of glycans including ambiguity, such as glycan fragments which contain underdetermined linkages, and a database search for glycans drawn on the canvas. This tool provides an easy editing feature for various glycan structures in just a few steps using template structures and pop-up windows which allow users to select specific information for each structure element. This tool has a unique feature for selecting possible attachment sites, which is defined in the Symbol Nomenclature for Glycans (SNFG). In addition, this tool can input and output glycans in WURCS and GlycoCT formats, which are the most commonly-used text formats for glycan structures.


Assuntos
Bases de Dados Factuais , Internet , Polissacarídeos/genética , Software , Disciplinas das Ciências Biológicas , Humanos , Polissacarídeos/química , Polissacarídeos/classificação , Polissacarídeos/ultraestrutura
9.
Glycobiology ; 31(7): 741-750, 2021 08 07.
Artigo em Inglês | MEDLINE | ID: mdl-33677548

RESUMO

Recent years have seen great advances in the development of glycoproteomics protocols and methods resulting in a sustainable increase in the reporting proteins, their attached glycans and glycosylation sites. However, only very few of these reports find their way into databases or data repositories. One of the major reasons is the absence of digital standard to represent glycoproteins and the challenging annotations with glycans. Depending on the experimental method, such a standard must be able to represent glycans as complete structures or as compositions, store not just single glycans but also represent glycoforms on a specific glycosylation side, deal with partially missing site information if no site mapping was performed, and store abundances or ratios of glycans within a glycoform of a specific site. To support the above, we have developed the GlycoConjugate Ontology (GlycoCoO) as a standard semantic framework to describe and represent glycoproteomics data. GlycoCoO can be used to represent glycoproteomics data in triplestores and can serve as a basis for data exchange formats. The ontology, database providers and supporting documentation are available online (https://github.com/glycoinfo/GlycoCoO).


Assuntos
Glicoproteínas , Polissacarídeos , Glicoproteínas/metabolismo , Glicosilação , Polissacarídeos/metabolismo
10.
Structure ; 29(4): 393-400.e1, 2021 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-33657417

RESUMO

The Worldwide Protein Data Bank (wwPDB) has provided validation reports based on recommendations from community Validation Task Forces for structures in the PDB since 2013. To further enhance validation of small molecules as recommended from the 2016 Ligand Validation Workshop, wwPDB, Global Phasing Ltd., and the Noguchi Institute, recently formed a public/private partnership to incorporate some of their software tools into the wwPDB validation package. Augmented wwPDB validation report features include: two-dimensional (2D) diagrams of small-molecule ligands and carbohydrates, highlighting geometric validation outcomes; 2D topological diagrams of oligosaccharides present in branched entities generated using 2D Symbol Nomenclature for Glycan representation; and views of 3D electron density maps for ligands and carbohydrates, illustrating the goodness-of-fit between the atomic structure and experimental data (X-ray crystallographic structures only). These improvements will impact confidence in ligand conformation and ligand-macromolecular interactions that will aid in understanding biochemical function and contribute to small-molecule drug discovery.


Assuntos
Carboidratos/química , Bases de Dados de Proteínas/normas , Simulação de Acoplamento Molecular/métodos , Proteômica/métodos , Bibliotecas de Moléculas Pequenas/química , Quimioinformática/métodos , Bases de Dados de Compostos Químicos/normas , Humanos , Ligantes , Ligação Proteica , Proteoma/química , Proteoma/metabolismo
11.
Nucleic Acids Res ; 49(D1): D1529-D1533, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33125071

RESUMO

Glycans serve important roles in signaling events and cell-cell communication, and they are recognized by lectins, viruses and bacteria, playing a variety of roles in many biological processes. However, there was no system to organize the plethora of glycan-related data in the literature. Thus GlyTouCan (https://glytoucan.org) was developed as the international glycan repository, allowing researchers to assign accession numbers to glycans. This also aided in the integration of glycan data across various databases. GlyTouCan assigns accession numbers to glycans which are defined as sets of monosaccharides, which may or may not be characterized with linkage information. GlyTouCan was developed to be able to recognize any level of ambiguity in glycans and uniquely assign accession numbers to each of them, regardless of the input text format. In this manuscript, we describe the latest update to GlyTouCan in version 3.0, its usage, and plans for future development.


Assuntos
Biologia Computacional/métodos , Bases de Dados Factuais , Polissacarídeos/classificação , Software , Humanos , Cooperação Internacional , Internet , Polissacarídeos/análise , Polissacarídeos/química , Terminologia como Assunto
13.
Bioinformatics ; 35(14): 2434-2440, 2019 07 15.
Artigo em Inglês | MEDLINE | ID: mdl-30535258

RESUMO

MOTIVATION: Glycans are biomolecules that take an important role in the biological processes of living organisms. They form diverse, complicated structures such as branched and cyclic forms. Web3 Unique Representation of Carbohydrate Structures (WURCS) was proposed as a new linear notation for uniquely representing glycans during the GlyTouCan project. WURCS defines rules for complex glycan structures that other text formats did not support, and so it is possible to represent a wide variety glycans. However, WURCS uses a complicated nomenclature, so it is not human-readable. Therefore, we aimed to support the interpretation of WURCS by converting WURCS to the most basic and widely used format IUPAC. RESULTS: In this study, we developed GlycanFormatConverter and succeeded in converting WURCS to the three kinds of IUPAC formats (IUPAC-Extended, IUPAC-Condensed and IUPAC-Short). Furthermore, we have implemented functionality to import IUPAC-Extended, KEGG Chemical Function (KCF) and LinearCode formats and to export WURCS. We have thoroughly tested our GlycanFormatConverter and were able to show that it was possible to convert all the glycans registered in the GlyTouCan repository, with exceptions owing only to the limitations of the original format. The source code for this conversion tool has been released as an open source tool. AVAILABILITY AND IMPLEMENTATION: https://github.com/glycoinfo/GlycanFormatConverter.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Software , Polissacarídeos
14.
Glycobiology ; 27(10): 915-919, 2017 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-28922742

RESUMO

Rapid and continued growth in the generation of glycomic data has revealed the need for enhanced development of basic infrastructure for presenting and interpreting these datasets in a manner that engages the broader biomedical research community. Early in their growth, the genomic and proteomic fields implemented mechanisms for assigning unique gene and protein identifiers that were essential for organizing data presentation and for enhancing bioinformatic approaches to extracting knowledge. Similar unique identifiers are currently absent from glycomic data. In order to facilitate continued growth and expanded accessibility of glycomic data, the authors strongly encourage the glycomics community to coordinate the submission of their glycan structures to the GlyTouCan Repository and to make use of GlyTouCan identifiers in their communications and publications. The authors also deeply encourage journals to recommend a submission workflow in which submitted publications utilize GlyTouCan identifiers as a standard reference for explicitly describing glycan structures cited in manuscripts.


Assuntos
Bases de Dados de Compostos Químicos , Glicômica/métodos , Polissacarídeos/química , Glicômica/normas , Polissacarídeos/classificação
15.
Carbohydr Res ; 445: 104-116, 2017 Jun 05.
Artigo em Inglês | MEDLINE | ID: mdl-28525772

RESUMO

GlyTouCan version 1.0 was released in 2015 as the international glycan structure repository, and a new sequence format called WURCS (Web3 Unique Representation of Carbohydrate Structures) was proposed during the early stages of the GlyTouCan project. GlyTouCan uses WURCS as its base representation for glycans because existing formats were insufficient in their flexibility to represent any and all glycans universally. Therefore, in order to obtain WURCS strings for existing or new glycan structures, conversion tools or glycan structure editors that can export WURCS became necessary. GlycanBuilder was an obvious choice to extend due to its wide usage by the community. However, GlycanBuilder was limited because it was originally developed to support mammalian glycans. It also did not support the newly proposed monosaccharide symbol standard called Symbol Nomenclature for Glycans (SNFG). Therefore in this work, we implemented a new version of GlycanBuilder to greatly increase its usability. The glycan rendering system was refactored so that cyclic glycans, nested repeating units, monosaccharide compositions and cross-linked glycan structures can be represented. Both import and export utilities for WURCS were also implemented and SNFG symbols were incorporated to allow glycans to be exported as graphics using the latest glycan symbol nomenclature. This new version of GlycanBuilder called "GlycanBuilder2", is able to support a wide variety of ambiguous glycans, including structures containing monosaccharides from bacteria and plants. These glycans can also be displayed using the new SNFG symbols. This tool can aid researchers in communicating about the complex, diverse, and ambiguous structures of glycans more rapidly. Moreover, the new GlycanBuilder can now easily output WURCS sequences from glycans drawn on the canvas. Most importantly, because GlyTouCan employs WURCS as the basic format for registration and searching of glycan information, a wider variety of glycans can now be readily registered and queried in GlyTouCan.


Assuntos
Glicômica/métodos , Polissacarídeos/química , Sequência de Carboidratos , Gráficos por Computador
16.
J Chem Inf Model ; 57(4): 632-637, 2017 04 24.
Artigo em Inglês | MEDLINE | ID: mdl-28263066

RESUMO

Accurate representation of structural ambiguity is important for storing carbohydrate structures containing varying levels of ambiguity in the literature and databases. Although many representations for carbohydrates have been developed in the past, a generalized but discrete representation format did not exist. We had previously developed the Web3 Unique Representation of Carbohydrate Structures (WURCS) in an attempt to define a generalizable and unique linear representation for carbohydrate structures. However, it lacked sufficient rules to uniquely describe ambiguous structures. In this work, we updated WURCS to handle such ambiguous monosaccharide structures. In particular, to handle structural ambiguity around (potential) carbonyl groups incidental to the carbohydrate analysis, we defined a representation of backbone carbons containing atomic-level ambiguity. As a result, we show that WURCS 2.0 can represent a wider variety of carbohydrate structures containing ambiguous monosaccharides, such as those whose ring closure is undefined or whose anomeric information is only known. This new format provides a representation of carbohydrates that was not possible before, and it is currently being used by the International Glycan Structure Repository GlyTouCan.


Assuntos
Carboidratos/química , Biologia Computacional/métodos , Monossacarídeos/análise , Estereoisomerismo
17.
Nucleic Acids Res ; 44(D1): D1237-42, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26476458

RESUMO

Glycans are known as the third major class of biopolymers, next to DNA and proteins. They cover the surfaces of many cells, serving as the 'face' of cells, whereby other biomolecules and viruses interact. The structure of glycans, however, differs greatly from DNA and proteins in that they are branched, as opposed to linear sequences of amino acids or nucleotides. Therefore, the storage of glycan information in databases, let alone their curation, has been a difficult problem. This has caused many duplicated efforts when integration is attempted between different databases, making an international repository for glycan structures, where unique accession numbers are assigned to every identified glycan structure, necessary. As such, an international team of developers and glycobiologists have collaborated to develop this repository, called GlyTouCan and is available at http://glytoucan.org/, to provide a centralized resource for depositing glycan structures, compositions and topologies, and to retrieve accession numbers for each of these registered entries. This will thus enable researchers to reference glycan structures simply by accession number, as opposed to by chemical structure, which has been a burden to integrate glycomics databases in the past.


Assuntos
Bases de Dados de Compostos Químicos , Polissacarídeos/química , Estrutura Molecular
18.
Bioinformatics ; 31(6): 919-25, 2015 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-25388145

RESUMO

MOTIVATION: Over the last decades several glycomics-based bioinformatics resources and databases have been created and released to the public. Unfortunately, there is no common standard in the representation of the stored information or a common machine-readable interface allowing bioinformatics groups to easily extract and cross-reference the stored information. RESULTS: An international group of bioinformatics experts in the field of glycomics have worked together to create a standard Resource Description Framework (RDF) representation for glycomics data, focused on glycan sequences and related biological source, publications and experimental data. This RDF standard is defined by the GlycoRDF ontology and will be used by database providers to generate common machine-readable exports of the data stored in their databases. AVAILABILITY AND IMPLEMENTATION: The ontology, supporting documentation and source code used by database providers to generate standardized RDF are available online (http://www.glycoinfo.org/GlycoRDF/).


Assuntos
Biologia Computacional/métodos , Sistemas de Gerenciamento de Base de Dados/normas , Bases de Dados Factuais/normas , Glicômica/métodos , Armazenamento e Recuperação da Informação/métodos , Polissacarídeos/química , Software , Documentação , Ontologia Genética , Humanos
19.
J Chem Inf Model ; 54(6): 1558-66, 2014 Jun 23.
Artigo em Inglês | MEDLINE | ID: mdl-24897372

RESUMO

In recent years, the Semantic Web has become the focus of life science database development as a means to link life science data in an effective and efficient manner. In order for carbohydrate data to be applied to this new technology, there are two requirements for carbohydrate data representations: (1) a linear notation which can be used as a URI (Uniform Resource Identifier) if needed and (2) a unique notation such that any published glycan structure can be represented distinctively. This latter requirement includes the possible representation of nonstandard monosaccharide units as a part of the glycan structure, as well as compositions, repeating units, and ambiguous structures where linkages/linkage positions are unidentified. Therefore, we have developed the Web3 Unique Representation of Carbohydrate Structures (WURCS) as a new linear notation for representing carbohydrates for the Semantic Web.


Assuntos
Carboidratos/química , Bases de Dados de Compostos Químicos , Sequência de Carboidratos , Internet , Modelos Moleculares , Dados de Sequência Molecular , Software
20.
J Biomed Semantics ; 4(1): 39, 2013 Nov 26.
Artigo em Inglês | MEDLINE | ID: mdl-24280648

RESUMO

BACKGROUND: Glycoscience is a research field focusing on complex carbohydrates (otherwise known as glycans)a, which can, for example, serve as "switches" that toggle between different functions of a glycoprotein or glycolipid. Due to the advancement of glycomics technologies that are used to characterize glycan structures, many glycomics databases are now publicly available and provide useful information for glycoscience research. However, these databases have almost no link to other life science databases. RESULTS: In order to implement support for the Semantic Web most efficiently for glycomics research, the developers of major glycomics databases agreed on a minimal standard for representing glycan structure and annotation information using RDF (Resource Description Framework). Moreover, all of the participants implemented this standard prototype and generated preliminary RDF versions of their data. To test the utility of the converted data, all of the data sets were uploaded into a Virtuoso triple store, and several SPARQL queries were tested as "proofs-of-concept" to illustrate the utility of the Semantic Web in querying across databases which were originally difficult to implement. CONCLUSIONS: We were able to successfully retrieve information by linking UniCarbKB, GlycomeDB and JCGGDB in a single SPARQL query to obtain our target information. We also tested queries linking UniProt with GlycoEpitope as well as lectin data with GlycomeDB through PDB. As a result, we have been able to link proteomics data with glycomics data through the implementation of Semantic Web technologies, allowing for more flexible queries across these domains.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA