Search | VHL Regional Portal (test)

Dark Proteome Database: Studies on Disorder.

Perdigão, Nelson; Pina, Pedro M C; Rocha, Cátia; Tavares, João Manuel R S; Rosa, Agostinho.

High Throughput ; 9(3), 2020 Jun 30.

Article in English | MEDLINE | ID: mdl-32629790

ABSTRACT

There is a misconception that intrinsic disorder in proteins is equivalent to darkness. The present study aims to establish, in the scope of the Swiss-Prot and Dark Proteome databases, the relationship between disorder and darkness. Three distinct predictors were used to calculate the disorder of Swiss-Prot proteins. Analysis of the results obtained with these predictors and visualization paradigms led to the same conclusion that was reached before: disorder is mostly unrelated to darkness.

Dark Proteome Database: Studies on Dark Proteins.

Perdigão, Nelson; Rosa, Agostinho.

High Throughput ; 8(2), 2019 Mar 27.

Article in English | MEDLINE | ID: mdl-30934744

ABSTRACT

The dark proteome, as we define it, is the part of the proteome where 3D structure has not been observed either by homology modeling or by experimental characterization in the protein universe. Of the 550,116 proteins available in Swiss-Prot (as of July 2016), 43.2% of the eukarya universe and 49.2% of the virus universe are part of the dark proteome. In bacteria and archaea, the percentage of the dark proteome is significantly lower, at 12.6% and 13.3% respectively. In this work, we present a necessary step to complete the dark proteome picture by introducing the map of the dark proteome in the human and in other model organisms of special importance to mankind. The most significant result is that around 40% to 50% of the proteome of these organisms is still in the dark, where the higher percentages belong to higher eukaryotes (mouse and human organisms). Because the amount of darkness present in the human organism is more than 50%, deeper studies were made, including the identification of 'dark' genes that are responsible for the production of so-called dark proteins, as well as the identification of the 'dark' tissues where dark proteins are overrepresented, namely, the heart, cervical mucosa, and natural killer cells. This is a step forward in the direction of gaining a deeper knowledge of the human dark proteome.

The Dark Proteome Database.

Perdigão, Nelson; Rosa, Agostinho C; O'Donoghue, Seán I.

BioData Min ; 10: 24, 2017.

Article in English | MEDLINE | ID: mdl-28736578

ABSTRACT

BACKGROUND: Recently we surveyed the dark-proteome, i.e., regions of proteins never observed by experimental structure determination and inaccessible to homology modelling. Surprisingly, we found that most of the dark proteome could not be accounted for by conventional explanations (e.g., intrinsic disorder, transmembrane domains, and compositional bias), and that nearly half of the dark proteome comprised dark proteins, in which the entire sequence lacked similarity to any known structure. In this paper we will present the Dark Proteome Database (DPD) and associated web services that provide access to updated information about the dark proteome. RESULTS: We assembled DPD from several external web resources (primarily Aquaria and Swiss-Prot) and stored it in a relational database currently containing ~10 million entries and occupying ~2 GBytes of disk space. This database comprises two key tables: one giving information on the 'darkness' of each protein, and a second table that breaks each protein into dark and non-dark regions. In addition, a second version of the database is created using also information from the Protein Model Portal (PMP) to determine darkness. To provide access to DPD, a web server has been implemented giving access to all underlying data, as well as providing access to functional analyses derived from these data. CONCLUSIONS: Availability of this database and its web service will help focus future structural and computational biology efforts to study the dark proteome, thus providing a basis for understanding a wide variety of biological functions that currently remain unknown. AVAILABILITY AND IMPLEMENTATION: DPD is available at http://darkproteome.ws. The complete database is also available upon request. Data use is permitted via the Creative Commons Attribution-NonCommercial International license (http://creativecommons.org/licenses/by-nc/4.0/).
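
The two-table layout described in the abstract above can be pictured with a minimal sketch. This is not the DPD schema: the table names, column names, and demo accessions below are hypothetical, and the sketch assumes that a protein's darkness is simply the fraction of its residues that fall in dark regions, with regions given as 1-based inclusive ranges that cover the whole sequence.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with two hypothetical tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- One row per protein: overall darkness (assumed: fraction of dark residues).
        CREATE TABLE protein_darkness (
            accession TEXT PRIMARY KEY,
            length    INTEGER NOT NULL,
            darkness  REAL NOT NULL
        );
        -- One row per region: each protein split into dark and non-dark spans.
        CREATE TABLE protein_regions (
            accession TEXT NOT NULL,
            start_pos INTEGER NOT NULL,
            end_pos   INTEGER NOT NULL,
            is_dark   INTEGER NOT NULL
        );
        """
    )
    return conn


def add_protein(conn, accession, regions):
    """Store regions (start, end, is_dark) and the darkness derived from them."""
    length = sum(end - start + 1 for start, end, _ in regions)
    dark = sum(end - start + 1 for start, end, is_dark in regions if is_dark)
    conn.executemany(
        "INSERT INTO protein_regions VALUES (?, ?, ?, ?)",
        [(accession, start, end, int(is_dark)) for start, end, is_dark in regions],
    )
    conn.execute(
        "INSERT INTO protein_darkness VALUES (?, ?, ?)",
        (accession, length, dark / length),
    )


def darkness_of(conn, accession):
    """Return the stored darkness value for one protein."""
    row = conn.execute(
        "SELECT darkness FROM protein_darkness WHERE accession = ?", (accession,)
    ).fetchone()
    return row[0]


def dark_proteins(conn):
    """Return accessions whose entire sequence is dark."""
    rows = conn.execute(
        "SELECT accession FROM protein_darkness WHERE darkness = 1.0 ORDER BY accession"
    ).fetchall()
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = build_demo_db()
    # Made-up demo entries, not real Swiss-Prot accessions.
    add_protein(conn, "DEMO1", [(1, 50, False), (51, 100, True)])
    add_protein(conn, "DEMO2", [(1, 80, True)])
    print(darkness_of(conn, "DEMO1"), dark_proteins(conn))
```

Keeping the per-protein summary separate from the per-region rows means that a query for fully dark proteins only touches the smaller summary table, while region-level questions use the second table.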

Unexpected features of the dark proteome.

Perdigão, Nelson; Heinrich, Julian; Stolte, Christian; Sabir, Kenneth S; Buckley, Michael J; Tabor, Bruce; Signal, Beth; Gloss, Brian S; Hammang, Christopher J; Rost, Burkhard; Schafferhans, Andrea; O'Donoghue, Seán I.

Proc Natl Acad Sci U S A ; 112(52): 15898-903, 2015 Dec 29.

Article in English | MEDLINE | ID: mdl-26578815

ABSTRACT

We surveyed the "dark" proteome, that is, regions of proteins never observed by experimental structure determination and inaccessible to homology modeling. For 546,000 Swiss-Prot proteins, we found that 44-54% of the proteome in eukaryotes and viruses was dark, compared with only ~14% in archaea and bacteria. Surprisingly, most of the dark proteome could not be accounted for by conventional explanations, such as intrinsic disorder or transmembrane regions. Nearly half of the dark proteome comprised dark proteins, in which the entire sequence lacked similarity to any known structure. Dark proteins fulfill a wide variety of functions, but a subset showed distinct and largely unexpected features, such as association with secretion, specific tissues, the endoplasmic reticulum, disulfide bonding, and proteolytic cleavage. Dark proteins also had short sequence length, low evolutionary reuse, and few known interactions with other proteins. These results suggest new research directions in structural and computational biology.

Subjects

Computational Biology/methods; Databases, Protein; Proteins/metabolism; Proteome/metabolism; Algorithms; Animals; Archaea/genetics; Archaea/metabolism; Bacteria/genetics; Bacteria/metabolism; Eukaryota/metabolism; Humans; Models, Molecular; Protein Conformation; Proteins/chemistry; Proteins/genetics; Proteome/chemistry; Proteome/genetics; Viruses/genetics; Viruses/metabolism

Aquaria: simplifying discovery and insight from protein structures.

O'Donoghue, Seán I; Sabir, Kenneth S; Kalemanov, Maria; Stolte, Christian; Wellmann, Benjamin; Ho, Vivian; Roos, Manfred; Perdigão, Nelson; Buske, Fabian A; Heinrich, Julian; Rost, Burkhard; Schafferhans, Andrea.

Nat Methods ; 12(2): 98-9, 2015 Feb.

Article in English | MEDLINE | ID: mdl-25633501

Subjects

Databases, Protein; Proteins/chemistry; Amino Acid Sequence; Molecular Sequence Data; Protein Conformation

How to learn about gene function: text-mining or ontologies?

Soldatos, Theodoros G; Perdigão, Nelson; Brown, Nigel P; Sabir, Kenneth S; O'Donoghue, Seán I.

Methods ; 74: 3-15, 2015 Mar.

Article in English | MEDLINE | ID: mdl-25088781

ABSTRACT

As the amount of genome information increases rapidly, there is a correspondingly greater need for methods that provide accurate and automated annotation of gene function. For example, many high-throughput technologies (e.g., next-generation sequencing) are being used today to generate lists of genes associated with specific conditions. However, their functional interpretation remains a challenge, and many tools exist that try to characterize the function of gene lists. Such systems typically rely on enrichment analysis and aim to give quick insight into the underlying biology by presenting it in the form of a summary report. While the load of annotation may be alleviated by such computational approaches, the main challenge in modern annotation remains to develop a systems form of analysis in which a pipeline can effectively analyze gene lists quickly and identify aggregated annotations through computerized resources. In this article we survey some of the many such tools and methods that have been developed to automatically interpret the biological functions underlying gene lists. We overview current functional annotation aspects from the perspective of their epistemology (i.e., the underlying theories used to organize information about gene function into a body of verified and documented knowledge) and find that most of the currently used functional annotation methods fall broadly into one of two categories: they are based either on 'known' formally-structured ontology annotations created by 'experts' (e.g., the GO terms used to describe the function of Entrez Gene entries), or, perhaps more adventurously, on annotations inferred from literature (e.g., many text-mining methods use computer-aided reasoning to acquire knowledge represented in natural languages). Overall, however, deriving detailed and accurate insight from such gene lists remains a challenging task, and improved methods are called for. In particular, future methods need to (1) provide more holistic insight into the underlying molecular systems; (2) provide better follow-up experimental testing and treatment options; and (3) better manage gene lists derived from organisms that are not well-studied. We discuss some promising approaches that may help achieve these advances, especially the use of extended dictionaries of biomedical concepts and molecular mechanisms, as well as greater use of annotation benchmarks.

Subjects

Data Mining/methods; Databases, Genetic; Gene Ontology; Animals; Data Mining/trends; Databases, Genetic/trends; Gene Ontology/trends; Humans
