Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
1.
Curr Pharm Des ; 20(38): 5945-56, 2014.
Article in English | MEDLINE | ID: mdl-24641232

ABSTRACT

Non-Communicable Diseases (NCDs) are among the most pressing global health problems of the twenty-first century. Their rising incidence and prevalence is linked to severe morbidity and mortality, and they are putting economic and managerial pressure on healthcare systems around the world. Moreover, NCDs are impeding healthy aging by negatively affecting the quality of life of a growing number of the global population. NCDs result from the interaction of various genetic, environmental and habitual factors, and cluster in complex ways, making the complex identification of resulting phenotypes not only difficult, but also a top research priority. The degree of complexity required to interpret large patient datasets generated by advanced high-throughput functional genomics assays has now increased to the point that novel computational biology approaches are essential to extract information that is relevant to the clinical decision-making process. Consequently, system-level models that interpret the interactions between extensive tissues, cellular and molecular measurements and clinical features are also being created to identify new disease phenotypes, so that disease definition and treatment are optimized, and novel therapeutic targets discovered. Likewise, Systems Medicine (SM) platforms applied to extensively-characterized patients provide a basis for more targeted clinical trials, and represent a promising tool to achieve better prevention and patient care, thereby promoting healthy aging globally. The present paper: (1) reviews the novel systems approaches to NCDs; (2) discusses how to move efficiently from Systems Biology to Systems Medicine; and (3) presents the scientific and clinical background of the San Raffaele Systems Medicine Platform.


Subject(s)
Clinical Medicine/methods , Comprehension , Disease Management , Systems Biology/methods , Clinical Medicine/trends , Humans , Systems Biology/trends
2.
PLoS Comput Biol ; 8(2): e1002365, 2012 Feb.
Article in English | MEDLINE | ID: mdl-22319435

ABSTRACT

Elucidation of new biomarkers and potential drug targets from high-throughput profiling data is a challenging task due to a limited number of available biological samples and questionable reproducibility of differential changes in cross-dataset comparisons. In this paper we propose a novel computational approach for drug and biomarkers discovery using comprehensive analysis of multiple expression profiling datasets.The new method relies on aggregation of individual profiling experiments combined with leave-one-dataset-out validation approach. Aggregated datasets were studied using Sub-Network Enrichment Analysis algorithm (SNEA) to find consistent statistically significant key regulators within the global literature-extracted expression regulation network. These regulators were linked to the consistent differentially expressed genes.We have applied our approach to several publicly available human muscle gene expression profiling datasets related to Duchenne muscular dystrophy (DMD). In order to detect both enhanced and repressed processes we considered up- and down-regulated genes separately. Applying the proposed approach to the regulators search we discovered the disturbance in the activity of several muscle-related transcription factors (e.g. MYOG and MYOD1), regulators of inflammation, regeneration, and fibrosis. Almost all SNEA-derived regulators of down-regulated genes (e.g. AMPK, TORC2, PPARGC1A) correspond to a single common pathway important for fast-to-slow twitch fiber type transition. We hypothesize that this process can affect the severity of DMD symptoms, making corresponding regulators and downstream genes valuable candidates for being potential drug targets and exploratory biomarkers.


Subject(s)
Computational Biology/methods , Drug Discovery/methods , Gene Expression Profiling/methods , Muscular Dystrophy, Duchenne/drug therapy , Algorithms , Biomarkers/analysis , Databases, Genetic , Humans , Male , Meta-Analysis as Topic , Muscular Dystrophy, Duchenne/genetics , Muscular Dystrophy, Duchenne/metabolism , Oligonucleotide Array Sequence Analysis
3.
Am J Cancer Res ; 2(1): 93-103, 2012.
Article in English | MEDLINE | ID: mdl-22206048

ABSTRACT

Non-small cell lung cancer (NSCLC) is the most common type of lung cancer, with a poor response to chemotherapy and low survival rate. This unfavorable treatment response is likely to derive from both late diagnosis and from complex, incompletely understood biology, and heterogeneity among NSCLC subtypes. To define the relative contributions of major cellular pathways to the biogenesis of NSCLC and highlight major differences between NSCLC subtypes, we studied the molecular signatures of lung adenocarcinoma (ADC) and squamous cell carcinoma (SCC), based on analysis of gene expression and comparison of tumor samples with normal lung tissue. Our results suggest the existence of specific molecular networks and subtype-specific differences between lung ADC and SCC subtypes, mostly found in cell cycle, DNA repair, and metabolic pathways. However, we also observed similarities across major gene interaction networks and pathways in ADC and SCC. These data provide a new insight into the biology of ADC and SCC and can be used to explore novel therapeutic interventions in lung cancer chemoprevention and treatment.

4.
Genes Cancer ; 2(9): 870-9, 2011 Sep.
Article in English | MEDLINE | ID: mdl-22593799

ABSTRACT

Triple-negative breast cancer (TNBC) is an aggressive breast cancer subtype with a high rate of proliferation and metastasis, as well as poor prognosis for advanced-stage disease. Although TNBC was previously classified together with basal-like and BRCA1/2-related breast cancers, genomic profiling now shows that there is incomplete overlap, with important distinctions associated with each subtype. The biology of TNBC is still poorly understood; therefore, to define the relative contributions of major cellular pathways in TNBC, we have studied its molecular signature based on analysis of gene expression. Comparisons were then made with normal breast tissue. Our results suggest the existence of molecular networks in TNBC, characterized by explicit alterations in the cell cycle, DNA repair, nucleotide synthesis, metabolic pathways, NF-κB signaling, inflammatory response, and angiogenesis. Moreover, we also characterized TNBC as a cancer of mixed phenotypes, suggesting that TNBC extends beyond the basal-like molecular signature and may constitute an independent subtype of breast cancer. The data provide a new insight into the biology of TNBC.

5.
J Bioinform Comput Biol ; 8(3): 593-606, 2010 Jun.
Article in English | MEDLINE | ID: mdl-20556864

ABSTRACT

Heterogeneous high-throughput biological data become readily available for various diseases. The amount of data points generated by such experiments does not allow manual integration of the information to design the most optimal therapy for a disease. We describe a novel computational workflow for designing therapy using Ariadne Genomics Pathway Studio software. We use publically available microarray experiments for glioblastoma and automatically constructed ResNet and ChemEffect databases to exemplify how to find potentially effective chemicals for glioblastoma--the disease yet without effective treatment. Our first approach involved construction of signaling pathway affected in glioblastoma using scientific literature and data available in ResNet database. Compounds known to affect multiple proteins in this pathway were found in ChemEffect database. Another approach involved analysis of differential expression in glioblastoma patients using Sub-Network Enrichment Analysis (SNEA). SNEA identified angiogenesis-related protein Cyr61 as the major positive regulator upstream of genes differentially expressed in glioblastoma. Using our findings, we then identified breast cancer drug Fulvestrant as a major inhibitor of glioblastoma pathway as well as Cyr61. This suggested Fulvestrant as a potential treatment against glioblastoma. We further show how to increase efficacy of glioblastoma treatment by finding optimal combinations of Fulvestrant with other drugs.


Subject(s)
Antineoplastic Agents/administration & dosage , Combinatorial Chemistry Techniques/methods , Glioblastoma/drug therapy , Glioblastoma/metabolism , Models, Biological , Neoplasm Proteins/metabolism , Signal Transduction/drug effects , Animals , Computer Simulation , Drug Design , Humans
6.
PLoS One ; 5(2): e9256, 2010 Feb 17.
Article in English | MEDLINE | ID: mdl-20174649

ABSTRACT

Microarray-based expression profiling of living systems is a quick and inexpensive method to obtain insights into the nature of various diseases and phenotypes. A typical microarray profile can yield hundreds or even thousands of differentially expressed genes and finding biologically plausible themes or regulatory mechanisms underlying these changes is a non-trivial and daunting task. We describe a novel approach for systems-level interpretation of microarray expression data using a manually constructed "overview" pathway depicting the main cellular signaling channels (Atlas of Signaling). Currently, the developed pathway focuses on signal transduction from surface receptors to transcription factors and further transcriptional regulation of cellular "workhorse" proteins. We show how the constructed Atlas of Signaling in combination with an enrichment analysis algorithm allows quick identification and visualization of the main signaling cascades and cellular processes affected in a gene expression profiling experiment. We validate our approach using several publicly available gene expression datasets.


Subject(s)
Gene Expression Profiling/methods , Gene Regulatory Networks , Oligonucleotide Array Sequence Analysis/methods , Signal Transduction/genetics , Algorithms , Gene Expression Regulation , Models, Genetic , Proteome/genetics , Software
7.
Expert Opin Drug Discov ; 4(12): 1307-18, 2009 Dec.
Article in English | MEDLINE | ID: mdl-23480468

ABSTRACT

IMPORTANCE OF THE FIELD: Drug discovery and development is a very complex and costly process. Understanding the detailed molecular mechanisms of a disease and drug actions can make it more efficient not only for new target discovery but also for lead prioritization, drug repositioning and development of biomarkers for drug efficacy and safety. Access to formalized knowledge about functions of proteins and small molecules is crucial for rationalization of the drug development process, and scientific publications are the main source of this knowledge. Protein knowledge networks capturing protein functions, protein-protein relations and organization of proteins in complex cellular sub-systems are making their way into modern drug discovery. Chemical networks representing multiple aspects of chemical functional information integrated into a protein systems biology network is even more advanced and promising paradigm. AREAS COVERED IN THIS REVIEW: This review describes utilization of literature-derived protein and chemical functional knowledge bases in drug development. WHAT THE READER WILL GAIN: Readers will gain an understanding of how integrated protein and chemical knowledge networks can be used for understanding and building the models of cellular events, disease mechanisms, and drug actions, finding biomarkers of drug efficacy and safety, as well as interpretation of high-throughput gene expression, proteomic and metabolomic experiments. TAKE HOME MESSAGE: Integrated literature-derived protein and chemical knowledge bases can rationalize many aspects of drug development process including drug repositioning and biomarker design.

8.
BMC Bioinformatics ; 8: 243, 2007 Jul 10.
Article in English | MEDLINE | ID: mdl-17620146

ABSTRACT

BACKGROUND: Uncovering cellular roles of a protein is a task of tremendous importance and complexity that requires dedicated experimental work as well as often sophisticated data mining and processing tools. Protein functions, often referred to as its annotations, are believed to manifest themselves through topology of the networks of inter-proteins interactions. In particular, there is a growing body of evidence that proteins performing the same function are more likely to interact with each other than with proteins with other functions. However, since functional annotation and protein network topology are often studied separately, the direct relationship between them has not been comprehensively demonstrated. In addition to having the general biological significance, such demonstration would further validate the data extraction and processing methods used to compose protein annotation and protein-protein interactions datasets. RESULTS: We developed a method for automatic extraction of protein functional annotation from scientific text based on the Natural Language Processing (NLP) technology. For the protein annotation extracted from the entire PubMed, we evaluated the precision and recall rates, and compared the performance of the automatic extraction technology to that of manual curation used in public Gene Ontology (GO) annotation. In the second part of our presentation, we reported a large-scale investigation into the correspondence between communities in the literature-based protein networks and GO annotation groups of functionally related proteins. We found a comprehensive two-way match: proteins within biological annotation groups form significantly denser linked network clusters than expected by chance and, conversely, densely linked network communities exhibit a pronounced non-random overlap with GO groups. We also expanded the publicly available GO biological process annotation using the relations extracted by our NLP technology. An increase in the number and size of GO groups without any noticeable decrease of the link density within the groups indicated that this expansion significantly broadens the public GO annotation without diluting its quality. We revealed that functional GO annotation correlates mostly with clustering in a physical interaction protein network, while its overlap with indirect regulatory network communities is two to three times smaller. CONCLUSION: Protein functional annotations extracted by the NLP technology expand and enrich the existing GO annotation system. The GO functional modularity correlates mostly with the clustering in the physical interaction network, suggesting that the essential role of structural organization maintained by these interactions. Reciprocally, clustering of proteins in physical interaction networks can serve as an evidence for their functional similarity.


Subject(s)
Computational Biology/methods , Databases, Genetic/classification , Genes , Pattern Recognition, Automated/methods , Proteins/physiology , Cluster Analysis , Computational Biology/standards , Databases, Genetic/standards , Databases, Protein , Information Storage and Retrieval , Natural Language Processing , Pattern Recognition, Automated/standards , Protein Interaction Mapping , PubMed , Reproducibility of Results , Terminology as Topic
9.
J Bioinform Comput Biol ; 5(2B): 429-56, 2007 Apr.
Article in English | MEDLINE | ID: mdl-17636854

ABSTRACT

Microarray-based characterization of tissues, cellular and disease states, and environmental condition and treatment responses provides genome-wide snapshots containing large amounts of invaluable information. However, the lack of inherent structure within the data and strong noise make extracting and interpreting this information and formulating and prioritizing domain relevant hypotheses difficult tasks. Integration with different types of biological data is required to place the expression measurements into a biologically meaningful context. A few approaches in microarray data interpretation are discussed with the emphasis on the use of molecular network information. Statistical procedures are demonstrated that superimpose expression data onto the transcription regulation network mined from scientific literature and aim at selecting transcription regulators with significant patterns of expression changes downstream. Tests are suggested that take into account network topology and signs of transcription regulation effects. The approaches are illustrated using two different expression datasets, the performance is compared, and biological relevance of the predictions is discussed.


Subject(s)
Algorithms , Gene Expression Profiling/methods , Models, Biological , Oligonucleotide Array Sequence Analysis/methods , Proteome/metabolism , Signal Transduction/physiology , Transcription, Genetic/physiology , Computer Simulation
10.
BMC Bioinformatics ; 7: 171, 2006 Mar 24.
Article in English | MEDLINE | ID: mdl-16563163

ABSTRACT

BACKGROUND: Scientific literature is a source of the most reliable and comprehensive knowledge about molecular interaction networks. Formalization of this knowledge is necessary for computational analysis and is achieved by automatic fact extraction using various text-mining algorithms. Most of these techniques suffer from high false positive rates and redundancy of the extracted information. The extracted facts form a large network with no pathways defined. RESULTS: We describe the methodology for automatic curation of Biological Association Networks (BANs) derived by a natural language processing technology called Medscan. The curated data is used for automatic pathway reconstruction. The algorithm for the reconstruction of signaling pathways is also described and validated by comparison with manually curated pathways and tissue-specific gene expression profiles. CONCLUSION: Biological Association Networks extracted by MedScan technology contain sufficient information for constructing thousands of mammalian signaling pathways for multiple tissues. The automatically curated MedScan data is adequate for automatic generation of good quality signaling networks. The automatically generated Regulome pathways and manually curated pathways used for their validation are available free in the ResNetCore database from Ariadne Genomics, Inc. 1. The pathways can be viewed and analyzed through the use of a free demo version of PathwayStudio software. The Medscan technology is also available for evaluation using the free demo version of PathwayStudio software.


Subject(s)
Databases, Bibliographic , Natural Language Processing , Periodicals as Topic , Protein Interaction Mapping/methods , Proteins/classification , Proteins/metabolism , Signal Transduction/physiology , Information Storage and Retrieval/methods , Software
11.
Bioinformatics ; 20(5): 604-11, 2004 Mar 22.
Article in English | MEDLINE | ID: mdl-15033866

ABSTRACT

MOTIVATION: The living cell is a complex machine that depends on the proper functioning of its numerous parts, including proteins. Understanding protein functions and how they modify and regulate each other is the next great challenge for life-sciences researchers. The collective knowledge about protein functions and pathways is scattered throughout numerous publications in scientific journals. Bringing the relevant information together becomes a bottleneck in a research and discovery process. The volume of such information grows exponentially, which renders manual curation impractical. As a viable alternative, automated literature processing tools could be employed to extract and organize biological data into a knowledge base, making it amenable to computational analysis and data mining. RESULTS: We present MedScan, a completely automated natural language processing-based information extraction system. We have used MedScan to extract 2976 interactions between human proteins from MEDLINE abstracts dated after 1988. The precision of the extracted information was found to be 91%. Comparison with the existing protein interaction databases BIND and DIP revealed that 96% of extracted information is novel. The recall rate of MedScan was found to be 21%. Additional experiments with MedScan suggest that MEDLINE is a unique source of diverse protein function information, which can be extracted in a completely automated way with a reasonably high precision. Further directions of the MedScan technology improvement are discussed. AVAILABILITY: MedScan is available for commercial licensing from Ariadne Genomics, Inc.


Subject(s)
Artificial Intelligence , Databases, Protein , Information Storage and Retrieval/methods , MEDLINE , Natural Language Processing , Protein Interaction Mapping/methods , Proteins/metabolism , Abstracting and Indexing , Database Management Systems , Humans , Pattern Recognition, Automated , Periodicals as Topic , Reproducibility of Results , Semantics , Sensitivity and Specificity , Vocabulary, Controlled
12.
J Am Med Inform Assoc ; 11(3): 174-8, 2004.
Article in English | MEDLINE | ID: mdl-14764613

ABSTRACT

OBJECTIVE: The aim of this study was to develop a practical and efficient protein identification system for biomedical corpora. DESIGN: The developed system, called ProtScan, utilizes a carefully constructed dictionary of mammalian proteins in conjunction with a specialized tokenization algorithm to identify and tag protein name occurrences in biomedical texts and also takes advantage of Medline "Name-of-Substance" (NOS) annotation. The dictionaries for ProtScan were constructed in a semi-automatic way from various public-domain sequence databases followed by an intensive expert curation step. MEASUREMENTS: The recall and precision of the system have been determined using 1000 randomly selected and hand-tagged Medline abstracts. RESULTS: The developed system is capable of identifying protein occurrences in Medline abstracts with a 98% precision and 88% recall. It was also found to be capable of processing approximately 300 abstracts per second. Without utilization of NOS annotation, precision and recall were found to be 98.5% and 84%, respectively. CONCLUSION: The developed system appears to be well suited for protein-based Medline indexing and can help to improve biomedical information retrieval. Further approaches to ProtScan's recall improvement also are discussed.


Subject(s)
Information Storage and Retrieval/methods , MEDLINE , Proteins , Terminology as Topic , Abstracting and Indexing , Algorithms , Animals , Dictionaries as Topic , Mammals
13.
Bioinformatics ; 19(16): 2155-7, 2003 Nov 01.
Article in English | MEDLINE | ID: mdl-14594725

ABSTRACT

SUMMARY: PathwayAssist is a software application developed for navigation and analysis of biological pathways, gene regulation networks and protein interaction maps. It comes with the built-in natural language processing module MedScan and the comprehensive database describing more than 100 000 events of regulation, interaction and modification between proteins, cell processes and small molecules. AVAILABILITY: PathwayAssist is available for commercial licensing from Ariadne Genomics, Inc. The light version with limited functionality will be available for free for academic users at www.ariadnegenomics.com/downloads/.


Subject(s)
Database Management Systems , Databases, Protein , Gene Expression Regulation/physiology , Information Storage and Retrieval/methods , Metabolism/physiology , Protein Interaction Mapping/methods , Software , User-Computer Interface , Models, Biological
14.
Bioinformatics ; 19(13): 1699-706, 2003 Sep 01.
Article in English | MEDLINE | ID: mdl-12967967

ABSTRACT

MOTIVATION: The importance of extracting biomedical information from scientific publications is well recognized. A number of information extraction systems for the biomedical domain have been reported, but none of them have become widely used in practical applications. Most proposals to date make rather simplistic assumptions about the syntactic aspect of natural language. There is an urgent need for a system that has broad coverage and performs well in real-text applications. RESULTS: We present a general biomedical domain-oriented NLP engine called MedScan that efficiently processes sentences from MEDLINE abstracts and produces a set of regularized logical structures representing the meaning of each sentence. The engine utilizes a specially developed context-free grammar and lexicon. Preliminary evaluation of the system's performance, accuracy, and coverage exhibited encouraging results. Further approaches for increasing the coverage and reducing parsing ambiguity of the engine, as well as its application for information extraction are discussed.


Subject(s)
Abstracting and Indexing/methods , Artificial Intelligence , Database Management Systems , Information Storage and Retrieval/methods , MEDLINE , Natural Language Processing , Pattern Recognition, Automated , Periodicals as Topic , Subject Headings , Vocabulary, Controlled
SELECTION OF CITATIONS
SEARCH DETAIL
...