Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 76
Filter
1.
Article in English | MEDLINE | ID: mdl-38110246

ABSTRACT

An abundance of data, including decades of greenhouse gas (GHG) emission rates, atmospheric concentrations, and global average temperatures, is sufficient to allow a strictly empirical evaluation of the U.S. plan for controlling GHGs. This article presents an analysis, based solely on such data, that shows that the difference between atmospheric GHG levels that will be reached if current trends continue, and levels that would be achieved if the goals of the plan are met-even with worldwide implementation-is inconsequential. Further, the expected globally averaged temperature differences are well within measurement error. The results lend additional support to the argument that any mitigation strategy must include drawdown of atmospheric GHGs. Equally important, a particular drawdown strategy, agrigenomics, offers the opportunity for a revolutionary trifecta: climate change mitigation, food security, and medical advances.

2.
mSystems ; 8(4): e0096122, 2023 08 31.
Article in English | MEDLINE | ID: mdl-37338270

ABSTRACT

Microbes commonly organize into communities consisting of hundreds of species involved in complex interactions with each other. 16S ribosomal RNA (16S rRNA) amplicon profiling provides snapshots that reveal the phylogenies and abundance profiles of these microbial communities. These snapshots, when collected from multiple samples, can reveal the co-occurrence of microbes, providing a glimpse into the network of associations in these communities. However, the inference of networks from 16S data involves numerous steps, each requiring specific tools and parameter choices. Moreover, the extent to which these steps affect the final network is still unclear. In this study, we perform a meticulous analysis of each step of a pipeline that can convert 16S sequencing data into a network of microbial associations. Through this process, we map how different choices of algorithms and parameters affect the co-occurrence network and identify the steps that contribute substantially to the variance. We further determine the tools and parameters that generate robust co-occurrence networks and develop consensus network algorithms based on benchmarks with mock and synthetic data sets. The Microbial Co-occurrence Network Explorer, or MiCoNE (available at https://github.com/segrelab/MiCoNE) follows these default tools and parameters and can help explore the outcome of these combinations of choices on the inferred networks. We envisage that this pipeline could be used for integrating multiple data sets and generating comparative analyses and consensus networks that can guide our understanding of microbial community assembly in different biomes. IMPORTANCE Mapping the interrelationships between different species in a microbial community is important for understanding and controlling their structure and function. The surge in the high-throughput sequencing of microbial communities has led to the creation of thousands of data sets containing information about microbial abundances. These abundances can be transformed into co-occurrence networks, providing a glimpse into the associations within microbiomes. However, processing these data sets to obtain co-occurrence information relies on several complex steps, each of which involves numerous choices of tools and corresponding parameters. These multiple options pose questions about the robustness and uniqueness of the inferred networks. In this study, we address this workflow and provide a systematic analysis of how these choices of tools affect the final network and guidelines on appropriate tool selection for a particular data set. We also develop a consensus network algorithm that helps generate more robust co-occurrence networks based on benchmark synthetic data sets.


Subject(s)
Microbial Consortia , Microbiota , RNA, Ribosomal, 16S/genetics , Microbiota/genetics , Algorithms , High-Throughput Nucleotide Sequencing
4.
Res Sq ; 2020 Oct 29.
Article in English | MEDLINE | ID: mdl-33140039

ABSTRACT

Background: As the SARS-Cov-2/Covid-19 pandemic continues to ravage the world, it is important to understanding the characteristics of its spread and possible correlates for control to develop strategies of response. Methods: Here we show how a simple Susceptible-Infective-Recovered (SIR) model applied to data for eight European countries and the United Kingdom (UK) can be used to forecast the descending limb (post-peak) of confirmed cases and deaths as a function of time, and predict the duration of the pandemic once it has peaked, by estimating and fixing parameters using only characteristics of the ascending limb and the magnitude of the first peak. Results: The predicted and actual case fatality ratio, or number of deaths per million population from the start of the pandemic to when daily deaths number less than five for the first time, was lowest in Norway (predicted: 44 ± 5 deaths/million; actual: 36 deaths/million) and highest for the United Kingdom (predicted: 578 +/- 65 deaths/million; actual 621 deaths/million). The inferred pandemic characteristics separated into two distinct groups: those that are largely invariant across countries, and those that are highly variable. Among the former is the infective period, TL = 16.3 ± 2.7 days, the average time between contacts, TR = 3.8+/- 0.5 days and the average number of contacts while infective R = 4.4 +/- 0.5. In contrast, there is a highly variable time lag TD between the peak in the daily number of confirmed cases and the peak in the daily number of deaths, ranging from lows of TD = 2,4 days for Denmark and Italy respectively, to highs of TD = 12, 15 for Germany and Norway respectively. The mortality fraction, or ratio of deaths to confirmed cases, was also highly variable, ranging from low values 3%, 5% and 5% for Norway, Denmark and Germany respectively, to high values of 18%, 20% and 21% for Sweden, France, and the UK respectively. The probability of mortality rather than recovery was a significant correlate of the duration of the pandemic, defined as the time from 12/31/2019 to when the number of daily deaths fell below 5. Finally, we observed a small but detectable effect of average temperature on the probability α of infection per contact, with higher temperatures associated with lower infectivity. Conclusions: Our simple model captures the dynamics of the initial stages of the pandemic, from its exponential beginning to the first peak and beyond, with remarkable precision. As with all epidemiological analyses, unanticipated behavioral changes will result in deviations between projection and observation. This is abundantly clear for the current pandemic. Nonetheless, accurate short-term projections are possible, and the methodology we present is a useful addition to the epidemiologist's armamentarium. Our predictions assume that control measures such as lockdown, social distancing, use of masks etc. remain the same post-peak as before peak. Consequently, deviations from our predictions are a measure of the extent to which loosening of control measures have impacted case-loads and deaths since the first peak and initial decline in daily cases and deaths. Our findings suggest that the two key parameters to control and reduce the impact of a developing pandemic are the infective period and the mortality fraction, which are achievable by early case identification, contact tracing and quarantine (which would reduce the former) and improving quality of care for identified cases (which would reduce the latter).

5.
medRxiv ; 2020 Oct 15.
Article in English | MEDLINE | ID: mdl-32511530

ABSTRACT

Understanding the characteristics of the SARS-Cov-2/Covid-19 pandemic is central to developing control strategies. Here we show how a simple Susceptible-Infective-Recovered (SIR) model applied to data for eight European countries and the United Kingdom (UK) can be used to forecast the descending limb (post-peak) of confirmed cases and deaths as a function of time, and predict the duration of the pandemic once it has peaked, by estimating and fixing parameters using only characteristics of the ascending limb and the magnitude of the first peak. As with all epidemiological analyses, unanticipated behavioral changes will result in deviations between projection and observation. This is abundantly clear for the current pandemic. Nonetheless, accurate short-term projections are possible, and the methodology we present is a useful addition to the epidemiologist's armamentarium. Since our predictions assume that control measures such as lockdown, social distancing, use of masks etc. remain the same post-peak as before peak, deviations from our predictions are a measure of the extent to which loosening of control measures have impacted case-loads and deaths since the first peak and initial decline in daily cases and deaths. The predicted and actual case fatality ratio, or number of deaths per million population from the start of the pandemic to when daily deaths number less than five for the first time, was lowest in Norway (pred: 44 ± 5 deaths/million; actual: 36 deaths/million) and highest for the United Kingdom (pred: 578 +/- 65 deaths/million; actual 621 deaths/million). The inferred pandemic characteristics separated into two distinct groups: those that are largely invariant across countries, and those that are highly variable. Among the former is the infective period, T L ( T L ¯ = 16.3 ± 2.7  days ) ; the average time between contacts, T R ( T R ¯ = 3.8 ± 0.5 ) days and the average number of contacts while infective, R ( R ¯ = 4.4 ± 0.5 ) . In contrast, there is a highly variable time lag T D between the peak in the daily number of confirmed cases and the peak in the daily number of deaths, ranging from a low of T D = 2,4 days for Denmark and Italy respectively, to highs of T D = 12, 15 for Germany and Norway respectively. The mortality fraction, or ratio of deaths to confirmed cases, was also highly variable, ranging from low values 3%, 5% and 5% for Norway, Denmark and Germany respectively, to high values of 18%, 20% and 21% for Sweden, France, and the UK respectively. The probability of mortality rather than recovery was a significant correlate of the duration of the pandemic, defined as the time from 12/31/2019 to when the number of daily deaths fell below 5. Finally, we observed a small but detectable effect of average temperature on the probability α of infection per contact, with higher temperatures associated with lower infectivity. Policy implications of our findings are also briefly discussed.

6.
Biodes Res ; 2020: 1016207, 2020.
Article in English | MEDLINE | ID: mdl-37849905

ABSTRACT

The long atmospheric residence time of CO2 creates an urgent need to add atmospheric carbon drawdown to CO2 regulatory strategies. Synthetic and systems biology (SSB), which enables manipulation of cellular phenotypes, offers a powerful approach to amplifying and adding new possibilities to current land management practices aimed at reducing atmospheric carbon. The participants (in attendance: Christina Agapakis, George Annas, Adam Arkin, George Church, Robert Cook-Deegan, Charles DeLisi, Dan Drell, Sheldon Glashow, Steve Hamburg, Henry Jacoby, Henry Kelly, Mark Kon, Todd Kuiken, Mary Lidstrom, Mike MacCracken, June Medford, Jerry Melillo, Ron Milo, Pilar Ossorio, Ari Patrinos, Keith Paustian, Kristala Jones Prather, Kent Redford, David Resnik, John Reilly, Richard J. Roberts, Daniel Segre, Susan Solomon, Elizabeth Strychalski, Chris Voigt, Dominic Woolf, Stan Wullschleger, and Xiaohan Yang) identified a range of possibilities by which SSB might help reduce greenhouse gas concentrations and which might also contribute to environmental sustainability and adaptation. These include, among other possibilities, engineering plants to convert CO2 produced by respiration into a stable carbonate, designing plants with an increased root-to-shoot ratio, and creating plants with the ability to self-fertilize. A number of serious ecological and societal challenges must, however, be confronted and resolved before any such application can be fully assessed, realized, and deployed.

7.
Biol Direct ; 14(1): 24, 2019 Nov 29.
Article in English | MEDLINE | ID: mdl-31783901

ABSTRACT

After publication of this article [1], the author brought to our attention that there are some errors in the article.

8.
Biol Direct ; 14(1): 14, 2019 08 20.
Article in English | MEDLINE | ID: mdl-31429783

ABSTRACT

There is growing agreement that the aim of United Nations Framework Convention on Climate Change, which is to avoid dangerous anthropogenic interference with the climate system, is not likely to be met without inclusion of methods to physically remove atmospheric carbon. A number of approaches have been suggested, but the community appears to be silent on the potential of one of the most revolutionary technologies of the current century, systems and synthetic biology (SSB). The potential of SSB to modulate the fast carbon cycle, and thereby mitigate climate change is in itself enormous, but if the history of genomics is any measure, it is also reasonable to expect sizeable economic returns on any investment. More generally, the approach to climate control has been badly unbalanced. The last three decades have seen intense international attention to emission control, with no parallel plan to test, scale and implement carbon removal technologies, including attention to their economic, legal and ethical implications. REVIEWERS: This article was reviewed by Richard Roberts, Aristides Patrinos, and Eugene Koonin, all of whom were nominated by Itai Yanai. For the full reviews, please go to the Reviewers' comments section.


Subject(s)
Air Pollution/prevention & control , Climate Change , Synthetic Biology/methods , Carbon/analysis , Global Warming/prevention & control
9.
BMC Med Genomics ; 9(1): 51, 2016 07 30.
Article in English | MEDLINE | ID: mdl-27475327

ABSTRACT

BACKGROUND: The high cost and the long time required to bring drugs into commerce is driving efforts to repurpose FDA approved drugs-to find new uses for which they weren't intended, and to thereby reduce the overall cost of commercialization, and shorten the lag between drug discovery and availability. We report on the development, testing and application of a promising new approach to repositioning. METHODS: Our approach is based on mining a human functional linkage network for inversely correlated modules of drug and disease gene targets. The method takes account of multiple information sources, including gene mutation, gene expression, and functional connectivity and proximity of within module genes. RESULTS: The method was used to identify candidates for treating breast and prostate cancer. We found that (i) the recall rate for FDA approved drugs for breast (prostate) cancer is 20/20 (10/11), while the rates for drugs in clinical trials were 131/154 and 82/106; (ii) the ROC/AUC performance substantially exceeds that of comparable methods; (iii) preliminary in vitro studies indicate that 5/5 candidates have therapeutic indices superior to that of Doxorubicin in MCF7 and SUM149 cancer cell lines. We briefly discuss the biological plausibility of the candidates at a molecular level in the context of the biological processes that they mediate. CONCLUSIONS: Our method appears to offer promise for the identification of multi-targeted drug candidates that can correct aberrant cellular functions. In particular the computational performance exceeded that of other CMap-based methods, and in vitro experiments indicate that 5/5 candidates have therapeutic indices superior to that of Doxorubicin in MCF7 and SUM149 cancer cell lines. The approach has the potential to provide a more efficient drug discovery pipeline.


Subject(s)
Breast Neoplasms/drug therapy , Computational Biology/methods , Drug Repositioning/methods , Prostatic Neoplasms/drug therapy , Breast Neoplasms/pathology , Cell Line, Tumor , Data Mining , Doxorubicin/pharmacology , Doxorubicin/therapeutic use , Humans , MCF-7 Cells , Male , Prostatic Neoplasms/pathology
10.
PLoS Comput Biol ; 12(4): e1004875, 2016 Apr.
Article in English | MEDLINE | ID: mdl-27081850

ABSTRACT

The complexity of metabolic networks in microbial communities poses an unresolved visualization and interpretation challenge. We address this challenge in the newly expanded version of a software tool for the analysis of biological networks, VisANT 5.0. We focus in particular on facilitating the visual exploration of metabolic interaction between microbes in a community, e.g. as predicted by COMETS (Computation of Microbial Ecosystems in Time and Space), a dynamic stoichiometric modeling framework. Using VisANT's unique metagraph implementation, we show how one can use VisANT 5.0 to explore different time-dependent ecosystem-level metabolic networks. In particular, we analyze the metabolic interaction network between two bacteria previously shown to display an obligate cross-feeding interdependency. In addition, we illustrate how a putative minimal gut microbiome community could be represented in our framework, making it possible to highlight interactions across multiple coexisting species. We envisage that the "symbiotic layout" of VisANT can be employed as a general tool for the analysis of metabolism in complex microbial communities as well as heterogeneous human tissues. VisANT is freely available at: http://visant.bu.edu and COMETS at http://comets.bu.edu.


Subject(s)
Metabolic Networks and Pathways , Microbial Consortia/physiology , Models, Biological , Software , Computational Biology , Computer Graphics , Computer Simulation , Humans , Microbiota/physiology , Systems Biology
11.
Mol Cancer Ther ; 15(1): 184-9, 2016 Jan.
Article in English | MEDLINE | ID: mdl-26625895

ABSTRACT

Adjuvant therapy following breast cancer surgery generally consists of either a course of chemotherapy, if the cancer lacks hormone receptors, or a course of hormonal therapy, otherwise. Here, we report a correlation between adjuvant strategy and mutated pathway patterns. In particular, we find that for breast cancer patients, pathways enriched in nonsynonymous mutations in the chemotherapy group are distinct from those of the hormonal therapy group. We apply a recently developed method that identifies collaborative pathway groups for hormone and chemotherapy patients. A collaborative group of pathways is one in which each member is altered in the same-generally large-number of samples. In particular, we find the following: (i) a chemotherapy group consisting of three pathways and a hormone therapy group consisting of 20, the members of the two groups being mutually exclusive; (ii) each group is highly enriched in breast cancer drivers; and (iii) the pathway groups are correlates of subtype-based therapeutic recommendations. These results suggest that patient profiling using these pathway groups can potentially enable the development of personalized treatment plans that may be more accurate and specific than those currently available.


Subject(s)
Breast Neoplasms/genetics , Breast Neoplasms/metabolism , Mutation , Signal Transduction , Antineoplastic Agents, Hormonal/pharmacology , Antineoplastic Agents, Hormonal/therapeutic use , Breast Neoplasms/drug therapy , Breast Neoplasms/pathology , Chemotherapy, Adjuvant , Cluster Analysis , Computational Biology , Databases, Genetic , Female , Gene Expression Profiling , Humans , Signal Transduction/drug effects
12.
Sci Rep ; 5: 10204, 2015 May 11.
Article in English | MEDLINE | ID: mdl-25961669

ABSTRACT

The number of mutated genes in cancer cells is far larger than the number of mutations that drive cancer. The difficulty this creates for identifying relevant alterations has stimulated the development of various computational approaches to distinguishing drivers from bystanders. We develop and apply an ensemble classifier (EC) machine learning method, which integrates 10 classifiers that are publically available, and apply it to breast and ovarian cancer. In particular we find the following: (1) Using both standard and non-standard metrics, EC almost always outperforms single method classifiers, often by wide margins. (2) Of the 50 highest ranked genes for breast (ovarian) cancer, 34 (30) are associated with other cancers in either the OMIM, CGC or NCG database (P < 10(-22)). (3) Another 10, for both breast and ovarian cancer, have been identified by GWAS studies. (4) Several of the remaining genes--including a protein kinase that regulates the Fra-1 transcription factor which is overexpressed in ER negative breast cancer cells; and Fyn, which is overexpressed in pancreatic and prostate cancer, among others--are biologically plausible. Biological implications are briefly discussed. Source codes and detailed results are available at http://www.visantnet.org/misi/driver_integration.zip.


Subject(s)
Databases, Genetic , Genes, Neoplasm , Machine Learning , Mutation , Neoplasm Proteins , Animals , Humans , Neoplasm Proteins/classification , Neoplasm Proteins/genetics , United States
13.
Article in English | MEDLINE | ID: mdl-26941962

ABSTRACT

A shift of the delicate balance between apoptosis and survival-inducing signals determines the fate of neurons during the development of the central nervous system and its homeostasis throughout adulthood. Both pathways, promoting or protecting from apoptosis, trigger a transcriptional program. We conducted whole-genome expression profiling to decipher the transcriptional regulatory elements controlling the apoptotic/survival switch in cerebellar granule neurons following the induction of apoptosis by serum and potassium deprivation or their rescue by either insulin-like growth factor-1 (Igf1) or pituitary adenylyl cyclase-activating polypeptide (Pacap). Although depending on different upstream signaling pathways, the survival effects of Igf1 and Pacap converged into common transcriptional cascades, thus suggesting the existence of a general transcriptional program underlying neuronal survival.

14.
Nucleic Acids Res ; 41(Web Server issue): W225-31, 2013 Jul.
Article in English | MEDLINE | ID: mdl-23716640

ABSTRACT

With the rapid accumulation of our knowledge on diseases, disease-related genes and drug targets, network-based analysis plays an increasingly important role in systems biology, systems pharmacology and translational science. The new release of VisANT aims to provide new functions to facilitate the convenient network analysis of diseases, therapies, genes and drugs. With improved understanding of the mechanisms of complex diseases and drug actions through network analysis, novel drug methods (e.g., drug repositioning, multi-target drug and combination therapy) can be designed. More specifically, the new update includes (i) integrated search and navigation of disease and drug hierarchies; (ii) integrated disease-gene, therapy-drug and drug-target association to aid the network construction and filtering; (iii) annotation of genes/drugs using disease/therapy information; (iv) prediction of associated diseases/therapies for a given set of genes/drugs using enrichment analysis; (v) network transformation to support construction of versatile network of drugs, genes, diseases and therapies; (vi) enhanced user interface using docking windows to allow easy customization of node and edge properties with build-in legend node to distinguish different node type. VisANT is freely available at: http://visant.bu.edu.


Subject(s)
Disease/genetics , Drug Discovery , Software , Drug Therapy , Genes , Humans , Internet
15.
Methods Mol Biol ; 939: 233-51, 2013.
Article in English | MEDLINE | ID: mdl-23192550

ABSTRACT

A host of data on genetic variation from the Human Genome and International HapMap projects, and advances in high-throughput genotyping technologies, have made genome-wide association (GWA) studies technically feasible. GWA studies help in the discovery and quantification of the genetic components of disease risks, many of which have not been unveiled before and have opened a new avenue to understanding disease, treatment, and prevention. This chapter presents an overview of GWA, an important tool for discovering regions of the genome that harbor common genetic variants to confer susceptibility for various diseases or health outcomes in the post-Human Genome Project era. A tutorial on how to conduct a GWA study and some practical challenges specifically related to the GWA design is presented, followed by a detailed GWA case study involving the identification of loci associated with glioma as an example and an illustration of current technologies.


Subject(s)
Computational Biology/methods , Genome, Human , Genome-Wide Association Study/methods , Genetic Loci , Genetic Markers , Genetic Predisposition to Disease , Genotyping Techniques , HapMap Project , Humans , Linkage Disequilibrium , Meta-Analysis as Topic , Polymorphism, Single Nucleotide , Reproducibility of Results
16.
Biophys J ; 103(7): 1510-7, 2012 Oct 03.
Article in English | MEDLINE | ID: mdl-23062343

ABSTRACT

We demonstrate an accurate, quantitative, and label-free optical technology for high-throughput studies of receptor-ligand interactions, and apply it to TATA binding protein (TBP) interactions with oligonucleotides. We present a simple method to prepare single-stranded and double-stranded DNA microarrays with comparable surface density, ensuring an accurate comparison of TBP activity with both types of DNA. In particular, we find that TBP binds tightly to single-stranded DNA, especially to stretches of polythymine (poly-T), as well as to the traditional TATA box. We further investigate the correlation of TBP activity with various lengths of DNA and find that the number of TBPs bound to DNA increases >7-fold as the oligomer length increases from 9 to 40. Finally, we perform a full human genome analysis and discover that 35.5% of human promoters have poly-T stretches. In summary, we report, for the first time to our knowledge, the activity of TBP with poly-T stretches by presenting an elegant stepwise analysis of multiple techniques: discovery by a novel quantitative detection of microarrays, confirmation by a traditional gel electrophoresis, and a full genome prediction with computational analyses.


Subject(s)
DNA/genetics , DNA/metabolism , TATA-Box Binding Protein/metabolism , Base Sequence , DNA, Single-Stranded/genetics , DNA, Single-Stranded/metabolism , Humans , Poly T/metabolism , Protein Binding , Substrate Specificity , TATA Box
17.
Biol Direct ; 7: 21, 2012 Jul 03.
Article in English | MEDLINE | ID: mdl-22759382

ABSTRACT

BACKGROUND: Molecular markers based on gene expression profiles have been used in experimental and clinical settings to distinguish cancerous tumors in stage, grade, survival time, metastasis, and drug sensitivity. However, most significant gene markers are unstable (not reproducible) among data sets. We introduce a standardized method for representing cancer markers as 2-level hierarchical feature vectors, with a basic gene level as well as a second level of (more stable) pathway markers, for the purpose of discriminating cancer subtypes. This extends standard gene expression arrays with new pathway-level activation features obtained directly from off-the-shelf gene set enrichment algorithms such as GSEA. Such so-called pathway-based expression arrays are significantly more reproducible across datasets. Such reproducibility will be important for clinical usefulness of genomic markers, and augment currently accepted cancer classification protocols. RESULTS: The present method produced more stable (reproducible) pathway-based markers for discriminating breast cancer metastasis and ovarian cancer survival time. Between two datasets for breast cancer metastasis, the intersection of standard significant gene biomarkers totaled 7.47% of selected genes, compared to 17.65% using pathway-based markers; the corresponding percentages for ovarian cancer datasets were 20.65% and 33.33% respectively. Three pathways, consisting of Type_1_diabetes mellitus, Cytokine-cytokine_receptor_interaction and Hedgehog_signaling (all previously implicated in cancer), are enriched in both the ovarian long survival and breast non-metastasis groups. In addition, integrating pathway and gene information, we identified five (ID4, ANXA4, CXCL9, MYLK, FBXL7) and six (SQLE, E2F1, PTTG1, TSTA3, BUB1B, MAD2L1) known cancer genes significant for ovarian and breast cancer respectively. CONCLUSIONS: Standardizing the analysis of genomic data in the process of cancer staging, classification and analysis is important as it has implications for both pre-clinical as well as clinical studies. The paradigm of diagnosis and prediction using pathway-based biomarkers as features can be an important part of the process of biomarker-based cancer analysis, and the resulting canonical (clinically reproducible) biomarkers can be important in standardizing genomic data. We expect that identification of such canonical biomarkers will improve clinical utility of high-throughput datasets for diagnostic and prognostic applications.


Subject(s)
Biomarkers, Tumor/analysis , Breast Neoplasms/diagnosis , Neoplasm Staging/methods , Ovarian Neoplasms/diagnosis , Signal Transduction , Algorithms , Biomarkers, Tumor/metabolism , Breast Neoplasms/genetics , Breast Neoplasms/metabolism , Diabetes Mellitus, Type 1/pathology , Female , Genes, Neoplasm , Hedgehog Proteins/metabolism , Humans , Neoplasm Grading , Neoplasm Metastasis/diagnosis , Ovarian Neoplasms/genetics , Ovarian Neoplasms/metabolism , Prognosis , RNA, Messenger/genetics , RNA, Messenger/metabolism , Receptors, Cytokine/metabolism , Reproducibility of Results , Survival Analysis , Time Factors , Transcriptome
18.
BMC Bioinformatics ; 13: 46, 2012 Mar 23.
Article in English | MEDLINE | ID: mdl-22443377

ABSTRACT

BACKGROUND: Identification of active causal regulators is a crucial problem in understanding mechanism of diseases or finding drug targets. Methods that infer causal regulators directly from primary data have been proposed and successfully validated in some cases. These methods necessarily require very large sample sizes or a mix of different data types. Recent studies have shown that prior biological knowledge can successfully boost a method's ability to find regulators. RESULTS: We present a simple data-driven method, Correlation Set Analysis (CSA), for comprehensively detecting active regulators in disease populations by integrating co-expression analysis and a specific type of literature-derived causal relationships. Instead of investigating the co-expression level between regulators and their regulatees, we focus on coherence of regulatees of a regulator. Using simulated datasets we show that our method performs very well at recovering even weak regulatory relationships with a low false discovery rate. Using three separate real biological datasets we were able to recover well known and as yet undescribed, active regulators for each disease population. The results are represented as a rank-ordered list of regulators, and reveals both single and higher-order regulatory relationships. CONCLUSIONS: CSA is an intuitive data-driven way of selecting directed perturbation experiments that are relevant to a disease population of interest and represent a starting point for further investigation. Our findings demonstrate that combining co-expression analysis on regulatee sets with a literature-derived network can successfully identify causal regulators and help develop possible hypothesis to explain disease progression.


Subject(s)
Computational Biology/methods , Gene Regulatory Networks , Computer Simulation , Female , Humans , Lymphoma, B-Cell/genetics , Metabolic Diseases/genetics , Ovarian Neoplasms/genetics , Sample Size , Transcription, Genetic
19.
PLoS Comput Biol ; 8(2): e1002347, 2012 Feb.
Article in English | MEDLINE | ID: mdl-22346740

ABSTRACT

The cost and time to develop a drug continues to be a major barrier to widespread distribution of medication. Although the genomic revolution appears to have had little impact on this problem, and might even have exacerbated it because of the flood of additional and usually ineffective leads, the emergence of high throughput resources promises the possibility of rapid, reliable and systematic identification of approved drugs for originally unintended uses. In this paper we develop and apply a method for identifying such repositioned drug candidates against breast cancer, myelogenous leukemia and prostate cancer by looking for inverse correlations between the most perturbed gene expression levels in human cancer tissue and the most perturbed expression levels induced by bioactive compounds. The method uses variable gene signatures to identify bioactive compounds that modulate a given disease. This is in contrast to previous methods that use small and fixed signatures. This strategy is based on the observation that diseases stem from failed/modified cellular functions, irrespective of the particular genes that contribute to the function, i.e., this strategy targets the functional signatures for a given cancer. This function-based strategy broadens the search space for the effective drugs with an impressive hit rate. Among the 79, 94 and 88 candidate drugs for breast cancer, myelogenous leukemia and prostate cancer, 32%, 13% and 17% respectively are either FDA-approved/in-clinical-trial drugs, or drugs with suggestive literature evidences, with an FDR of 0.01. These findings indicate that the method presented here could lead to a substantial increase in efficiency in drug discovery and development, and has potential application for the personalized medicine.


Subject(s)
Breast Neoplasms/drug therapy , Drug Repositioning , Gene Expression Profiling/methods , Leukemia, Myeloid/drug therapy , Prostatic Neoplasms/drug therapy , Breast Neoplasms/genetics , Breast Neoplasms/metabolism , Computational Biology/methods , Databases, Factual , Drug Discovery , Female , Humans , Leukemia, Myeloid/genetics , Leukemia, Myeloid/metabolism , Male , Prostatic Neoplasms/genetics , Prostatic Neoplasms/metabolism , Signal Transduction
20.
J Am Med Inform Assoc ; 19(2): 166-70, 2012.
Article in English | MEDLINE | ID: mdl-22101971

ABSTRACT

The National Center for Integrative and Biomedical Informatics (NCIBI) is one of the eight NCBCs. NCIBI supports information access and data analysis for biomedical researchers, enabling them to build computational and knowledge models of biological systems to address the Driving Biological Problems (DBPs). The NCIBI DBPs have included prostate cancer progression, organ-specific complications of type 1 and 2 diabetes, bipolar disorder, and metabolic analysis of obesity syndrome. Collaborating with these and other partners, NCIBI has developed a series of software tools for exploratory analysis, concept visualization, and literature searches, as well as core database and web services resources. Many of our training and outreach initiatives have been in collaboration with the Research Centers at Minority Institutions (RCMI), integrating NCIBI and RCMI faculty and students, culminating each year in an annual workshop. Our future directions include focusing on the TranSMART data sharing and analysis initiative.


Subject(s)
Biomedical Research , Information Dissemination , Integrative Medicine , Medical Informatics , Databases as Topic , Forecasting , Goals , National Institutes of Health (U.S.) , United States
SELECTION OF CITATIONS
SEARCH DETAIL
...