Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 61
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
iScience ; 27(6): 109781, 2024 Jun 21.
Artigo em Inglês | MEDLINE | ID: mdl-38868205

RESUMO

Sarcomas are a diverse group of rare malignancies composed of multiple different clinical and molecular subtypes. Due to their rarity and heterogeneity, basic, translational, and clinical research in sarcoma has trailed behind that of other cancers. Outcomes for patients remain generally poor due to an incomplete understanding of disease biology and a lack of novel therapies. To address some of the limitations impeding preclinical sarcoma research, we have developed Sarcoma_CellMinerCDB, a publicly available interactive tool that merges publicly available sarcoma cell line data and newly generated omics data to create a comprehensive database of genomic, transcriptomic, methylomic, proteomic, metabolic, and pharmacologic data on 133 annotated sarcoma cell lines. The reproducibility, functionality, biological relevance, and therapeutic applications of Sarcoma_CellMinerCDB described herein are powerful tools to address and generate biological questions and test hypotheses for translational research. Sarcoma_CellMinerCDB (https://discover.nci.nih.gov/SarcomaCellMinerCDB) aims to contribute to advancing the preclinical study of sarcoma.

2.
ArXiv ; 2024 May 14.
Artigo em Inglês | MEDLINE | ID: mdl-38800657

RESUMO

Drug development is a lengthy process with a high failure rate. Increasingly, machine learning is utilized to facilitate the drug development processes. These models aim to enhance our understanding of drug characteristics, including their activity in biological contexts. However, a major challenge in drug response (DR) prediction is model interpretability as it aids in the validation of findings. This is important in biomedicine, where models need to be understandable in comparison with established knowledge of drug interactions with proteins. drGAT, a graph deep learning model, leverages a heterogeneous graph composed of relationships between proteins, cell lines, and drugs. drGAT is designed with two objectives: DR prediction as a binary sensitivity prediction and elucidation of drug mechanism from attention coefficients. drGAT has demonstrated superior performance over existing models, achieving 78% accuracy (and precision), and 76% F1 score for 269 DNA-damaging compounds of the NCI60 drug response dataset. To assess the model's interpretability, we conducted a review of drug-gene co-occurrences in Pubmed abstracts in comparison to the top 5 genes with the highest attention coefficients for each drug. We also examined whether known relationships were retained in the model by inspecting the neighborhoods of topoisomerase-related drugs. For example, our model retained TOP1 as a highly weighted predictive feature for irinotecan and topotecan, in addition to other genes that could potentially be regulators of the drugs. Our method can be used to accurately predict sensitivity to drugs and may be useful in the identification of biomarkers relating to the treatment of cancer patients.

3.
Nat Methods ; 21(3): 531-540, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38279009

RESUMO

Analysis across a growing number of single-cell perturbation datasets is hampered by poor data interoperability. To facilitate development and benchmarking of computational methods, we collect a set of 44 publicly available single-cell perturbation-response datasets with molecular readouts, including transcriptomics, proteomics and epigenomics. We apply uniform quality control pipelines and harmonize feature annotations. The resulting information resource, scPerturb, enables development and testing of computational methods, and facilitates comparison and integration across datasets. We describe energy statistics (E-statistics) for quantification of perturbation effects and significance testing, and demonstrate E-distance as a general distance measure between sets of single-cell expression profiles. We illustrate the application of E-statistics for quantifying similarity and efficacy of perturbations. The perturbation-response datasets and E-statistics computation software are publicly available at scperturb.org. This work provides an information resource for researchers working with single-cell perturbation data and recommendations for experimental design, including optimal cell counts and read depth.


Assuntos
Proteômica , Software , Perfilação da Expressão Gênica/métodos , Epigenômica , Análise de Célula Única
4.
PLoS One ; 18(8): e0285339, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37585474

RESUMO

cyjShiny is an open-source R package that allows users to embed network visualization into Shiny apps and R Markdown documents. cyjShiny (https://github.com/cytoscape/cyjShiny) builds on the cytoscape.js Javascript graph library. Additionally, the package provides helper functions to convert common R data representations (e.g., data.frame) into forms compatible with cytoscape.js.


Assuntos
Bibliotecas , Software
5.
J Proteome Res ; 22(9): 2847-2859, 2023 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-37555633

RESUMO

The ongoing pandemic of the coronavirus disease 2019 (COVID-19) caused by the severe acute respiratory syndrome coronavirus 2 still has limited treatment options. Our understanding of the molecular dysregulations that occur in response to infection remains incomplete. We developed a web application COVIDpro (https://www.guomics.com/covidPro/) that includes proteomics data obtained from 41 original studies conducted in 32 hospitals worldwide, involving 3077 patients and covering 19 types of clinical specimens, predominantly plasma and serum. The data set encompasses 53 protein expression matrices, comprising a total of 5434 samples and 14,403 unique proteins. We identified a panel of proteins that exhibit significant dysregulation, enabling the classification of COVID-19 patients into severe and non-severe disease categories. The proteomic signatures achieved promising results in distinguishing severe cases, with a mean area under the curve of 0.87 and accuracy of 0.80 across five independent test sets. COVIDpro serves as a valuable resource for testing hypotheses and exploring potential targets for novel treatments in COVID-19 patients.


Assuntos
COVID-19 , Humanos , Proteômica , SARS-CoV-2
6.
Mol Cell Proteomics ; 22(8): 100602, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37343696

RESUMO

Treatment and relevant targets for breast cancer (BC) remain limited, especially for triple-negative BC (TNBC). We identified 6091 proteins of 76 human BC cell lines using data-independent acquisition (DIA). Integrating our proteomic findings with prior multi-omics datasets, we found that including proteomics data improved drug sensitivity predictions and provided insights into the mechanisms of action. We subsequently profiled the proteomic changes in nine cell lines (five TNBC and four non-TNBC) treated with EGFR/AKT/mTOR inhibitors. In TNBC, metabolism pathways were dysregulated after EGFR/mTOR inhibitor treatment, while RNA modification and cell cycle pathways were affected by AKT inhibitor. This systematic multi-omics and in-depth analysis of the proteome of BC cells can help prioritize potential therapeutic targets and provide insights into adaptive resistance in TNBC.


Assuntos
Transdução de Sinais , Neoplasias de Mama Triplo Negativas , Humanos , Proteínas Proto-Oncogênicas c-akt/metabolismo , Proteômica , Proliferação de Células , Linhagem Celular Tumoral , Resistencia a Medicamentos Antineoplásicos/genética , Neoplasias de Mama Triplo Negativas/metabolismo , Receptores ErbB/metabolismo
7.
Cancer Res ; 83(12): 1941-1952, 2023 06 15.
Artigo em Inglês | MEDLINE | ID: mdl-37140427

RESUMO

Major advances have been made in the field of precision medicine for treating cancer. However, many open questions remain that need to be answered to realize the goal of matching every patient with cancer to the most efficacious therapy. To facilitate these efforts, we have developed CellMinerCDB: National Center for Advancing Translational Sciences (NCATS; https://discover.nci.nih.gov/rsconnect/cellminercdb_ncats/), which makes available activity information for 2,675 drugs and compounds, including multiple nononcology drugs and 1,866 drugs and compounds unique to the NCATS. CellMinerCDB: NCATS comprises 183 cancer cell lines, with 72 unique to NCATS, including some from previously understudied tissues of origin. Multiple forms of data from different institutes are integrated, including single and combination drug activity, DNA copy number, methylation and mutation, transcriptome, protein levels, histone acetylation and methylation, metabolites, CRISPR, and miscellaneous signatures. Curation of cell lines and drug names enables cross-database (CDB) analyses. Comparison of the datasets is made possible by the overlap between cell lines and drugs across databases. Multiple univariate and multivariate analysis tools are built-in, including linear regression and LASSO. Examples have been presented here for the clinical topoisomerase I (TOP1) inhibitors topotecan and irinotecan/SN-38. This web application provides both substantial new data and significant pharmacogenomic integration, allowing exploration of interrelationships. SIGNIFICANCE: CellMinerCDB: NCATS provides activity information for 2,675 drugs in 183 cancer cell lines and analysis tools to facilitate pharmacogenomic research and to identify determinants of response.


Assuntos
National Center for Advancing Translational Sciences (U.S.) , Neoplasia de Células Basais , Estados Unidos , Humanos , Farmacogenética , Linhagem Celular Tumoral , Bases de Dados Factuais , Irinotecano , Internet
8.
Commun Biol ; 6(1): 462, 2023 04 27.
Artigo em Inglês | MEDLINE | ID: mdl-37106127

RESUMO

The interactions between tumor intrinsic processes and immune checkpoints can mediate immune evasion by cancer cells and responses to immunotherapy. It is, however, challenging to identify functional interactions due to the prohibitively complex molecular landscape of the tumor-immune interfaces. We address this challenge with a statistical analysis framework, immuno-oncology gene interaction maps (ImogiMap). ImogiMap quantifies and statistically validates tumor-immune checkpoint interactions based on their co-associations with immune-associated phenotypes. The outcome is a catalog of tumor-immune checkpoint interaction maps for diverse immune-associated phenotypes. Applications of ImogiMap recapitulate the interaction of SERPINB9 and immune checkpoints with interferon gamma (IFNγ) expression. Our analyses suggest that CD86-CD70 and CD274-CD70 immunoregulatory interactions are significantly associated with IFNγ expression in uterine corpus endometrial carcinoma and basal-like breast cancer, respectively. The open-source ImogiMap software and user-friendly web application will enable future applications of ImogiMap. Such applications may guide the discovery of previously unknown tumor-immune interactions and immunotherapy targets.


Assuntos
Neoplasias , Humanos , Neoplasias/genética , Neoplasias/terapia , Imunoterapia , Interferon gama/genética
9.
Bioinformatics ; 39(3)2023 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-36897014

RESUMO

SUMMARY: The systems biology graphical notation (SBGN) has become the de facto standard for the graphical representation of molecular maps. Having rapid and easy access to the content of large collections of maps is necessary to perform semantic or graph-based analysis of these resources. To this end, we propose StonPy, a new tool to store and query SBGN maps in a Neo4j graph database. StonPy notably includes a data model that takes into account all three SBGN languages and a completion module to automatically build valid SBGN maps from query results. StonPy is built as a library that can be integrated into other software and offers a command-line interface that allows users to easily perform all operations. AVAILABILITY AND IMPLEMENTATION: StonPy is implemented in Python 3 under a GPLv3 license. Its code and complete documentation are freely available from https://github.com/adrienrougny/stonpy. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Software , Biologia de Sistemas , Biologia de Sistemas/métodos , Bases de Dados Factuais , Idioma , Documentação
10.
Front Immunol ; 14: 1282859, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38414974

RESUMO

Introduction: The COVID-19 Disease Map project is a large-scale community effort uniting 277 scientists from 130 Institutions around the globe. We use high-quality, mechanistic content describing SARS-CoV-2-host interactions and develop interoperable bioinformatic pipelines for novel target identification and drug repurposing. Methods: Extensive community work allowed an impressive step forward in building interfaces between Systems Biology tools and platforms. Our framework can link biomolecules from omics data analysis and computational modelling to dysregulated pathways in a cell-, tissue- or patient-specific manner. Drug repurposing using text mining and AI-assisted analysis identified potential drugs, chemicals and microRNAs that could target the identified key factors. Results: Results revealed drugs already tested for anti-COVID-19 efficacy, providing a mechanistic context for their mode of action, and drugs already in clinical trials for treating other diseases, never tested against COVID-19. Discussion: The key advance is that the proposed framework is versatile and expandable, offering a significant upgrade in the arsenal for virus-host interactions and other complex pathologies.


Assuntos
COVID-19 , Humanos , SARS-CoV-2 , Reposicionamento de Medicamentos , Biologia de Sistemas , Simulação por Computador
11.
bioRxiv ; 2022 Sep 30.
Artigo em Inglês | MEDLINE | ID: mdl-36203550

RESUMO

Background: The ongoing pandemic of the coronavirus disease 2019 (COVID-19) caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) still has limited treatment options partially due to our incomplete understanding of the molecular dysregulations of the COVID-19 patients. We aimed to generate a repository and data analysis tools to examine the modulated proteins underlying COVID-19 patients for the discovery of potential therapeutic targets and diagnostic biomarkers. Methods: We built a web server containing proteomic expression data from COVID-19 patients with a toolset for user-friendly data analysis and visualization. The web resource covers expert-curated proteomic data from COVID-19 patients published before May 2022. The data were collected from ProteomeXchange and from select publications via PubMed searches and aggregated into a comprehensive dataset. Protein expression by disease subgroups across projects was compared by examining differentially expressed proteins. We also visualize differentially expressed pathways and proteins. Moreover, circulating proteins that differentiated severe cases were nominated as predictive biomarkers. Findings: We built and maintain a web server COVIDpro ( https://www.guomics.com/covidPro/ ) containing proteomics data generated by 41 original studies from 32 hospitals worldwide, with data from 3077 patients covering 19 types of clinical specimens, the majority from plasma and sera. 53 protein expression matrices were collected, for a total of 5434 samples and 14,403 unique proteins. Our analyses showed that the lipopolysaccharide-binding protein, as identified in the majority of the studies, was highly expressed in the blood samples of patients with severe disease. A panel of significantly dysregulated proteins was identified to separate patients with severe disease from non-severe disease. Classification of severe disease based on these proteomic signatures on five test sets reached a mean AUC of 0.87 and ACC of 0.80. Interpretation: COVIDpro is an online database with an integrated analysis toolkit. It is a unique and valuable resource for testing hypotheses and identifying proteins or pathways that could be targeted by new treatments of COVID-19 patients. Funding: National Key R&D Program of China: Key PDPM technologies (2021YFA1301602, 2021YFA1301601, 2021YFA1301603), Zhejiang Provincial Natural Science Foundation for Distinguished Young Scholars (LR19C050001), Hangzhou Agriculture and Society Advancement Program (20190101A04), National Natural Science Foundation of China (81972492) and National Science Fund for Young Scholars (21904107), National Resource for Network Biology (NRNB) from the National Institute of General Medical Sciences (NIGMS-P41 GM103504). Research in context: Evidence before this study: Although an increasing number of therapies against COVID-19 are being developed, they are still insufficient, especially with the rise of new variants of concern. This is partially due to our incomplete understanding of the disease’s mechanisms. As data have been collected worldwide, several questions are now worth addressing via meta-analyses. Most COVID-19 drugs function by targeting or affecting proteins. Effectiveness and resistance to therapeutics can be effectively assessed via protein measurements. Empowered by mass spectrometry-based proteomics, protein expression has been characterized in a variety of patient specimens, including body fluids (e.g., serum, plasma, urea) and tissue (i.e., formalin-fixed and paraffin-embedded (FFPE)). We expert-curated proteomic expression data from COVID-19 patients published before May 2022, from the largest proteomic data repository ProteomeXhange as well as from literature search engines. Using this resource, a COVID-19 proteome meta-analysis could provide useful insights into the mechanisms of the disease and identify new potential drug targets.Added value of this study: We integrated many published datasets from patients with COVID-19 from 11 nations, with over 3000 patients and more than 5434 proteome measurements. We collected these datasets in an online database, and generated a toolbox to easily explore, analyze, and visualize the data. Next, we used the database and its associated toolbox to identify new proteins of diagnostic and therapeutic value for COVID-19 treatment. In particular, we identified a set of significantly dysregulated proteins for distinguishing severe from non-severe patients using serum samples.Implications of all the available evidence: COVIDpro will support the navigation and analysis of patterns of dysregulated proteins in various COVID-19 clinical specimens for identification and verification of protein biomarkers and potential therapeutic targets.

12.
Cell Rep ; 40(11): 111304, 2022 09 13.
Artigo em Inglês | MEDLINE | ID: mdl-36103824

RESUMO

Therapeutic options for treatment of basal-like breast cancers remain limited. Here, we demonstrate that bromodomain and extra-terminal (BET) inhibition induces an adaptive response leading to MCL1 protein-driven evasion of apoptosis in breast cancer cells. Consequently, co-targeting MCL1 and BET is highly synergistic in breast cancer models. The mechanism of adaptive response to BET inhibition involves the upregulation of lipid synthesis enzymes including the rate-limiting stearoyl-coenzyme A (CoA) desaturase. Changes in lipid synthesis pathway are associated with increases in cell motility and membrane fluidity as well as re-localization and activation of HER2/EGFR. In turn, the HER2/EGFR signaling results in the accumulation of and vulnerability to the inhibition of MCL1. Drug response and genomics analyses reveal that MCL1 copy-number alterations are associated with effective BET and MCL1 co-targeting. The high frequency of MCL1 chromosomal amplifications (>30%) in basal-like breast cancers suggests that BET and MCL1 co-targeting may have therapeutic utility in this aggressive subtype of breast cancer.


Assuntos
Neoplasias da Mama , Neoplasias da Mama/tratamento farmacológico , Neoplasias da Mama/genética , Linhagem Celular Tumoral , Receptores ErbB/metabolismo , Ácidos Graxos , Feminino , Humanos , Lipídeos , Proteína de Sequência 1 de Leucemia de Células Mieloides/metabolismo , Regulação para Cima
13.
Cancer Discov ; 12(6): 1542-1559, 2022 06 02.
Artigo em Inglês | MEDLINE | ID: mdl-35412613

RESUMO

Cancer cells depend on multiple driver alterations whose oncogenic effects can be suppressed by drug combinations. Here, we provide a comprehensive resource of precision combination therapies tailored to oncogenic coalterations that are recurrent across patient cohorts. To generate the resource, we developed Recurrent Features Leveraged for Combination Therapy (REFLECT), which integrates machine learning and cancer informatics algorithms. Using multiomic data, the method maps recurrent coalteration signatures in patient cohorts to combination therapies. We validated the REFLECT pipeline using data from patient-derived xenografts, in vitro drug screens, and a combination therapy clinical trial. These validations demonstrate that REFLECT-selected combination therapies have significantly improved efficacy, synergy, and survival outcomes. In patient cohorts with immunotherapy response markers, DNA repair aberrations, and HER2 activation, we have identified therapeutically actionable and recurrent coalteration signatures. REFLECT provides a resource and framework to design combination therapies tailored to tumor cohorts in data-driven clinical trials and preclinical studies. SIGNIFICANCE: We developed the predictive bioinformatics platform REFLECT and a multiomics- based precision combination therapy resource. The REFLECT-selected therapies lead to significant improvements in efficacy and patient survival in preclinical and clinical settings. Use of REFLECT can optimize therapeutic benefit through selection of drug combinations tailored to molecular signatures of tumors. See related commentary by Pugh and Haibe-Kains, p. 1416. This article is highlighted in the In This Issue feature, p. 1397.


Assuntos
Neoplasias , Oncogenes , Carcinogênese , Biologia Computacional/métodos , Humanos , Imunoterapia , Neoplasias/tratamento farmacológico , Neoplasias/genética , Neoplasias/patologia
14.
STAR Protoc ; 2(4): 100955, 2021 12 17.
Artigo em Inglês | MEDLINE | ID: mdl-34877547

RESUMO

CausalPath (causalpath.org) evaluates proteomic measurements against prior knowledge of biological pathways and infers causality between changes in measured features, such as global protein and phospho-protein levels. It uses pathway resources to determine potential causality between observable omic features, which are called prior relations. The subset of the prior relations that are supported by the proteomic profiles are reported and evaluated for statistical significance. The end result is a network model of signaling that explains the patterns observed in the experimental dataset. For complete details on the use and execution of this protocol, please refer to Babur et al. (2021).


Assuntos
Mapeamento de Interação de Proteínas/métodos , Proteínas , Proteômica/métodos , Transdução de Sinais/fisiologia , Causalidade , Bases de Dados de Proteínas , Humanos , Proteínas/metabolismo , Proteínas/fisiologia , Software
15.
Elife ; 102021 12 03.
Artigo em Inglês | MEDLINE | ID: mdl-34860157

RESUMO

Making the knowledge contained in scientific papers machine-readable and formally computable would allow researchers to take full advantage of this information by enabling integration with other knowledge sources to support data analysis and interpretation. Here we describe Biofactoid, a web-based platform that allows scientists to specify networks of interactions between genes, their products, and chemical compounds, and then translates this information into a representation suitable for computational analysis, search and discovery. We also report the results of a pilot study to encourage the wide adoption of Biofactoid by the scientific community.


Assuntos
Biologia Computacional/métodos , Genômica/métodos , Biologia Computacional/instrumentação , Bases de Dados Factuais , Genômica/instrumentação , Projetos Piloto
16.
J Integr Bioinform ; 18(3)2021 Jun 08.
Artigo em Inglês | MEDLINE | ID: mdl-34098590

RESUMO

People who are engineering biological organisms often find it useful to communicate in diagrams, both about the structure of the nucleic acid sequences that they are engineering and about the functional relationships between sequence features and other molecular species. Some typical practices and conventions have begun to emerge for such diagrams. The Synthetic Biology Open Language Visual (SBOL Visual) has been developed as a standard for organizing and systematizing such conventions in order to produce a coherent language for expressing the structure and function of genetic designs. This document details version 2.3 of SBOL Visual, which builds on the prior SBOL Visual 2.2 in several ways. First, the specification now includes higher-level "interactions with interactions," such as an inducer molecule stimulating a repression interaction. Second, binding with a nucleic acid backbone can be shown by overlapping glyphs, as with other molecular complexes. Finally, a new "unspecified interaction" glyph is added for visualizing interactions whose nature is unknown, the "insulator" glyph is deprecated in favor of a new "inert DNA spacer" glyph, and the polypeptide region glyph is recommended for showing 2A sequences.


Assuntos
Linguagens de Programação , Biologia Sintética , Humanos , Idioma
17.
Patterns (N Y) ; 2(6): 100257, 2021 Jun 11.
Artigo em Inglês | MEDLINE | ID: mdl-34179843

RESUMO

We present a computational method to infer causal mechanisms in cell biology by analyzing changes in high-throughput proteomic profiles on the background of prior knowledge captured in biochemical reaction knowledge bases. The method mimics a biologist's traditional approach of explaining changes in data using prior knowledge but does this at the scale of hundreds of thousands of reactions. This is a specific example of how to automate scientific reasoning processes and illustrates the power of mapping from experimental data to prior knowledge via logic programming. The identified mechanisms can explain how experimental and physiological perturbations, propagating in a network of reactions, affect cellular responses and their phenotypic consequences. Causal pathway analysis is a powerful and flexible discovery tool for a wide range of cellular profiling data types and biological questions. The automated causation inference tool, as well as the source code, are freely available at http://causalpath.org.

18.
Nucleic Acids Res ; 49(D1): D1083-D1093, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33196823

RESUMO

CellMiner Cross-Database (CellMinerCDB, discover.nci.nih.gov/cellminercdb) allows integration and analysis of molecular and pharmacological data within and across cancer cell line datasets from the National Cancer Institute (NCI), Broad Institute, Sanger/MGH and MD Anderson Cancer Center (MDACC). We present CellMinerCDB 1.2 with updates to datasets from NCI-60, Broad Cancer Cell Line Encyclopedia and Sanger/MGH, and the addition of new datasets, including NCI-ALMANAC drug combination, MDACC Cell Line Project proteomic, NCI-SCLC DNA copy number and methylation data, and Broad methylation, genetic dependency and metabolomic datasets. CellMinerCDB (v1.2) includes several improvements over the previously published version: (i) new and updated datasets; (ii) support for pattern comparisons and multivariate analyses across data sources; (iii) updated annotations with drug mechanism of action information and biologically relevant multigene signatures; (iv) analysis speedups via caching; (v) a new dataset download feature; (vi) improved visualization of subsets of multiple tissue types; (vii) breakdown of univariate associations by tissue type; and (viii) enhanced help information. The curation and common annotations (e.g. tissues of origin and identifiers) provided here across pharmacogenomic datasets increase the utility of the individual datasets to address multiple researcher question types, including data reproducibility, biomarker discovery and multivariate analysis of drug activity.


Assuntos
Biologia Computacional/métodos , Bases de Dados Factuais , Neoplasias/metabolismo , Farmacogenética/métodos , Proteômica/métodos , Linhagem Celular Tumoral , Curadoria de Dados/métodos , Mineração de Dados/métodos , Tratamento Farmacológico/métodos , Genômica/métodos , Humanos , Internet , Neoplasias/tratamento farmacológico , Neoplasias/genética
19.
Cell Syst ; 12(2): 128-140.e4, 2021 02 17.
Artigo em Inglês | MEDLINE | ID: mdl-33373583

RESUMO

Systematic perturbation of cells followed by comprehensive measurements of molecular and phenotypic responses provides informative data resources for constructing computational models of cell biology. Models that generalize well beyond training data can be used to identify combinatorial perturbations of potential therapeutic interest. Major challenges for machine learning on large biological datasets are to find global optima in a complex multidimensional space and mechanistically interpret the solutions. To address these challenges, we introduce a hybrid approach that combines explicit mathematical models of cell dynamics with a machine-learning framework, implemented in TensorFlow. We tested the modeling framework on a perturbation-response dataset of a melanoma cell line after drug treatments. The models can be efficiently trained to describe cellular behavior accurately. Even though completely data driven and independent of prior knowledge, the resulting de novo network models recapitulate some known interactions. The approach is readily applicable to various kinetic models of cell biology. A record of this paper's Transparent Peer Review process is included in the Supplemental Information.


Assuntos
Biologia Computacional/métodos , Quimioterapia Combinada/métodos , Aprendizado de Máquina/normas , Neoplasias/terapia , Humanos
20.
Cell Rep Methods ; 1(2): 100039, 2021 06 21.
Artigo em Inglês | MEDLINE | ID: mdl-35475239

RESUMO

Patient-derived cell lines are often used in pre-clinical cancer research, but some cell lines are too different from tumors to be good models. Comparison of genomic and expression profiles can guide the choice of pre-clinical models, but typically not all features are equally relevant. We present TumorComparer, a computational method for comparing cellular profiles with higher weights on functional features of interest. In this pan-cancer application, we compare ∼600 cell lines and ∼8,000 tumor samples of 24 cancer types, using weights to emphasize known oncogenic alterations. We characterize the similarity of cell lines and tumors within and across cancers by using multiple datum types and rank cell lines by their inferred quality as representative models. Beyond the assessment of cell lines, the weighted similarity approach is adaptable to patient stratification in clinical trials and personalized medicine.


Assuntos
Genômica , Neoplasias , Humanos , Linhagem Celular Tumoral , Genômica/métodos , Neoplasias/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...