Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Bioinform Adv ; 4(1): vbae036, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38577542

RESUMO

Motivation: Graph representation learning is a family of related approaches that learn low-dimensional vector representations of nodes and other graph elements called embeddings. Embeddings approximate characteristics of the graph and can be used for a variety of machine-learning tasks such as novel edge prediction. For many biomedical applications, partial knowledge exists about positive edges that represent relationships between pairs of entities, but little to no knowledge is available about negative edges that represent the explicit lack of a relationship between two nodes. For this reason, classification procedures are forced to assume that the vast majority of unlabeled edges are negative. Existing approaches to sampling negative edges for training and evaluating classifiers do so by uniformly sampling pairs of nodes. Results: We show here that this sampling strategy typically leads to sets of positive and negative examples with imbalanced node degree distributions. Using representative heterogeneous biomedical knowledge graph and random walk-based graph machine learning, we show that this strategy substantially impacts classification performance. If users of graph machine-learning models apply the models to prioritize examples that are drawn from approximately the same distribution as the positive examples are, then performance of models as estimated in the validation phase may be artificially inflated. We present a degree-aware node sampling approach that mitigates this effect and is simple to implement. Availability and implementation: Our code and data are publicly available at https://github.com/monarch-initiative/negativeExampleSelection.

2.
Mol Cancer Ther ; 23(7): 949-960, 2024 Jul 02.
Artigo em Inglês | MEDLINE | ID: mdl-38507740

RESUMO

The activated B cell (ABC) subset of diffuse large B-cell lymphoma (DLBCL) is characterized by chronic B-cell receptor signaling and associated with poor outcomes when treated with standard therapy. In ABC-DLBCL, MALT1 is a core enzyme that is constitutively activated by stimulation of the B-cell receptor or gain-of-function mutations in upstream components of the signaling pathway, making it an attractive therapeutic target. We discovered a novel small-molecule inhibitor, ABBV-MALT1, that potently shuts down B-cell signaling selectively in ABC-DLBCL preclinical models leading to potent cell growth and xenograft inhibition. We also identified a rational combination partner for ABBV-MALT1 in the BCL2 inhibitor, venetoclax, which when combined significantly synergizes to elicit deep and durable responses in preclinical models. This work highlights the potential of ABBV-MALT1 monotherapy and combination with venetoclax as effective treatment options for patients with ABC-DLBCL.


Assuntos
Sinergismo Farmacológico , Proteína de Translocação 1 do Linfoma de Tecido Linfoide Associado à Mucosa , Proteínas Proto-Oncogênicas c-bcl-2 , Ensaios Antitumorais Modelo de Xenoenxerto , Proteína de Translocação 1 do Linfoma de Tecido Linfoide Associado à Mucosa/antagonistas & inibidores , Proteína de Translocação 1 do Linfoma de Tecido Linfoide Associado à Mucosa/metabolismo , Humanos , Animais , Camundongos , Proteínas Proto-Oncogênicas c-bcl-2/antagonistas & inibidores , Proteínas Proto-Oncogênicas c-bcl-2/metabolismo , Proteínas Proto-Oncogênicas c-bcl-2/genética , Linhagem Celular Tumoral , Sulfonamidas/farmacologia , Sulfonamidas/uso terapêutico , Proliferação de Células/efeitos dos fármacos , Linfoma Difuso de Grandes Células B/tratamento farmacológico , Linfoma Difuso de Grandes Células B/genética , Linfoma Difuso de Grandes Células B/patologia , Compostos Bicíclicos Heterocíclicos com Pontes/farmacologia , Compostos Bicíclicos Heterocíclicos com Pontes/uso terapêutico , Antineoplásicos/farmacologia , Antineoplásicos/uso terapêutico , Modelos Animais de Doenças
3.
Cell Rep Med ; 4(8): 101158, 2023 08 15.
Artigo em Inglês | MEDLINE | ID: mdl-37586321

RESUMO

Autologous anti-CD19 chimeric antigen receptor T cell (CAR T) therapy is highly effective in relapsed/refractory large B cell lymphoma (rrLBCL) but is associated with toxicities that delay recovery. While the biological mechanisms of cytokine release syndrome and neurotoxicity have been investigated, the pathophysiology is poorly understood for prolonged cytopenia, defined as grade ≥3 cytopenia lasting beyond 30 days after CAR T infusion. We performed single-cell RNA sequencing of bone marrow samples from healthy donors and rrLBCL patients with or without prolonged cytopenia and identified significantly increased frequencies of clonally expanded CX3CR1hi cytotoxic T cells, expressing high interferon (IFN)-γ and cytokine signaling gene sets, associated with prolonged cytopenia. In line with this, we found that hematopoietic stem cells from these patients expressed IFN-γ response signatures. IFN-γ deregulates hematopoietic stem cell self-renewal and differentiation and can be targeted with thrombopoietin agonists or IFN-γ-neutralizing antibodies, highlighting a potential mechanism-based approach for the treatment of CAR T-associated prolonged cytopenia.


Assuntos
Linfoma de Células B , Receptores de Antígenos Quiméricos , Humanos , Imunoterapia Adotiva , Medula Óssea , Linfócitos T CD8-Positivos , Antígenos CD19 , Interferon gama
4.
Bioinformatics ; 39(7)2023 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-37389415

RESUMO

MOTIVATION: Knowledge graphs (KGs) are a powerful approach for integrating heterogeneous data and making inferences in biology and many other domains, but a coherent solution for constructing, exchanging, and facilitating the downstream use of KGs is lacking. RESULTS: Here we present KG-Hub, a platform that enables standardized construction, exchange, and reuse of KGs. Features include a simple, modular extract-transform-load pattern for producing graphs compliant with Biolink Model (a high-level data model for standardizing biological data), easy integration of any OBO (Open Biological and Biomedical Ontologies) ontology, cached downloads of upstream data sources, versioned and automatically updated builds with stable URLs, web-browsable storage of KG artifacts on cloud infrastructure, and easy reuse of transformed subgraphs across projects. Current KG-Hub projects span use cases including COVID-19 research, drug repurposing, microbial-environmental interactions, and rare disease research. KG-Hub is equipped with tooling to easily analyze and manipulate KGs. KG-Hub is also tightly integrated with graph machine learning (ML) tools which allow automated graph ML, including node embeddings and training of models for link prediction and node classification. AVAILABILITY AND IMPLEMENTATION: https://kghub.org.


Assuntos
Ontologias Biológicas , COVID-19 , Humanos , Reconhecimento Automatizado de Padrão , Doenças Raras , Aprendizado de Máquina
5.
Nat Comput Sci ; 3(6): 552-568, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38177435

RESUMO

Graph representation learning methods opened new avenues for addressing complex, real-world problems represented by graphs. However, many graphs used in these applications comprise millions of nodes and billions of edges and are beyond the capabilities of current methods and software implementations. We present GRAPE (Graph Representation Learning, Prediction and Evaluation), a software resource for graph processing and embedding that is able to scale with big graphs by using specialized and smart data structures, algorithms, and a fast parallel implementation of random-walk-based methods. Compared with state-of-the-art software resources, GRAPE shows an improvement of orders of magnitude in empirical space and time complexity, as well as competitive edge- and node-label prediction performance. GRAPE comprises approximately 1.7 million well-documented lines of Python and Rust code and provides 69 node-embedding methods, 25 inference models, a collection of efficient graph-processing utilities, and over 80,000 graphs from the literature and other sources. Standardized interfaces allow a seamless integration of third-party libraries, while ready-to-use and modular pipelines permit an easy-to-use evaluation of graph-representation-learning methods, therefore also positioning GRAPE as a software resource that performs a fair comparison between methods and libraries for graph processing and embedding.


Assuntos
Bibliotecas , Vitis , Algoritmos , Software , Aprendizagem
6.
NAR Genom Bioinform ; 3(4): lqab113, 2021 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-34888523

RESUMO

Inhibiting protein kinases (PKs) that cause cancers has been an important topic in cancer therapy for years. So far, almost 8% of >530 PKs have been targeted by FDA-approved medications, and around 150 protein kinase inhibitors (PKIs) have been tested in clinical trials. We present an approach based on natural language processing and machine learning to investigate the relations between PKs and cancers, predicting PKs whose inhibition would be efficacious to treat a certain cancer. Our approach represents PKs and cancers as semantically meaningful 100-dimensional vectors based on word and concept neighborhoods in PubMed abstracts. We use information about phase I-IV trials in ClinicalTrials.gov to construct a training set for random forest classification. Our results with historical data show that associations between PKs and specific cancers can be predicted years in advance with good accuracy. Our tool can be used to predict the relevance of inhibiting PKs for specific cancers and to support the design of well-focused clinical trials to discover novel PKIs for cancer therapy.

7.
Patterns (N Y) ; 2(1): 100155, 2021 Jan 08.
Artigo em Inglês | MEDLINE | ID: mdl-33196056

RESUMO

Integrated, up-to-date data about SARS-CoV-2 and COVID-19 is crucial for the ongoing response to the COVID-19 pandemic by the biomedical research community. While rich biological knowledge exists for SARS-CoV-2 and related viruses (SARS-CoV, MERS-CoV), integrating this knowledge is difficult and time-consuming, since much of it is in siloed databases or in textual format. Furthermore, the data required by the research community vary drastically for different tasks; the optimal data for a machine learning task, for example, is much different from the data used to populate a browsable user interface for clinicians. To address these challenges, we created KG-COVID-19, a flexible framework that ingests and integrates heterogeneous biomedical data to produce knowledge graphs (KGs), and applied it to create a KG for COVID-19 response. This KG framework also can be applied to other problems in which siloed biomedical data must be quickly integrated for different research applications, including future pandemics.

8.
bioRxiv ; 2020 Aug 18.
Artigo em Inglês | MEDLINE | ID: mdl-32839776

RESUMO

Integrated, up-to-date data about SARS-CoV-2 and coronavirus disease 2019 (COVID-19) is crucial for the ongoing response to the COVID-19 pandemic by the biomedical research community. While rich biological knowledge exists for SARS-CoV-2 and related viruses (SARS-CoV, MERS-CoV), integrating this knowledge is difficult and time consuming, since much of it is in siloed databases or in textual format. Furthermore, the data required by the research community varies drastically for different tasks - the optimal data for a machine learning task, for example, is much different from the data used to populate a browsable user interface for clinicians. To address these challenges, we created KG-COVID-19, a flexible framework that ingests and integrates biomedical data to produce knowledge graphs (KGs) for COVID-19 response. This KG framework can also be applied to other problems in which siloed biomedical data must be quickly integrated for different research applications, including future pandemics. BIGGER PICTURE: An effective response to the COVID-19 pandemic relies on integration of many different types of data available about SARS-CoV-2 and related viruses. KG-COVID-19 is a framework for producing knowledge graphs that can be customized for downstream applications including machine learning tasks, hypothesis-based querying, and browsable user interface to enable researchers to explore COVID-19 data and discover relationships.

9.
Am J Hum Genet ; 107(3): 403-417, 2020 09 03.
Artigo em Inglês | MEDLINE | ID: mdl-32755546

RESUMO

Human Phenotype Ontology (HPO)-based analysis has become standard for genomic diagnostics of rare diseases. Current algorithms use a variety of semantic and statistical approaches to prioritize the typically long lists of genes with candidate pathogenic variants. These algorithms do not provide robust estimates of the strength of the predictions beyond the placement in a ranked list, nor do they provide measures of how much any individual phenotypic observation has contributed to the prioritization result. However, given that the overall success rate of genomic diagnostics is only around 25%-50% or less in many cohorts, a good ranking cannot be taken to imply that the gene or disease at rank one is necessarily a good candidate. Here, we present an approach to genomic diagnostics that exploits the likelihood ratio (LR) framework to provide an estimate of (1) the posttest probability of candidate diagnoses, (2) the LR for each observed HPO phenotype, and (3) the predicted pathogenicity of observed genotypes. LIkelihood Ratio Interpretation of Clinical AbnormaLities (LIRICAL) placed the correct diagnosis within the first three ranks in 92.9% of 384 case reports comprising 262 Mendelian diseases, and the correct diagnosis had a mean posttest probability of 67.3%. Simulations show that LIRICAL is robust to many typically encountered forms of genomic and phenomic noise. In summary, LIRICAL provides accurate, clinically interpretable results for phenotype-driven genomic diagnostics.


Assuntos
Biologia Computacional , Bases de Dados Genéticas , Genômica , Doenças Raras/diagnóstico , Algoritmos , Exoma/genética , Humanos , Fenótipo , Doenças Raras/genética , Software
10.
Nucleic Acids Res ; 48(D1): D704-D715, 2020 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-31701156

RESUMO

In biology and biomedicine, relating phenotypic outcomes with genetic variation and environmental factors remains a challenge: patient phenotypes may not match known diseases, candidate variants may be in genes that haven't been characterized, research organisms may not recapitulate human or veterinary diseases, environmental factors affecting disease outcomes are unknown or undocumented, and many resources must be queried to find potentially significant phenotypic associations. The Monarch Initiative (https://monarchinitiative.org) integrates information on genes, variants, genotypes, phenotypes and diseases in a variety of species, and allows powerful ontology-based search. We develop many widely adopted ontologies that together enable sophisticated computational analysis, mechanistic discovery and diagnostics of Mendelian diseases. Our algorithms and tools are widely used to identify animal models of human disease through phenotypic similarity, for differential diagnostics and to facilitate translational research. Launched in 2015, Monarch has grown with regards to data (new organisms, more sources, better modeling); new API and standards; ontologies (new Mondo unified disease ontology, improvements to ontologies such as HPO and uPheno); user interface (a redesigned website); and community development. Monarch data, algorithms and tools are being used and extended by resources such as GA4GH and NCATS Translator, among others, to aid mechanistic discovery and diagnostics.


Assuntos
Biologia Computacional/métodos , Genótipo , Fenótipo , Algoritmos , Animais , Ontologias Biológicas , Bases de Dados Genéticas , Exoma , Estudos de Associação Genética , Variação Genética , Genômica , Humanos , Internet , Software , Pesquisa Translacional Biomédica , Interface Usuário-Computador
11.
Artigo em Inglês | MEDLINE | ID: mdl-31119199

RESUMO

Electronic Health Record (EHR) systems typically define laboratory test results using the Laboratory Observation Identifier Names and Codes (LOINC) and can transmit them using Fast Healthcare Interoperability Resource (FHIR) standards. LOINC has not yet been semantically integrated with computational resources for phenotype analysis. Here, we provide a method for mapping LOINC-encoded laboratory test results transmitted in FHIR standards to Human Phenotype Ontology (HPO) terms. We annotated the medical implications of 2923 commonly used laboratory tests with HPO terms. Using these annotations, our software assesses laboratory test results and converts each result into an HPO term. We validated our approach with EHR data from 15,681 patients with respiratory complaints and identified known biomarkers for asthma. Finally, we provide a freely available SMART on FHIR application that can be used within EHR systems. Our approach allows readily available laboratory tests in EHR to be reused for deep phenotyping and exploits the hierarchical structure of HPO to integrate distinct tests that have comparable medical interpretations for association studies.

12.
Bioinformatics ; 34(6): 911-919, 2018 03 15.
Artigo em Inglês | MEDLINE | ID: mdl-29087447

RESUMO

Motivation: Chromatin immunoprecipitation sequencing (ChIP-seq) experiments are inexpensive and time-efficient, and result in massive datasets that introduce significant storage and maintenance challenges. To address the resulting Big Data problems, we propose a lossless and lossy compression framework specifically designed for ChIP-seq Wig data, termed ChIPWig. ChIPWig enables random access, summary statistics lookups and it is based on the asymptotic theory of optimal point density design for nonuniform quantizers. Results: We tested the ChIPWig compressor on 10 ChIP-seq datasets generated by the ENCODE consortium. On average, lossless ChIPWig reduced the file sizes to merely 6% of the original, and offered 6-fold compression rate improvement compared to bigWig. The lossy feature further reduced file sizes 2-fold compared to the lossless mode, with little or no effects on peak calling and motif discovery using specialized NarrowPeaks methods. The compression and decompression speed rates are of the order of 0.2 sec/MB using general purpose computers. Availability and implementation: The source code and binaries are freely available for download at https://github.com/vidarmehr/ChIPWig-v2, implemented in C ++. Contact: milenkov@illinois.edu. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Imunoprecipitação da Cromatina/métodos , Compressão de Dados/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Software
13.
Artigo em Inglês | MEDLINE | ID: mdl-21464507

RESUMO

We introduce a class of finite systems models of gene regulatory networks exhibiting behavior of the cell cycle. The model is an extension of a Boolean network model. The system spontaneously cycles through a finite set of internal states, tracking the increase of an external factor such as cell mass, and also exhibits checkpoints in which errors in gene expression levels due to cellular noise are automatically corrected. We present a 7-gene network based on Projective Geometry codes, which can correct, at every given time, one gene expression error. The topology of a network is highly symmetric and requires using only simple Boolean functions that can be synthesized using genes of various organisms. The attractor structure of the Boolean network contains a single cycle attractor. It is the smallest nontrivial network with such high robustness. The methodology allows construction of artificial cell cycle gene regulatory networks with the number of phases larger than in natural cell cycle.


Assuntos
Redes Reguladoras de Genes , Teoria da Informação , Modelos Genéticos , Ciclo Celular/genética , Biologia Computacional
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...