Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Bioinformatics ; 37(Suppl_1): i245-i253, 2021 07 12.
Artigo em Inglês | MEDLINE | ID: mdl-34252933

RESUMO

SUMMARY: In recent years, SWATH-MS has become the proteomic method of choice for data-independent-acquisition, as it enables high proteome coverage, accuracy and reproducibility. However, data analysis is convoluted and requires prior information and expert curation. Furthermore, as quantification is limited to a small set of peptides, potentially important biological information may be discarded. Here we demonstrate that deep learning can be used to learn discriminative features directly from raw MS data, eliminating hence the need of elaborate data processing pipelines. Using transfer learning to overcome sample sparsity, we exploit a collection of publicly available deep learning models already trained for the task of natural image classification. These models are used to produce feature vectors from each mass spectrometry (MS) raw image, which are later used as input for a classifier trained to distinguish tumor from normal prostate biopsies. Although the deep learning models were originally trained for a completely different classification task and no additional fine-tuning is performed on them, we achieve a highly remarkable classification performance of 0.876 AUC. We investigate different types of image preprocessing and encoding. We also investigate whether the inclusion of the secondary MS2 spectra improves the classification performance. Throughout all tested models, we use standard protein expression vectors as gold standards. Even with our naïve implementation, our results suggest that the application of deep learning and transfer learning techniques might pave the way to the broader usage of raw mass spectrometry data in real-time diagnosis. AVAILABILITY AND IMPLEMENTATION: The open source code used to generate the results from MS images is available on GitHub: https://ibm.biz/mstransc. The raw MS data underlying this article cannot be shared publicly for the privacy of individuals that participated in the study. Processed data including the MS images, their encodings, classification labels and results can be accessed at the following link: https://ibm.box.com/v/mstc-supplementary. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Aprendizado Profundo , Estudos de Viabilidade , Humanos , Masculino , Espectrometria de Massas , Redes Neurais de Computação , Proteômica , Reprodutibilidade dos Testes
2.
Bioinformatics ; 37(14): 2070-2072, 2021 08 04.
Artigo em Inglês | MEDLINE | ID: mdl-33241320

RESUMO

SUMMARY: The advent of high-throughput technologies has provided researchers with measurements of thousands of molecular entities and enable the investigation of the internal regulatory apparatus of the cell. However, network inference from high-throughput data is far from being a solved problem. While a plethora of different inference methods have been proposed, they often lead to non-overlapping predictions, and many of them lack user-friendly implementations to enable their broad utilization. Here, we present Consensus Interaction Network Inference Service (COSIFER), a package and a companion web-based platform to infer molecular networks from expression data using state-of-the-art consensus approaches. COSIFER includes a selection of state-of-the-art methodologies for network inference and different consensus strategies to integrate the predictions of individual methods and generate robust networks. AVAILABILITY AND IMPLEMENTATION: COSIFER Python source code is available at https://github.com/PhosphorylatedRabbits/cosifer. The web service is accessible at https://ibm.biz/cosifer-aas. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Software , Consenso
3.
NPJ Syst Biol Appl ; 6(1): 27, 2020 08 25.
Artigo em Inglês | MEDLINE | ID: mdl-32843649

RESUMO

Knowledge about the clonal evolution of a tumor can help to interpret the function of its genetic alterations by identifying initiating events and events that contribute to the selective advantage of proliferative, metastatic, and drug-resistant subclones. Clonal evolution can be reconstructed from estimates of the relative abundance (frequency) of subclone-specific alterations in tumor biopsies, which, in turn, inform on its composition. However, estimating these frequencies is complicated by the high genetic instability that characterizes many cancers. Models for genetic instability suggest that copy number alterations (CNAs) can influence mutation-frequency estimates and thus impede efforts to reconstruct tumor phylogenies. Our analysis suggested that accurate mutation frequency estimates require accounting for CNAs-a challenging endeavour using the genetic profile of a single tumor biopsy. Instead, we propose an optimization algorithm, Chimæra, to account for the effects of CNAs using profiles of multiple biopsies per tumor. Analyses of simulated data and tumor profiles suggested that Chimæra estimates are consistently more accurate than those of previously proposed methods and resulted in improved phylogeny reconstructions and subclone characterizations. Our analyses inferred recurrent initiating mutations in hepatocellular carcinomas, resolved the clonal composition of Wilms' tumors, and characterized the acquisition of mutations in drug-resistant prostate cancers.


Assuntos
Evolução Clonal , Neoplasias/genética , Neoplasias/patologia , Biópsia , Variações do Número de Cópias de DNA , Humanos
4.
IEEE/ACM Trans Comput Biol Bioinform ; 17(6): 2141-2147, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-31494553

RESUMO

Boolean models are a powerful abstraction for qualitative modeling of gene regulatory networks. With the recent availability of advanced high-throughput technologies, Boolean models have increasingly grown in size and complexity, posing a challenge for existing software simulation tools that have not scaled at the same speed. Field Programmable Gate Arrays (FPGAs) are powerful reconfigurable integrated circuits that can offer massive performance improvements. Due to their highly parallel nature, FPGAs are well suited to simulate complex molecular networks. We present here a new simulation framework for Boolean models, which first converts the model to Verilog, a standardized hardware description language, and then connects it to an execution core that runs on an FPGA coherently attached to a POWER8 processor. We report an order of magnitude speedup over a multi-threaded software simulation tool running on the same processor on a selection of Boolean models. Analysis on a T-cell large granular lymphocyte leukemia (T-LGL) demonstrates that our framework achieves consistent performance improvements resulting in new biological insights. In addition, we show that our solution allows to perform attractor detection at an unprecedented speed, exhibiting a speedup ranging from one to three orders of magnitude compared to alternative software solutions.


Assuntos
Biologia Computacional/métodos , Simulação por Computador , Redes Reguladoras de Genes/genética , Modelos Genéticos , Humanos , Leucemia Linfocítica Granular Grande/genética , Software
5.
Sci Rep ; 9(1): 15918, 2019 11 04.
Artigo em Inglês | MEDLINE | ID: mdl-31685861

RESUMO

We present the Network-based Biased Tree Ensembles (NetBiTE) method for drug sensitivity prediction and drug sensitivity biomarker identification in cancer using a combination of prior knowledge and gene expression data. Our devised method consists of a biased tree ensemble that is built according to a probabilistic bias weight distribution. The bias weight distribution is obtained from the assignment of high weights to the drug targets and propagating the assigned weights over a protein-protein interaction network such as STRING. The propagation of weights, defines neighborhoods of influence around the drug targets and as such simulates the spread of perturbations within the cell, following drug administration. Using a synthetic dataset, we showcase how application of biased tree ensembles (BiTE) results in significant accuracy gains at a much lower computational cost compared to the unbiased random forests (RF) algorithm. We then apply NetBiTE to the Genomics of Drug Sensitivity in Cancer (GDSC) dataset and demonstrate that NetBiTE outperforms RF in predicting IC50 drug sensitivity, only for drugs that target membrane receptor pathways (MRPs): RTK, EGFR and IGFR signaling pathways. We propose based on the NetBiTE results, that for drugs that inhibit MRPs, the expression of target genes prior to drug administration is a biomarker for IC50 drug sensitivity following drug administration. We further verify and reinforce this proposition through control studies on, PI3K/MTOR signaling pathway inhibitors, a drug category that does not target MRPs, and through assignment of dummy targets to MRP inhibiting drugs and investigating the variation in NetBiTE accuracy.


Assuntos
Algoritmos , Antineoplásicos/química , Biomarcadores/metabolismo , Neoplasias/patologia , Antineoplásicos/farmacologia , Antineoplásicos/uso terapêutico , Bases de Dados Factuais , Humanos , Concentração Inibidora 50 , Neoplasias/tratamento farmacológico , Neoplasias/metabolismo , Fosfatidilinositol 3-Quinases/metabolismo , Mapas de Interação de Proteínas/efeitos dos fármacos , Inibidores de Proteínas Quinases/química , Inibidores de Proteínas Quinases/farmacologia , Inibidores de Proteínas Quinases/uso terapêutico , Receptores de Superfície Celular/metabolismo , Transdução de Sinais/efeitos dos fármacos , Serina-Treonina Quinases TOR/metabolismo
6.
NPJ Syst Biol Appl ; 5: 8, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-30854223

RESUMO

Reliable identification of molecular biomarkers is essential for accurate patient stratification. While state-of-the-art machine learning approaches for sample classification continue to push boundaries in terms of performance, most of these methods are not able to integrate different data types and lack generalization power, limiting their application in a clinical setting. Furthermore, many methods behave as black boxes, and we have very little understanding about the mechanisms that lead to the prediction. While opaqueness concerning machine behavior might not be a problem in deterministic domains, in health care, providing explanations about the molecular factors and phenotypes that are driving the classification is crucial to build trust in the performance of the predictive system. We propose Pathway-Induced Multiple Kernel Learning (PIMKL), a methodology to reliably classify samples that can also help gain insights into the molecular mechanisms that underlie the classification. PIMKL exploits prior knowledge in the form of a molecular interaction network and annotated gene sets, by optimizing a mixture of pathway-induced kernels using a Multiple Kernel Learning (MKL) algorithm, an approach that has demonstrated excellent performance in different machine learning applications. After optimizing the combination of kernels to predict a specific phenotype, the model provides a stable molecular signature that can be interpreted in the light of the ingested prior knowledge and that can be used in transfer learning tasks.


Assuntos
Biomarcadores Tumorais/classificação , Biologia Computacional/métodos , Algoritmos , Humanos , Aprendizado de Máquina , Reconhecimento Automatizado de Padrão/métodos , Software
7.
BMC Evol Biol ; 17(1): 73, 2017 03 09.
Artigo em Inglês | MEDLINE | ID: mdl-28274196

RESUMO

BACKGROUND: The ability to form a cellular memory and use it for cellular decision-making could help bacteria to cope with recurrent stress conditions. We analyzed whether bacteria would form a cellular memory specifically if past events are predictive of future conditions. We worked with the asymmetrically dividing bacterium Caulobacter crescentus where past events are expected to only be informative for one of the two cells emerging from division, the sessile cell that remains in the same microenvironment and does not migrate. RESULTS: Time-resolved analysis of individual cells revealed that past exposure to low levels of antibiotics increases tolerance to future exposure for the sessile but not for the motile cell. Using computer simulations, we found that such an asymmetry in cellular memory could be an evolutionary response to situations where the two cells emerging from division will experience different future conditions. CONCLUSIONS: Our results raise the question whether bacteria can evolve the ability to form and use cellular memory conditionally in situations where it is beneficial.


Assuntos
Antibacterianos/farmacologia , Caulobacter crescentus/efeitos dos fármacos , Caulobacter crescentus/fisiologia , Evolução Biológica , Simulação por Computador , Farmacorresistência Bacteriana
8.
Proc Natl Acad Sci U S A ; 113(15): 4224-9, 2016 Apr 12.
Artigo em Inglês | MEDLINE | ID: mdl-26960998

RESUMO

Most bacteria live in ever-changing environments where periods of stress are common. One fundamental question is whether individual bacterial cells have an increased tolerance to stress if they recently have been exposed to lower levels of the same stressor. To address this question, we worked with the bacterium Caulobacter crescentus and asked whether exposure to a moderate concentration of sodium chloride would affect survival during later exposure to a higher concentration. We found that the effects measured at the population level depended in a surprising and complex way on the time interval between the two exposure events: The effect of the first exposure on survival of the second exposure was positive for some time intervals but negative for others. We hypothesized that the complex pattern of history dependence at the population level was a consequence of the responses of individual cells to sodium chloride that we observed: (i) exposure to moderate concentrations of sodium chloride caused delays in cell division and led to cell-cycle synchronization, and (ii) whether a bacterium would survive subsequent exposure to higher concentrations was dependent on the cell-cycle state. Using computational modeling, we demonstrated that indeed the combination of these two effects could explain the complex patterns of history dependence observed at the population level. Our insight into how the behavior of single cells scales up to processes at the population level provides a perspective on how organisms operate in dynamic environments with fluctuating stress exposure.


Assuntos
Caulobacter crescentus/citologia , Divisão Celular , Ciclo Celular , Cloreto de Sódio
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...