Pesquisa | Portal Regional da BVS (teste)

Mixture-of-Experts Variational Autoencoder for clustering and generating from similarity-based representations on single cell data.

Kopf, Andreas; Fortuin, Vincent; Somnath, Vignesh Ram; Claassen, Manfred.

PLoS Comput Biol ; 17(6): e1009086, 2021 06.

Artigo em Inglês | MEDLINE | ID: mdl-34191792

RESUMO

Clustering high-dimensional data, such as images or biological measurements, is a long-standing problem and has been studied extensively. Recently, Deep Clustering has gained popularity due to its flexibility in fitting the specific peculiarities of complex data. Here we introduce the Mixture-of-Experts Similarity Variational Autoencoder (MoE-Sim-VAE), a novel generative clustering model. The model can learn multi-modal distributions of high-dimensional data and use these to generate realistic data with high efficacy and efficiency. MoE-Sim-VAE is based on a Variational Autoencoder (VAE), where the decoder consists of a Mixture-of-Experts (MoE) architecture. This specific architecture allows for various modes of the data to be automatically learned by means of the experts. Additionally, we encourage the lower dimensional latent representation of our model to follow a Gaussian mixture distribution and to accurately represent the similarities between the data points. We assess the performance of our model on the MNIST benchmark data set and challenging real-world tasks of clustering mouse organs from single-cell RNA-sequencing measurements and defining cell subpopulations from mass cytometry (CyTOF) measurements on hundreds of different datasets. MoE-Sim-VAE exhibits superior clustering performance on all these tasks in comparison to the baselines as well as competitor methods.

Assuntos

Análise de Célula Única/estatística & dados numéricos , Animais , Análise por Conglomerados , Biologia Computacional , Aprendizado Profundo , Perfilação da Expressão Gênica/estatística & dados numéricos , Leucócitos Mononucleares/classificação , Camundongos , Modelos Biológicos , Distribuição Normal , Especificidade de Órgãos , Fenótipo , RNA-Seq/estatística & dados numéricos

MGP-AttTCN: An interpretable machine learning model for the prediction of sepsis.

Rosnati, Margherita; Fortuin, Vincent.

PLoS One ; 16(5): e0251248, 2021.

Artigo em Inglês | MEDLINE | ID: mdl-33961681

RESUMO

With a mortality rate of 5.4 million lives worldwide every year and a healthcare cost of more than 16 billion dollars in the USA alone, sepsis is one of the leading causes of hospital mortality and an increasing concern in the ageing western world. Recently, medical and technological advances have helped re-define the illness criteria of this disease, which is otherwise poorly understood by the medical society. Together with the rise of widely accessible Electronic Health Records, the advances in data mining and complex nonlinear algorithms are a promising avenue for the early detection of sepsis. This work contributes to the research effort in the field of automated sepsis detection with an open-access labelling of the medical MIMIC-III data set. Moreover, we propose MGP-AttTCN: a joint multitask Gaussian Process and attention-based deep learning model to early predict the occurrence of sepsis in an interpretable manner. We show that our model outperforms the current state-of-the-art and present evidence that different labelling heuristics lead to discrepancies in task difficulty. For instance, when predicting sepsis five hours prior to onset on our new realistic labels, our proposed model achieves an area under the ROC curve of 0.660 and an area under the PR curve of 0.483, whereas the (less interpretable) previous state-of-the-art model (MGP-TCN) achieves 0.635 AUROC and 0.460 AUPR and the popular commercial InSight model achieves 0.490 AUROC and 0.359 AUPR.

Assuntos

Mortalidade Hospitalar , Sepse/mortalidade , Algoritmos , Humanos , Aprendizado de Máquina , Prognóstico

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA