Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
Proc Natl Acad Sci U S A ; 120(21): e2209124120, 2023 05 23.
Article in English | MEDLINE | ID: mdl-37192164

ABSTRACT

Detecting differentially expressed genes is important for characterizing subpopulations of cells. In scRNA-seq data, however, nuisance variation due to technical factors like sequencing depth and RNA capture efficiency obscures the underlying biological signal. Deep generative models have been extensively applied to scRNA-seq data, with a special focus on embedding cells into a low-dimensional latent space and correcting for batch effects. However, little attention has been paid to the problem of utilizing the uncertainty from the deep generative model for differential expression (DE). Furthermore, the existing approaches do not allow for controlling for effect size or the false discovery rate (FDR). Here, we present lvm-DE, a generic Bayesian approach for performing DE predictions from a fitted deep generative model, while controlling the FDR. We apply the lvm-DE framework to scVI and scSphere, two deep generative models. The resulting approaches outperform state-of-the-art methods at estimating the log fold change in gene expression levels as well as detecting differentially expressed genes between subpopulations of cells.


Subject(s)
RNA , Single-Cell Analysis , Bayes Theorem , Sequence Analysis, RNA/methods , Single-Cell Analysis/methods , Gene Expression Profiling/methods
3.
Nat Methods ; 18(3): 272-282, 2021 03.
Article in English | MEDLINE | ID: mdl-33589839

ABSTRACT

The paired measurement of RNA and surface proteins in single cells with cellular indexing of transcriptomes and epitopes by sequencing (CITE-seq) is a promising approach to connect transcriptional variation with cell phenotypes and functions. However, combining these paired views into a unified representation of cell state is made challenging by the unique technical characteristics of each measurement. Here we present Total Variational Inference (totalVI; https://scvi-tools.org ), a framework for end-to-end joint analysis of CITE-seq data that probabilistically represents the data as a composite of biological and technical factors, including protein background and batch effects. To evaluate totalVI's performance, we profiled immune cells from murine spleen and lymph nodes with CITE-seq, measuring over 100 surface proteins. We demonstrate that totalVI provides a cohesive solution for common analysis tasks such as dimensionality reduction, the integration of datasets with different measured proteins, estimation of correlations between molecules and differential expression testing.


Subject(s)
Lymph Nodes/metabolism , Proteins/analysis , Single-Cell Analysis/methods , Spleen/metabolism , Transcriptome/genetics , Animals , Cells, Cultured , Data Analysis , Female , High-Throughput Screening Assays/methods , Lymph Nodes/cytology , Mice , Mice, Inbred C57BL , RNA/analysis , RNA/genetics , Spleen/cytology
4.
Mol Syst Biol ; 17(1): e9620, 2021 01.
Article in English | MEDLINE | ID: mdl-33491336

ABSTRACT

As the number of single-cell transcriptomics datasets grows, the natural next step is to integrate the accumulating data to achieve a common ontology of cell types and states. However, it is not straightforward to compare gene expression levels across datasets and to automatically assign cell type labels in a new dataset based on existing annotations. In this manuscript, we demonstrate that our previously developed method, scVI, provides an effective and fully probabilistic approach for joint representation and analysis of scRNA-seq data, while accounting for uncertainty caused by biological and measurement noise. We also introduce single-cell ANnotation using Variational Inference (scANVI), a semi-supervised variant of scVI designed to leverage existing cell state annotations. We demonstrate that scVI and scANVI compare favorably to state-of-the-art methods for data integration and cell state annotation in terms of accuracy, scalability, and adaptability to challenging settings. In contrast to existing methods, scVI and scANVI integrate multiple datasets with a single generative model that can be directly used for downstream tasks, such as differential expression. Both methods are easily accessible through scvi-tools.


Subject(s)
Computational Biology/methods , Single-Cell Analysis/methods , Databases, Genetic , Gene Expression Profiling , Humans , Molecular Sequence Annotation , Sequence Analysis, RNA , Supervised Machine Learning
5.
Nat Methods ; 15(12): 1053-1058, 2018 12.
Article in English | MEDLINE | ID: mdl-30504886

ABSTRACT

Single-cell transcriptome measurements can reveal unexplored biological diversity, but they suffer from technical noise and bias that must be modeled to account for the resulting uncertainty in downstream analyses. Here we introduce single-cell variational inference (scVI), a ready-to-use scalable framework for the probabilistic representation and analysis of gene expression in single cells ( https://github.com/YosefLab/scVI ). scVI uses stochastic optimization and deep neural networks to aggregate information across similar cells and genes and to approximate the distributions that underlie observed expression values, while accounting for batch effects and limited sensitivity. We used scVI for a range of fundamental analysis tasks including batch correction, visualization, clustering, and differential expression, and achieved high accuracy for each task.


Subject(s)
Computational Biology/methods , High-Throughput Nucleotide Sequencing/methods , Models, Biological , Sequence Analysis, RNA/methods , Single-Cell Analysis/methods , Transcriptome , Algorithms , Animals , Brain/cytology , Brain/metabolism , Cluster Analysis , Genetic Variation , Hematopoietic Stem Cells/cytology , Hematopoietic Stem Cells/metabolism , Humans , Leukocytes, Mononuclear/cytology , Leukocytes, Mononuclear/metabolism , Mice
SELECTION OF CITATIONS
SEARCH DETAIL
...