Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
Add more filters










Database
Language
Publication year range
2.
Nat Methods ; 21(2): 217-227, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38191932

ABSTRACT

Single-cell omics technologies have revolutionized the study of gene regulation in complex tissues. A major computational challenge in analyzing these datasets is to project the large-scale and high-dimensional data into low-dimensional space while retaining the relative relationships between cells. This low dimension embedding is necessary to decompose cellular heterogeneity and reconstruct cell-type-specific gene regulatory programs. Traditional dimensionality reduction techniques, however, face challenges in computational efficiency and in comprehensively addressing cellular diversity across varied molecular modalities. Here we introduce a nonlinear dimensionality reduction algorithm, embodied in the Python package SnapATAC2, which not only achieves a more precise capture of single-cell omics data heterogeneities but also ensures efficient runtime and memory usage, scaling linearly with the number of cells. Our algorithm demonstrates exceptional performance, scalability and versatility across diverse single-cell omics datasets, including single-cell assay for transposase-accessible chromatin using sequencing, single-cell RNA sequencing, single-cell Hi-C and single-cell multi-omics datasets, underscoring its utility in advancing single-cell analysis.


Subject(s)
Algorithms , Chromatin , Single-Cell Analysis/methods
3.
Nature ; 624(7991): 390-402, 2023 Dec.
Article in English | MEDLINE | ID: mdl-38092918

ABSTRACT

Divergence of cis-regulatory elements drives species-specific traits1, but how this manifests in the evolution of the neocortex at the molecular and cellular level remains unclear. Here we investigated the gene regulatory programs in the primary motor cortex of human, macaque, marmoset and mouse using single-cell multiomics assays, generating gene expression, chromatin accessibility, DNA methylome and chromosomal conformation profiles from a total of over 200,000 cells. From these data, we show evidence that divergence of transcription factor expression corresponds to species-specific epigenome landscapes. We find that conserved and divergent gene regulatory features are reflected in the evolution of the three-dimensional genome. Transposable elements contribute to nearly 80% of the human-specific candidate cis-regulatory elements in cortical cells. Through machine learning, we develop sequence-based predictors of candidate cis-regulatory elements in different species and demonstrate that the genomic regulatory syntax is highly preserved from rodents to primates. Finally, we show that epigenetic conservation combined with sequence similarity helps to uncover functional cis-regulatory elements and enhances our ability to interpret genetic variants contributing to neurological disease and traits.


Subject(s)
Conserved Sequence , Evolution, Molecular , Gene Expression Regulation , Gene Regulatory Networks , Mammals , Neocortex , Animals , Humans , Mice , Callithrix/genetics , Chromatin/genetics , Chromatin/metabolism , Conserved Sequence/genetics , DNA Methylation , DNA Transposable Elements/genetics , Epigenome , Gene Expression Regulation/genetics , Macaca/genetics , Mammals/genetics , Motor Cortex/cytology , Motor Cortex/metabolism , Multiomics , Neocortex/cytology , Neocortex/metabolism , Regulatory Sequences, Nucleic Acid/genetics , Single-Cell Analysis , Transcription Factors/metabolism , Genetic Variation/genetics
4.
Nature ; 624(7991): 378-389, 2023 Dec.
Article in English | MEDLINE | ID: mdl-38092917

ABSTRACT

Recent advances in single-cell technologies have led to the discovery of thousands of brain cell types; however, our understanding of the gene regulatory programs in these cell types is far from complete1-4. Here we report a comprehensive atlas of candidate cis-regulatory DNA elements (cCREs) in the adult mouse brain, generated by analysing chromatin accessibility in 2.3 million individual brain cells from 117 anatomical dissections. The atlas includes approximately 1 million cCREs and their chromatin accessibility across 1,482 distinct brain cell populations, adding over 446,000 cCREs to the most recent such annotation in the mouse genome. The mouse brain cCREs are moderately conserved in the human brain. The mouse-specific cCREs-specifically, those identified from a subset of cortical excitatory neurons-are strongly enriched for transposable elements, suggesting a potential role for transposable elements in the emergence of new regulatory programs and neuronal diversity. Finally, we infer the gene regulatory networks in over 260 subclasses of mouse brain cells and develop deep-learning models to predict the activities of gene regulatory elements in different brain cell types from the DNA sequence alone. Our results provide a resource for the analysis of cell-type-specific gene regulation programs in both mouse and human brains.


Subject(s)
Brain , Chromatin , Single-Cell Analysis , Animals , Humans , Mice , Brain/cytology , Brain/metabolism , Cerebral Cortex/cytology , Chromatin/chemistry , Chromatin/genetics , Chromatin/metabolism , Deep Learning , DNA Transposable Elements/genetics , Gene Regulatory Networks/genetics , Neurons/metabolism
5.
bioRxiv ; 2023 Sep 15.
Article in English | MEDLINE | ID: mdl-37745443

ABSTRACT

Single-cell omics technologies have ushered in a new era for the study of dynamic gene regulation in complex tissues during development and disease pathogenesis. A major computational challenge in analyzing these datasets is to project the large-scale and high dimensional data into low-dimensional space while retaining the relative relationships between cells in order to decompose the cellular heterogeneity and reconstruct cell-type-specific gene regulatory programs. Conventional dimensionality reduction methods suffer from computational inefficiency, difficulty to capture the full spectrum of cellular heterogeneity, or inability to apply across diverse molecular modalities. Here, we report a fast and nonlinear dimensionality reduction algorithm that not only more accurately captures the heterogeneities of single-cell omics data, but also features runtime and memory usage that is computational efficient and linearly proportional to cell numbers. We implement this algorithm in a Python package named SnapATAC2, and demonstrate its superior performance, remarkable scalability and general adaptability using an array of single-cell omics data types, including single-cell ATAC-seq, single-cell RNA-seq, single-cell Hi-C, and single-cell multiomics datasets.

6.
Cell Genom ; 3(7): 100342, 2023 Jul 12.
Article in English | MEDLINE | ID: mdl-37492103

ABSTRACT

Single-cell sequencing could help to solve the fundamental challenge of linking millions of cell-type-specific enhancers with their target genes. However, this task is confounded by patterns of gene co-expression in much the same way that genetic correlation due to linkage disequilibrium confounds fine-mapping in genome-wide association studies (GWAS). We developed a non-parametric permutation-based procedure to establish stringent statistical criteria to control the risk of false-positive associations in enhancer-gene association studies (EGAS). We applied our procedure to large-scale transcriptome and epigenome data from multiple tissues and species, including the mouse and human brain, to predict enhancer-gene associations genome wide. We tested the functional validity of our predictions by comparing them with chromatin conformation data and causal enhancer perturbation experiments. Our study shows how controlling for gene co-expression enables robust enhancer-gene linkage using single-cell sequencing data.

7.
bioRxiv ; 2023 Apr 08.
Article in English | MEDLINE | ID: mdl-37066152

ABSTRACT

Sequence divergence of cis- regulatory elements drives species-specific traits, but how this manifests in the evolution of the neocortex at the molecular and cellular level remains to be elucidated. We investigated the gene regulatory programs in the primary motor cortex of human, macaque, marmoset, and mouse with single-cell multiomics assays, generating gene expression, chromatin accessibility, DNA methylome, and chromosomal conformation profiles from a total of over 180,000 cells. For each modality, we determined species-specific, divergent, and conserved gene expression and epigenetic features at multiple levels. We find that cell type-specific gene expression evolves more rapidly than broadly expressed genes and that epigenetic status at distal candidate cis -regulatory elements (cCREs) evolves faster than promoters. Strikingly, transposable elements (TEs) contribute to nearly 80% of the human-specific cCREs in cortical cells. Through machine learning, we develop sequence-based predictors of cCREs in different species and demonstrate that the genomic regulatory syntax is highly preserved from rodents to primates. Lastly, we show that epigenetic conservation combined with sequence similarity helps uncover functional cis -regulatory elements and enhances our ability to interpret genetic variants contributing to neurological disease and traits.

8.
Cell Genom ; 2(3)2022 Mar 09.
Article in English | MEDLINE | ID: mdl-35419551

ABSTRACT

Single-cell technologies measure unique cellular signatures but are typically limited to a single modality. Computational approaches allow the fusion of diverse single-cell data types, but their efficacy is difficult to validate in the absence of authentic multi-omic measurements. To comprehensively assess the molecular phenotypes of single cells, we devised single-nucleus methylcytosine, chromatin accessibility, and transcriptome sequencing (snmCAT-seq) and applied it to postmortem human frontal cortex tissue. We developed a cross-validation approach using multi-modal information to validate fine-grained cell types and assessed the effectiveness of computational data fusion methods. Correlation analysis in individual cells revealed distinct relations between methylation and gene expression. Our integrative approach enabled joint analyses of the methylome, transcriptome, chromatin accessibility, and conformation for 63 human cortical cell types. We reconstructed regulatory lineages for cortical cell populations and found specific enrichment of genetic risk for neuropsychiatric traits, enabling the prediction of cell types that are associated with diseases.

9.
Neuron ; 109(1): 11-26, 2021 01 06.
Article in English | MEDLINE | ID: mdl-33412093

ABSTRACT

Single-cell sequencing technologies, including transcriptomic and epigenomic assays, are transforming our understanding of the cellular building blocks of neural circuits. By directly measuring multiple molecular signatures in thousands to millions of individual cells, single-cell sequencing methods can comprehensively characterize the diversity of brain cell types. These measurements uncover gene regulatory mechanisms that shape cellular identity and provide insight into developmental and evolutionary relationships between brain cell populations. Single-cell sequencing data can aid the design of tools for targeted functional studies of brain circuit components, linking molecular signatures with anatomy, connectivity, morphology, and physiology. Here, we discuss the fundamental principles of single-cell transcriptome and epigenome sequencing, integrative computational analysis of the data, and key applications in neuroscience.


Subject(s)
Brain/metabolism , Epigenome/physiology , Single-Cell Analysis/methods , Transcriptome/physiology , DNA Methylation/physiology , Epigenomics/methods , Humans , Sequence Analysis, DNA/methods , Sequence Analysis, RNA/methods
SELECTION OF CITATIONS
SEARCH DETAIL
...