Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 36
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Nat Neurosci ; 27(6): 1051-1063, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38594596

RESUMO

RNA isoforms influence cell identity and function. However, a comprehensive brain isoform map was lacking. We analyze single-cell RNA isoforms across brain regions, cell subtypes, developmental time points and species. For 72% of genes, full-length isoform expression varies along one or more axes. Splicing, transcription start and polyadenylation sites vary strongly between cell types, influence protein architecture and associate with disease-linked variation. Additionally, neurotransmitter transport and synapse turnover genes harbor cell-type variability across anatomical regions. Regulation of cell-type-specific splicing is pronounced in the postnatal day 21-to-postnatal day 28 adolescent transition. Developmental isoform regulation is stronger than regional regulation for the same cell type. Cell-type-specific isoform regulation in mice is mostly maintained in the human hippocampus, allowing extrapolation to the human brain. Conversely, the human brain harbors additional cell-type specificity, suggesting gain-of-function isoforms. Together, this detailed single-cell atlas of full-length isoform regulation across development, anatomical regions and species reveals an unappreciated degree of isoform variability across multiple axes.


Assuntos
Encéfalo , Análise de Célula Única , Animais , Humanos , Camundongos , Encéfalo/metabolismo , Encéfalo/crescimento & desenvolvimento , Análise de Célula Única/métodos , Splicing de RNA/genética , Isoformas de RNA/genética , Processamento Alternativo/genética , Masculino , Camundongos Endogâmicos C57BL
2.
bioRxiv ; 2024 Feb 28.
Artigo em Inglês | MEDLINE | ID: mdl-38464236

RESUMO

Multimodal measurements have become widespread in genomics, however measuring open chromatin accessibility and splicing simultaneously in frozen brain tissues remains unconquered. Hence, we devised Single-Cell-ISOform-RNA sequencing coupled with the Assay-for-Transposase-Accessible-Chromatin (ScISOr-ATAC). We utilized ScISOr-ATAC to assess whether chromatin and splicing alterations in the brain convergently affect the same cell types or divergently different ones. We applied ScISOr-ATAC to three major conditions: comparing (i) the Rhesus macaque (Macaca mulatta) prefrontal cortex (PFC) and visual cortex (VIS), (ii) cross species divergence of Rhesus macaque versus human PFC, as well as (iii) dysregulation in Alzheimer's disease in human PFC. We found that among cortical-layer biased excitatory neuron subtypes, splicing is highly brain-region specific for L3-5/L6 IT_RORB neurons, moderately specific in L2-3 IT_CUX2.RORB neurons and unspecific in L2-3 IT_CUX2 neurons. In contrast, at the chromatin level, L2-3 IT_CUX2.RORB neurons show the highest brain-region specificity compared to other subtypes. Likewise, when comparing human and macaque PFC, strong evolutionary divergence on one molecular modality does not necessarily imply strong such divergence on another molecular level in the same cell type. Finally, in Alzheimer's disease, oligodendrocytes show convergently high dysregulation in both chromatin and splicing. However, chromatin and splicing dysregulation most strongly affect distinct oligodendrocyte subtypes. Overall, these results indicate that chromatin and splicing can show convergent or divergent results depending on the performed comparison, justifying the need for their concurrent measurement to investigate complex systems. Taken together, ScISOr-ATAC allows for the characterization of single-cell splicing and chromatin patterns and the comparison of sample groups in frozen brain samples.

3.
Bioinformatics ; 40(2)2024 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-38262343

RESUMO

MOTIVATION: Recent advancements in long-read RNA sequencing have enabled the examination of full-length isoforms, previously uncaptured by short-read sequencing methods. An alternative powerful method for studying isoforms is through the use of barcoded short-read RNA reads, for which a barcode indicates whether two short-reads arise from the same molecule or not. Such techniques included the 10x Genomics linked-read based SParse Isoform Sequencing (SPIso-seq), as well as Loop-Seq, or Tell-Seq. Some applications, such as novel-isoform discovery, require very high coverage. Obtaining high coverage using long reads can be difficult, making barcoded RNA-seq data a valuable alternative for this task. However, most annotation pipelines are not able to work with a set of short reads instead of a single transcript, also not able to work with coverage gaps within a molecule if any. In order to overcome this challenge, we present an RNA-seq assembler that allows the determination of the expressed isoform per barcode. RESULTS: In this article, we present cloudrnaSPAdes, a tool for assembling full-length isoforms from barcoded RNA-seq linked-read data in a reference-free fashion. Evaluating it on simulated and real human data, we found that cloudrnaSPAdes accurately assembles isoforms, even for genes with high isoform diversity. AVAILABILITY AND IMPLEMENTATION: cloudrnaSPAdes is a feature release of a SPAdes assembler and version used for this article is available at https://github.com/1dayac/cloudrnaSPAdes-release.


Assuntos
Genômica , RNA , Humanos , RNA/genética , Análise de Sequência de RNA/métodos , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , RNA-Seq , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Transcriptoma
4.
bioRxiv ; 2023 Jul 27.
Artigo em Inglês | MEDLINE | ID: mdl-37546844

RESUMO

Motivation: Recent advancements in long-read RNA sequencing have enabled the examination of full-length isoforms, previously uncaptured by short-read sequencing methods. An alternative powerful method for studying isoforms is through the use of barcoded short-read RNA reads, for which a barcode indicates whether two short-reads arise from the same molecule or not. Such techniques included the 10x Genomics linked-read based SParse Isoform Sequencing (SPIso-seq), as well as Loop-Seq, or Tell-Seq. Some applications, such as novel-isoform discovery, require very high coverage. Obtaining high coverage using long reads can be difficult, making barcoded RNA-seq data a valuable alternative for this task. However, most annotation pipelines are not able to work with a set of short reads instead of a single transcript, also not able to work with coverage gaps within a molecule if any. In order to overcome this challenge, we present an RNA-seq assembler allowing the determination of the expressed isoform per barcode. Results: In this paper, we present cloudrnaSPAdes, a tool for assembling full-length isoforms from barcoded RNA-seq linked-read data in a reference-free fashion. Evaluating it on simulated and real human data, we found that cloudrnaSPAdes accurately assembles isoforms, even for genes with high isoform diversity. Availability: cloudrnaSPAdes is a feature release of a SPAdes assembler and available at https://cab.spbu.ru/software/cloudrnaspades/.

5.
PLoS Biol ; 21(6): e3002133, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37390046

RESUMO

Characterizing cellular diversity at different levels of biological organization and across data modalities is a prerequisite to understanding the function of cell types in the brain. Classification of neurons is also essential to manipulate cell types in controlled ways and to understand their variation and vulnerability in brain disorders. The BRAIN Initiative Cell Census Network (BICCN) is an integrated network of data-generating centers, data archives, and data standards developers, with the goal of systematic multimodal brain cell type profiling and characterization. Emphasis of the BICCN is on the whole mouse brain with demonstration of prototype feasibility for human and nonhuman primate (NHP) brains. Here, we provide a guide to the cellular and spatial approaches employed by the BICCN, and to accessing and using these data and extensive resources, including the BRAIN Cell Data Center (BCDC), which serves to manage and integrate data across the ecosystem. We illustrate the power of the BICCN data ecosystem through vignettes highlighting several BICCN analysis and visualization tools. Finally, we present emerging standards that have been developed or adopted toward Findable, Accessible, Interoperable, and Reusable (FAIR) neuroscience. The combined BICCN ecosystem provides a comprehensive resource for the exploration and analysis of cell types in the brain.


Assuntos
Encéfalo , Neurociências , Animais , Humanos , Camundongos , Ecossistema , Neurônios
6.
Transcription ; 14(3-5): 92-104, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37314295

RESUMO

The profiling of gene expression patterns to glean biological insights from single cells has become commonplace over the last few years. However, this approach overlooks the transcript contents that can differ between individual cells and cell populations. In this review, we describe early work in the field of single-cell short-read sequencing as well as full-length isoforms from single cells. We then describe recent work in single-cell long-read sequencing wherein some transcript elements have been observed to work in tandem. Based on earlier work in bulk tissue, we motivate the study of combination patterns of other RNA variables. Given that we are still blind to some aspects of isoform biology, we suggest possible future avenues such as CRISPR screens which can further illuminate the function of RNA variables in distinct cell populations.


Assuntos
Perfilação da Expressão Gênica , Transcriptoma , Processamento Alternativo , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , RNA/genética , Análise de Sequência de RNA , Sequenciamento de Nucleotídeos em Larga Escala
7.
bioRxiv ; 2023 Apr 04.
Artigo em Inglês | MEDLINE | ID: mdl-37066387

RESUMO

RNA isoforms influence cell identity and function. Until recently, technological limitations prevented a genome-wide appraisal of isoform influence on cell identity in various parts of the brain. Using enhanced long-read single-cell isoform sequencing, we comprehensively analyze RNA isoforms in multiple mouse brain regions, cell subtypes, and developmental timepoints from postnatal day 14 (P14) to adult (P56). For 75% of genes, full-length isoform expression varies along one or more axes of phenotypic origin, underscoring the pervasiveness of isoform regulation across multiple scales. As expected, splicing varies strongly between cell types. However, certain gene classes including neurotransmitter release and reuptake as well as synapse turnover, harbor significant variability in the same cell type across anatomical regions, suggesting differences in network activity may influence cell-type identity. Glial brain-region specificity in isoform expression includes strong poly(A)-site regulation, whereas neurons have stronger TSS regulation. Furthermore, developmental patterns of cell-type specific splicing are especially pronounced in the murine adolescent transition from P21 to P28. The same cell type traced across development shows more isoform variability than across adult anatomical regions, indicating a coordinated modulation of functional programs dictating neural development. As most cell-type specific exons in P56 mouse hippocampus behave similarly in newly generated data from human hippocampi, these principles may be extrapolated to human brain. However, human brains have evolved additional cell-type specificity in splicing, suggesting gain-of-function isoforms. Taken together, we present a detailed single-cell atlas of full-length brain isoform regulation across development and anatomical regions, providing a previously unappreciated degree of isoform variability across multiple scales of the brain.

9.
Nat Biotechnol ; 41(7): 915-918, 2023 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-36593406

RESUMO

Annotating newly sequenced genomes and determining alternative isoforms from long-read RNA data are complex and incompletely solved problems. Here we present IsoQuant-a computational tool using intron graphs that accurately reconstructs transcripts both with and without reference genome annotation. For novel transcript discovery, IsoQuant reduces the false-positive rate fivefold and 2.5-fold for Oxford Nanopore reference-based or reference-free mode, respectively. IsoQuant also improves performance for Pacific Biosciences data.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , RNA , Isoformas de Proteínas/genética , Análise de Sequência de RNA , Genoma , Análise de Sequência de DNA
10.
Bioinformatics ; 38(13): 3474-3476, 2022 06 27.
Artigo em Inglês | MEDLINE | ID: mdl-35604081

RESUMO

SUMMARY: RNA isoforms contribute to the diverse functionality of the proteins they encode within the cell. Visualizing how isoform expression differs across cell types and brain regions can inform our understanding of disease and gain or loss of functionality caused by alternative splicing with potential negative impacts. However, the extent to which this occurs in specific cell types and brain regions is largely unknown. This is the kind of information that ScisorWiz plots can provide in an informative and easily communicable manner. ScisorWiz affords its user the opportunity to visualize specific genes across any number of cell types, and provides various sorting options for the user to gain different ways to understand their data. ScisorWiz provides a clear picture of differential isoform expression through various clustering methods and highlights features such as alternative exons and single-nucleotide variants. Tools like ScisorWiz are key for interpreting single-cell isoform sequencing data. This tool applies to any single-cell long-read RNA sequencing data in any cell type, tissue or species. AVAILABILITY AND IMPLEMENTATION: Source code is available at http://github.com/ans4013/ScisorWiz. No new data were generated for this publication. Data used to generate figures was sourced from GEO accession token GSE158450 and available on GitHub as example data.


Assuntos
Processamento Alternativo , Software , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Isoformas de RNA/metabolismo , Éxons , Análise de Sequência de RNA
11.
Nat Biotechnol ; 40(7): 1082-1092, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35256815

RESUMO

Single-nuclei RNA sequencing characterizes cell types at the gene level. However, compared to single-cell approaches, many single-nuclei cDNAs are purely intronic, lack barcodes and hinder the study of isoforms. Here we present single-nuclei isoform RNA sequencing (SnISOr-Seq). Using microfluidics, PCR-based artifact removal, target enrichment and long-read sequencing, SnISOr-Seq increased barcoded, exon-spanning long reads 7.5-fold compared to naive long-read single-nuclei sequencing. We applied SnISOr-Seq to adult human frontal cortex and found that exons associated with autism exhibit coordinated and highly cell-type-specific inclusion. We found two distinct combination patterns: those distinguishing neural cell types, enriched in TSS-exon, exon-polyadenylation-site and non-adjacent exon pairs, and those with multiple configurations within one cell type, enriched in adjacent exon pairs. Finally, we observed that human-specific exons are almost as tightly coordinated as conserved exons, implying that coordination can be rapidly established during evolution. SnISOr-Seq enables cell-type-specific long-read isoform analysis in human brain and in any frozen or hard-to-dissociate sample.


Assuntos
Encéfalo , RNA , Processamento Alternativo/genética , Encéfalo/metabolismo , Éxons/genética , Humanos , Isoformas de Proteínas/genética , RNA/genética , Análise de Sequência de RNA
12.
Genome Res ; 32(4): 726-737, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35301264

RESUMO

Long-read transcriptomics require understanding error sources inherent to technologies. Current approaches cannot compare methods for an individual RNA molecule. Here, we present a novel platform-comparison method that combines barcoding strategies and long-read sequencing to sequence cDNA copies representing an individual RNA molecule on both Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT). We compare these long-read pairs in terms of sequence content and isoform patterns. Although individual read pairs show high similarity, we find differences in (1) aligned length, (2) transcription start site (TSS), (3) polyadenylation site (poly(A)-site) assignment, and (4) exon-intron structures. Overall, 25% of read pairs disagree on either TSS, poly(A)-site, or splice site. Intron-chain disagreement typically arises from alignment errors of microexons and complicated splice sites. Our single-molecule technology comparison reveals that inconsistencies are often caused by sequencing error-induced inaccurate ONT alignments, especially to downstream GUNNGU donor motifs. However, annotation-disagreeing upstream shifts in NAGNAG acceptors in ONT are often confirmed by PacBio and are thus likely real. In both barcoded and nonbarcoded ONT reads, we find that intron number and proximity of GU/AGs better predict inconsistencies with the annotation than read quality alone. We summarize these findings in an annotation-based algorithm for spliced alignment correction that improves subsequent transcript construction with ONT reads.


Assuntos
Nanoporos , DNA Complementar , Sequenciamento de Nucleotídeos em Larga Escala/métodos , RNA , Análise de Sequência de DNA/métodos , Tecnologia
13.
Sci Rep ; 12(1): 4369, 2022 03 14.
Artigo em Inglês | MEDLINE | ID: mdl-35288582

RESUMO

The zebra finch is one of the most commonly studied songbirds in biology, particularly in genomics, neuroscience and vocal communication. However, this species lacks a robust cell line for molecular biology research and reagent optimization. We generated a cell line, designated CFS414, from zebra finch embryonic fibroblasts using the SV40 large and small T antigens. This cell line demonstrates an improvement over previous songbird cell lines through continuous and density-independent growth, allowing for indefinite culture and monoclonal line derivation. Cytogenetic, genomic, and transcriptomic profiling established the provenance of this cell line and identified the expression of genes relevant to ongoing songbird research. Using this cell line, we disrupted endogenous gene sequences using S.aureus Cas9 and confirmed a stress-dependent localization response of a song system specialized gene, SAP30L. The utility of CFS414 cells enhances the comprehensive molecular potential of the zebra finch and validates cell immortalization strategies in a songbird species.


Assuntos
Tentilhões , Animais , Sistemas CRISPR-Cas , Linhagem Celular , Tentilhões/genética , Genoma , Genômica
14.
Mol Psychiatry ; 27(3): 1416-1434, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-34789849

RESUMO

Due to an inability to ethically access developing human brain tissue as well as identify prospective cases, early-arising neurodevelopmental and cell-specific signatures of Schizophrenia (Scz) have remained unknown and thus undefined. To overcome these challenges, we utilized patient-derived induced pluripotent stem cells (iPSCs) to generate 3D cerebral organoids to model neuropathology of Scz during this critical period. We discovered that Scz organoids exhibited ventricular neuropathology resulting in altered progenitor survival and disrupted neurogenesis. This ultimately yielded fewer neurons within developing cortical fields of Scz organoids. Single-cell sequencing revealed that Scz progenitors were specifically depleted of neuronal programming factors leading to a remodeling of cell-lineages, altered differentiation trajectories, and distorted cortical cell-type diversity. While Scz organoids were similar in their macromolecular diversity to organoids generated from healthy controls (Ctrls), four GWAS factors (PTN, COMT, PLCL1, and PODXL) and peptide fragments belonging to the POU-domain transcription factor family (e.g., POU3F2/BRN2) were altered. This revealed that Scz organoids principally differed not in their proteomic diversity, but specifically in their total quantity of disease and neurodevelopmental factors at the molecular level. Single-cell sequencing subsequently identified cell-type specific alterations in neuronal programming factors as well as a developmental switch in neurotrophic growth factor expression, indicating that Scz neuropathology can be encoded on a cell-type-by-cell-type basis. Furthermore, single-cell sequencing also specifically replicated the depletion of BRN2 (POU3F2) and PTN in both Scz progenitors and neurons. Subsequently, in two mechanistic rescue experiments we identified that the transcription factor BRN2 and growth factor PTN operate as mechanistic substrates of neurogenesis and cellular survival, respectively, in Scz organoids. Collectively, our work suggests that multiple mechanisms of Scz exist in patient-derived organoids, and that these disparate mechanisms converge upon primordial brain developmental pathways such as neuronal differentiation, survival, and growth factor support, which may amalgamate to elevate intrinsic risk of Scz.


Assuntos
Células-Tronco Pluripotentes Induzidas , Esquizofrenia , Humanos , Organoides/metabolismo , Proteômica , Esquizofrenia/metabolismo , Fatores de Transcrição/metabolismo
15.
Nat Commun ; 12(1): 463, 2021 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-33469025

RESUMO

Splicing varies across brain regions, but the single-cell resolution of regional variation is unclear. We present a single-cell investigation of differential isoform expression (DIE) between brain regions using single-cell long-read sequencing in mouse hippocampus and prefrontal cortex in 45 cell types at postnatal day 7 ( www.isoformAtlas.com ). Isoform tests for DIE show better performance than exon tests. We detect hundreds of DIE events traceable to cell types, often corresponding to functionally distinct protein isoforms. Mostly, one cell type is responsible for brain-region specific DIE. However, for fewer genes, multiple cell types influence DIE. Thus, regional identity can, although rarely, override cell-type specificity. Cell types indigenous to one anatomic structure display distinctive DIE, e.g. the choroid plexus epithelium manifests distinct transcription-start-site usage. Spatial transcriptomics and long-read sequencing yield a spatially resolved splicing map. Our methods quantify isoform expression with cell-type and spatial resolution and it contributes to further our understanding of how the brain integrates molecular and cellular complexity.


Assuntos
Processamento Alternativo/fisiologia , Regulação da Expressão Gênica no Desenvolvimento/fisiologia , Hipocampo/metabolismo , Córtex Pré-Frontal/metabolismo , Isoformas de Proteínas/metabolismo , Animais , Animais Recém-Nascidos , Biologia Computacional , Feminino , Hipocampo/citologia , Hipocampo/crescimento & desenvolvimento , Camundongos , Modelos Animais , Córtex Pré-Frontal/citologia , Córtex Pré-Frontal/crescimento & desenvolvimento , Isoformas de Proteínas/análise , Isoformas de Proteínas/genética , Análise de Célula Única/métodos , Análise Espacial
16.
Front Genet ; 10: 709, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31475029

RESUMO

The advent of second-generation sequencing and its application to RNA sequencing have revolutionized the field of genomics by allowing quantification of gene expression, as well as the definition of transcription start/end sites, exons, splice sites and RNA editing sites. However, due to the sequencing of fragments of cDNAs, these methods have not given a reliable picture of complete RNA isoforms. Third-generation sequencing has filled this gap and allows end-to-end sequencing of entire RNA/cDNA molecules. This approach to transcriptomics has been a "niche" technology for a couple of years but now is becoming mainstream with many different applications. Here, we review the background and progress made to date in this rapidly growing field. We start by reviewing the progressive realization that alternative splicing is omnipresent. We then focus on long-noncoding RNA isoforms and the distinct combination patterns of exons in noncoding and coding genes. We consider the implications of the recent technologies of direct RNA sequencing and single-cell isoform RNA sequencing. Finally, we discuss the parameters that define the success of long-read RNA sequencing experiments and strategies commonly used to make the most of such data.

17.
Neuron ; 103(2): 217-234.e4, 2019 07 17.
Artigo em Inglês | MEDLINE | ID: mdl-31171447

RESUMO

Synapses are fundamental information-processing units of the brain, and synaptic dysregulation is central to many brain disorders ("synaptopathies"). However, systematic annotation of synaptic genes and ontology of synaptic processes are currently lacking. We established SynGO, an interactive knowledge base that accumulates available research about synapse biology using Gene Ontology (GO) annotations to novel ontology terms: 87 synaptic locations and 179 synaptic processes. SynGO annotations are exclusively based on published, expert-curated evidence. Using 2,922 annotations for 1,112 genes, we show that synaptic genes are exceptionally well conserved and less tolerant to mutations than other genes. Many SynGO terms are significantly overrepresented among gene variations associated with intelligence, educational attainment, ADHD, autism, and bipolar disorder and among de novo variants associated with neurodevelopmental disorders, including schizophrenia. SynGO is a public, universal reference for synapse research and an online analysis platform for interpretation of large-scale -omics data (https://syngoportal.org and http://geneontology.org).


Assuntos
Encéfalo/citologia , Ontologia Genética , Proteômica , Software , Sinapses/fisiologia , Animais , Encéfalo/fisiologia , Bases de Dados Genéticas , Humanos , Bases de Conhecimento , Potenciais Sinápticos/fisiologia , Sinaptossomos
18.
Genome Biol ; 20(1): 86, 2019 04 30.
Artigo em Inglês | MEDLINE | ID: mdl-31039798

RESUMO

The 19th Annual Advances in Genome Biology and Technology (AGBT) meeting came back to Marco Island, Florida, and was held in the renovated venue from 27 February to 2 March 2019. The meeting showed a variety of new technology, both in wet lab and in bioinformatics. This year's themes included single-cell technology and applications, spatially resolved gene expression measurements, new sequencing platforms, genome assembly and variation, and long and linked reads.


Assuntos
Genômica/tendências , Animais , Humanos , Análise de Sequência de DNA , Análise de Célula Única
19.
Front Genet ; 10: 309, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31031799

RESUMO

The human brain is one of the last frontiers of biomedical research. Genome-wide association studies (GWAS) have succeeded in identifying thousands of haplotype blocks associated with a range of neuropsychiatric traits, including disorders such as schizophrenia, Alzheimer's and Parkinson's disease. However, the majority of single nucleotide polymorphisms (SNPs) that mark these haplotype blocks fall within non-coding regions of the genome, hindering their functional validation. While some of these GWAS loci may contain cis-acting regulatory DNA elements such as enhancers, we hypothesized that many are also transcribed into non-coding RNAs that are missing from publicly available transcriptome annotations. Here, we use targeted RNA capture ('RNA CaptureSeq') in combination with nanopore long-read cDNA sequencing to transcriptionally profile 1,023 haplotype blocks across the genome containing non-coding GWAS SNPs associated with neuropsychiatric traits, using post-mortem human brain tissue from three neurologically healthy donors. We find that the majority (62%) of targeted haplotype blocks, including 13% of intergenic blocks, are transcribed into novel, multi-exonic RNAs, most of which are not yet recorded in GENCODE annotations. We validated our findings with short-read RNA-seq, providing orthogonal confirmation of novel splice junctions and enabling a quantitative assessment of the long-read assemblies. Many novel transcripts are supported by independent evidence of transcription including cap analysis of gene expression (CAGE) data and epigenetic marks, and some show signs of potential functional roles. We present these transcriptomes as a preliminary atlas of non-coding transcription in human brain that can be used to connect neurological phenotypes with gene expression.

20.
Nat Biotechnol ; 2018 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-30320766

RESUMO

Full-length RNA sequencing (RNA-Seq) has been applied to bulk tissue, cell lines and sorted cells to characterize transcriptomes, but applying this technology to single cells has proven to be difficult, with less than ten single-cell transcriptomes having been analyzed thus far. Although single splicing events have been described for ≤200 single cells with statistical confidence, full-length mRNA analyses for hundreds of cells have not been reported. Single-cell short-read 3' sequencing enables the identification of cellular subtypes, but full-length mRNA isoforms for these cell types cannot be profiled. We developed a method that starts with bulk tissue and identifies single-cell types and their full-length RNA isoforms without fluorescence-activated cell sorting. Using single-cell isoform RNA-Seq (ScISOr-Seq), we identified RNA isoforms in neurons, astrocytes, microglia, and cell subtypes such as Purkinje and Granule cells, and cell-type-specific combination patterns of distant splice sites. We used ScISOr-Seq to improve genome annotation in mouse Gencode version 10 by determining the cell-type-specific expression of 18,173 known and 16,872 novel isoforms.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...