Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Nat Biotechnol ; 40(7): 1082-1092, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35256815

RESUMO

Single-nuclei RNA sequencing characterizes cell types at the gene level. However, compared to single-cell approaches, many single-nuclei cDNAs are purely intronic, lack barcodes and hinder the study of isoforms. Here we present single-nuclei isoform RNA sequencing (SnISOr-Seq). Using microfluidics, PCR-based artifact removal, target enrichment and long-read sequencing, SnISOr-Seq increased barcoded, exon-spanning long reads 7.5-fold compared to naive long-read single-nuclei sequencing. We applied SnISOr-Seq to adult human frontal cortex and found that exons associated with autism exhibit coordinated and highly cell-type-specific inclusion. We found two distinct combination patterns: those distinguishing neural cell types, enriched in TSS-exon, exon-polyadenylation-site and non-adjacent exon pairs, and those with multiple configurations within one cell type, enriched in adjacent exon pairs. Finally, we observed that human-specific exons are almost as tightly coordinated as conserved exons, implying that coordination can be rapidly established during evolution. SnISOr-Seq enables cell-type-specific long-read isoform analysis in human brain and in any frozen or hard-to-dissociate sample.


Assuntos
Encéfalo , RNA , Processamento Alternativo/genética , Encéfalo/metabolismo , Éxons/genética , Humanos , Isoformas de Proteínas/genética , RNA/genética , Análise de Sequência de RNA
2.
Nat Commun ; 12(1): 463, 2021 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-33469025

RESUMO

Splicing varies across brain regions, but the single-cell resolution of regional variation is unclear. We present a single-cell investigation of differential isoform expression (DIE) between brain regions using single-cell long-read sequencing in mouse hippocampus and prefrontal cortex in 45 cell types at postnatal day 7 ( www.isoformAtlas.com ). Isoform tests for DIE show better performance than exon tests. We detect hundreds of DIE events traceable to cell types, often corresponding to functionally distinct protein isoforms. Mostly, one cell type is responsible for brain-region specific DIE. However, for fewer genes, multiple cell types influence DIE. Thus, regional identity can, although rarely, override cell-type specificity. Cell types indigenous to one anatomic structure display distinctive DIE, e.g. the choroid plexus epithelium manifests distinct transcription-start-site usage. Spatial transcriptomics and long-read sequencing yield a spatially resolved splicing map. Our methods quantify isoform expression with cell-type and spatial resolution and it contributes to further our understanding of how the brain integrates molecular and cellular complexity.


Assuntos
Processamento Alternativo/fisiologia , Regulação da Expressão Gênica no Desenvolvimento/fisiologia , Hipocampo/metabolismo , Córtex Pré-Frontal/metabolismo , Isoformas de Proteínas/metabolismo , Animais , Animais Recém-Nascidos , Biologia Computacional , Feminino , Hipocampo/citologia , Hipocampo/crescimento & desenvolvimento , Camundongos , Modelos Animais , Córtex Pré-Frontal/citologia , Córtex Pré-Frontal/crescimento & desenvolvimento , Isoformas de Proteínas/análise , Isoformas de Proteínas/genética , Análise de Célula Única/métodos , Análise Espacial
3.
Front Genet ; 10: 709, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31475029

RESUMO

The advent of second-generation sequencing and its application to RNA sequencing have revolutionized the field of genomics by allowing quantification of gene expression, as well as the definition of transcription start/end sites, exons, splice sites and RNA editing sites. However, due to the sequencing of fragments of cDNAs, these methods have not given a reliable picture of complete RNA isoforms. Third-generation sequencing has filled this gap and allows end-to-end sequencing of entire RNA/cDNA molecules. This approach to transcriptomics has been a "niche" technology for a couple of years but now is becoming mainstream with many different applications. Here, we review the background and progress made to date in this rapidly growing field. We start by reviewing the progressive realization that alternative splicing is omnipresent. We then focus on long-noncoding RNA isoforms and the distinct combination patterns of exons in noncoding and coding genes. We consider the implications of the recent technologies of direct RNA sequencing and single-cell isoform RNA sequencing. Finally, we discuss the parameters that define the success of long-read RNA sequencing experiments and strategies commonly used to make the most of such data.

4.
Nat Protoc ; 14(7): 2119-2151, 2019 07.
Artigo em Inglês | MEDLINE | ID: mdl-31217595

RESUMO

Next-generation sequencing (NGS) has been widely adopted to identify genetic variants and investigate their association with disease. However, the analysis of sequencing data remains challenging because of the complexity of human genetic variation and confounding errors introduced during library preparation, sequencing and analysis. We have developed a set of synthetic DNA spike-ins-termed 'sequins' (sequencing spike-ins)-that are directly added to DNA samples before library preparation. Sequins can be used to measure technical biases and to act as internal quantitative and qualitative controls throughout the sequencing workflow. This step-by-step protocol explains the use of sequins for both whole-genome and targeted sequencing of the human genome. This includes instructions regarding the dilution and addition of sequins to human DNA samples, followed by the bioinformatic steps required to separate sequin- and sample-derived sequencing reads and to evaluate the diagnostic performance of the assay. These practical guidelines are accompanied by a broader discussion of the conceptual and statistical principles that underpin the design of sequin standards. This protocol is suitable for users with standard laboratory and bioinformatic experience. The laboratory steps require ~1-4 d and the bioinformatic steps (which can be performed with the provided example data files) take an additional day.


Assuntos
DNA/síntese química , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Calibragem , Biologia Computacional/métodos , DNA/genética , Sequenciamento de Nucleotídeos em Larga Escala/normas , Humanos , Células K562 , Células MCF-7 , Neoplasias/genética , Proteínas Proto-Oncogênicas B-raf/genética
5.
Front Genet ; 10: 309, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31031799

RESUMO

The human brain is one of the last frontiers of biomedical research. Genome-wide association studies (GWAS) have succeeded in identifying thousands of haplotype blocks associated with a range of neuropsychiatric traits, including disorders such as schizophrenia, Alzheimer's and Parkinson's disease. However, the majority of single nucleotide polymorphisms (SNPs) that mark these haplotype blocks fall within non-coding regions of the genome, hindering their functional validation. While some of these GWAS loci may contain cis-acting regulatory DNA elements such as enhancers, we hypothesized that many are also transcribed into non-coding RNAs that are missing from publicly available transcriptome annotations. Here, we use targeted RNA capture ('RNA CaptureSeq') in combination with nanopore long-read cDNA sequencing to transcriptionally profile 1,023 haplotype blocks across the genome containing non-coding GWAS SNPs associated with neuropsychiatric traits, using post-mortem human brain tissue from three neurologically healthy donors. We find that the majority (62%) of targeted haplotype blocks, including 13% of intergenic blocks, are transcribed into novel, multi-exonic RNAs, most of which are not yet recorded in GENCODE annotations. We validated our findings with short-read RNA-seq, providing orthogonal confirmation of novel splice junctions and enabling a quantitative assessment of the long-read assemblies. Many novel transcripts are supported by independent evidence of transcription including cap analysis of gene expression (CAGE) data and epigenetic marks, and some show signs of potential functional roles. We present these transcriptomes as a preliminary atlas of non-coding transcription in human brain that can be used to connect neurological phenotypes with gene expression.

6.
Nat Commun ; 9(1): 3096, 2018 08 06.
Artigo em Inglês | MEDLINE | ID: mdl-30082706

RESUMO

The complexity of microbial communities, combined with technical biases in next-generation sequencing, pose a challenge to metagenomic analysis. Here, we develop a set of internal DNA standards, termed "sequins" (sequencing spike-ins), that together constitute a synthetic community of artificial microbial genomes. Sequins are added to environmental DNA samples prior to library preparation, and undergo concurrent sequencing with the accompanying sample. We validate the performance of sequins by comparison to mock microbial communities, and demonstrate their use in the analysis of real metagenome samples. We show how sequins can be used to measure fold change differences in the size and structure of accompanying microbial communities, and perform quantitative normalization between samples. We further illustrate how sequins can be used to benchmark and optimize new methods, including nanopore long-read sequencing technology. We provide metagenome sequins, along with associated data sets, protocols, and an accompanying software toolkit, as reference standards to aid in metagenomic studies.


Assuntos
Metagenoma , Metagenômica , Análise de Sequência de DNA , DNA Bacteriano/análise , Biblioteca Gênica , Genoma Bacteriano , Sequenciamento de Nucleotídeos em Larga Escala , Modelos Biológicos , Nanoporos , Filogenia , Padrões de Referência , Reprodutibilidade dos Testes , Software
7.
Nat Rev Genet ; 18(8): 473-484, 2017 08.
Artigo em Inglês | MEDLINE | ID: mdl-28626224

RESUMO

Next-generation sequencing (NGS) provides a broad investigation of the genome, and it is being readily applied for the diagnosis of disease-associated genetic features. However, the interpretation of NGS data remains challenging owing to the size and complexity of the genome and the technical errors that are introduced during sample preparation, sequencing and analysis. These errors can be understood and mitigated through the use of reference standards - well-characterized genetic materials or synthetic spike-in controls that help to calibrate NGS measurements and to evaluate diagnostic performance. The informed use of reference standards, and associated statistical principles, ensures rigorous analysis of NGS data and is essential for its future clinical use.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/normas , Análise de Sequência de DNA/normas , Animais , Humanos , Padrões de Referência
8.
Trends Genet ; 33(7): 464-478, 2017 07.
Artigo em Inglês | MEDLINE | ID: mdl-28535931

RESUMO

The combination of pervasive transcription and prolific alternative splicing produces a mammalian transcriptome of great breadth and diversity. The majority of transcribed genomic bases are intronic, antisense, or intergenic to protein-coding genes, yielding a plethora of short and long non-protein-coding regulatory RNAs. Long noncoding RNAs (lncRNAs) share most aspects of their biogenesis, processing, and regulation with mRNAs. However, lncRNAs are typically expressed in more restricted patterns, frequently from enhancers, and exhibit almost universal alternative splicing. These features are consistent with their role as modular epigenetic regulators. We describe here the key studies and technological advances that have shaped our understanding of the dimensions, dynamics, and biological relevance of the mammalian noncoding transcriptome.


Assuntos
RNA não Traduzido/genética , Transcriptoma , Processamento Alternativo , Animais , Éxons , Humanos
9.
Bioinformatics ; 33(11): 1723-1724, 2017 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-28130232

RESUMO

SUMMARY: Spike-in controls are synthetic nucleic-acid sequences that are added to a user's sample and constitute internal standards for subsequent steps in the next generation sequencing workflow. AVAILABILITY AND IMPLEMENTATION: : The software is implemented in C ++/R and is freely available under BSD license. The source code is available from github.com/student-t/Anaquin , binaries and user manual from www.sequin.xyz/software and R package from bioconductor.org/packages/Anaquin. CONTACT: anaquin@garvan.org.au or t.mercer@garvan.org.au. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Análise de Sequência de RNA/métodos , Software , Humanos
10.
Nat Methods ; 13(9): 784-91, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-27502217

RESUMO

The identification of genetic variation with next-generation sequencing is confounded by the complexity of the human genome sequence and by biases that arise during library preparation, sequencing and analysis. We have developed a set of synthetic DNA standards, termed 'sequins', that emulate human genetic features and constitute qualitative and quantitative spike-in controls for genome sequencing. Sequencing reads derived from sequins align exclusively to an artificial in silico reference chromosome, rather than the human reference genome, which allows them them to be partitioned for parallel analysis. Here we use this approach to represent common and clinically relevant genetic variation, ranging from single nucleotide variants to large structural rearrangements and copy-number variation. We validate the design and performance of sequin standards by comparison to examples in the NA12878 reference genome, and we demonstrate their utility during the detection and quantification of variants. We provide sequins as a standardized, quantitative resource against which human genetic variation can be measured and diagnostic performance assessed.


Assuntos
Variações do Número de Cópias de DNA , DNA/genética , Genoma Humano , Genômica/métodos , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA/métodos , Cromossomos Artificiais/química , Cromossomos Artificiais/genética , DNA/síntese química , DNA/química , Humanos , Padrões de Referência , Análise de Sequência de DNA/normas
11.
Nat Methods ; 13(9): 792-8, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-27502218

RESUMO

RNA sequencing (RNA-seq) can be used to assemble spliced isoforms, quantify expressed genes and provide a global profile of the transcriptome. However, the size and diversity of the transcriptome, the wide dynamic range in gene expression and inherent technical biases confound RNA-seq analysis. We have developed a set of spike-in RNA standards, termed 'sequins' (sequencing spike-ins), that represent full-length spliced mRNA isoforms. Sequins have an entirely artificial sequence with no homology to natural reference genomes, but they align to gene loci encoded on an artificial in silico chromosome. The combination of multiple sequins across a range of concentrations emulates alternative splicing and differential gene expression, and it provides scaling factors for normalization between samples. We demonstrate the use of sequins in RNA-seq experiments to measure sample-specific biases and determine the limits of reliable transcript assembly and quantification in accompanying human RNA samples. In addition, we have designed a complementary set of sequins that represent fusion genes arising from rearrangements of the in silico chromosome to aid in cancer diagnosis. RNA sequins provide a qualitative and quantitative reference with which to navigate the complexity of the human transcriptome.


Assuntos
Perfilação da Expressão Gênica/normas , Genes Sintéticos , Splicing de RNA , RNA Mensageiro/genética , Análise de Sequência de RNA/normas , Cromossomos Artificiais , Humanos , Controle de Qualidade , Splicing de RNA/genética , RNA Mensageiro/síntese química , RNA Mensageiro/química , Padrões de Referência , Análise de Sequência de RNA/métodos
12.
ISME J ; 3(2): 209-15, 2009 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-18923456

RESUMO

DNA sequencing, phylogenetic and mapping studies suggest that the class 1 integron found in pathogens arose when one member of the diverse family of environmental class 1 integrons became embedded into a Tn402 transposon. However, the timing of this event and the selective forces that first fixed the newly formed element in a bacterial lineage are still unknown. Biocides have a longer use in clinical practice than antibiotics, and a qac (quaternary ammonium compound) resistance gene, or remnant thereof, is a normal feature of class 1 integrons recovered from clinical isolates. Consequently, it is possible that the initial selective advantage was conferred by resistance to biocides, mediated by qac. Here, we show that diverse qac gene cassettes are a dominant feature of cassette arrays from environmental class 1 integrons, and that they occur in the absence of any antibiotic resistance gene cassettes. They are present in arrays that are dynamic, acquiring and rearranging gene cassettes within the arrays. The abundance of qac gene cassettes makes them a likely candidate for participation in the original insertion into Tn402, and as a source of a readily selectable phenotype. More broadly, the increasing use of qac and other biocides at the present time seems likely to promote the fixation of further novel genetic elements, with unpredictable and potentially adverse consequences for human health and agriculture.


Assuntos
Bactérias/genética , Desinfetantes/farmacologia , Farmacorresistência Bacteriana , Microbiologia Ambiental , Integrons , Compostos de Amônio Quaternário/farmacologia , Bactérias/efeitos dos fármacos , Evolução Molecular , Transferência Genética Horizontal , Humanos
13.
FEMS Microbiol Lett ; 287(1): 56-62, 2008 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-18680525

RESUMO

Class 1 integrons are an important vector for the spread of antibiotic resistance. The core of this genetic element is highly conserved in all class 1 integrons recovered from clinical contexts. Recently, bacteria containing more divergent class 1 integrons have been isolated from environmental samples, suggesting undiscovered diversity in these elements. We performed a culture-independent survey of the class 1 integron-integrase gene (intI1) from environmental DNA, assessing sequence variation using capillary electrophoresis single-strand conformation polymorphism. This analysis allowed informed selection of environments for further investigation based on the diversity of intI1 targets that were present. IntI1 was common in environmental samples and exhibited previously unsuspected sequence diversity. The method allowed discrimination between clinical and environmental variants of intI1.


Assuntos
DNA Bacteriano/genética , Microbiologia Ambiental , Variação Genética , Integrases/genética , Integrons , Bactérias/genética , Dados de Sequência Molecular , Filogenia , Polimorfismo Conformacional de Fita Simples , Análise de Sequência de DNA
14.
FEMS Microbiol Lett ; 278(2): 207-12, 2008 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18042230

RESUMO

Integrons are bacterial genetic elements capable of capturing and expressing potentially adaptive genetic material. Class 1 integrons constitute the most intensely studied group of these elements to date, mainly due to their well-established role in the acquisition and dissemination of antibiotic resistance genes in clinical environments. However, virtually nothing is known about the distribution or abundance of class 1 integrons outside of the clinical context. Here we develop a SYBR Green-based real-time quantitative PCR assay capable of quantifying the abundance of class 1 integrons in environmental samples. It was shown that the abundance of the intI1 gene in creek sediment correlates with ecological condition, implying that class 1 integrons provide selective advantages relevant to environmental pressures other than the use of antibiotics. By comparing the quantities of intI1 and 16S rRNA gene in each sample, it was demonstrated that approximately 2.7% of cells potentially harbour a class 1 integron. These findings suggest that class 1 integrons are widespread in natural environments removed from clinical settings and occur in a broader range of host organisms than had previously been assumed on the basis of culture-dependent estimates.


Assuntos
Integrons/genética , Reação em Cadeia da Polimerase/métodos , Austrália , DNA/genética , DNA/isolamento & purificação , Geografia , Sedimentos Geológicos/química , RNA Ribossômico 16S/genética
15.
Eur J Hum Genet ; 15(12): 1218-29, 2007 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-17712354

RESUMO

Comprehensive genetic screening programs have led to the identification of pathogenic methyl-CpG-binding protein 2 (MECP2) mutations in up to 95% of classical Rett syndrome (RTT) patients. This high rate of mutation detection can partly be attributed to specialised techniques that have enabled the detection of large deletions in a substantial fraction of otherwise mutation-negative patients. These cases would normally be missed by the routine PCR-based screening strategies. Here, we have identified large multi-exonic deletions in 12/149 apparently mutation-negative RTT patients using multiplex ligation-dependent probe amplification (MLPA). These deletions were subsequently characterised using real-time quantitative PCR (qPCR) and long-range PCR with the ultimate aim of defining the exact nucleotide positions of the breakpoints and rearrangements. We detected an apparent deletion in one further patient using MLPA; however, this finding was contradicted by subsequent qPCR and long-range PCR results. The patient group includes an affected brother and sister with a large MECP2 deletion also present in their carrier mother. The X chromosome inactivation pattern of all female patients in this study was determined, which, coupled with detailed clinical information, allowed meaningful genotype-phenotype correlations to be drawn. This study reaffirms the view that large MECP2 deletions are an important cause of both classical and atypical RTT syndrome, and cautions that apparent deletions detected using high-throughput diagnostic techniques require further characterisation.


Assuntos
Família , Deleção de Genes , Proteína 2 de Ligação a Metil-CpG/genética , Síndrome de Rett/genética , Adolescente , Sequência de Bases , Criança , Pré-Escolar , Mapeamento Cromossômico , Análise Mutacional de DNA , Feminino , Rearranjo Gênico , Genótipo , Humanos , Lactente , Masculino , Dados de Sequência Molecular , Linhagem , Fenótipo , Reação em Cadeia da Polimerase , Inativação do Cromossomo X
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...