Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 51
Filtrar
1.
Nat Commun ; 11(1): 1172, 2020 03 03.
Artigo em Inglês | MEDLINE | ID: mdl-32127543

RESUMO

von Economo neurons (VENs) are bipolar, spindle-shaped neurons restricted to layer 5 of human frontoinsula and anterior cingulate cortex that appear to be selectively vulnerable to neuropsychiatric and neurodegenerative diseases, although little is known about other VEN cellular phenotypes. Single nucleus RNA-sequencing of frontoinsula layer 5 identifies a transcriptomically-defined cell cluster that contained VENs, but also fork cells and a subset of pyramidal neurons. Cross-species alignment of this cell cluster with a well-annotated mouse classification shows strong homology to extratelencephalic (ET) excitatory neurons that project to subcerebral targets. This cluster also shows strong homology to a putative ET cluster in human temporal cortex, but with a strikingly specific regional signature. Together these results suggest that VENs are a regionally distinctive type of ET neuron. Additionally, we describe the first patch clamp recordings of VENs from neurosurgically-resected tissue that show distinctive intrinsic membrane properties relative to neighboring pyramidal neurons.


Assuntos
Neurônios/fisiologia , Lobo Temporal/citologia , Transcriptoma , Animais , Encéfalo/citologia , Encéfalo/fisiologia , Eletrofisiologia/métodos , Perfilação da Expressão Gênica , Humanos , Hibridização in Situ Fluorescente , Camundongos , Neurônios/citologia , Células Piramidais/fisiologia , Telencéfalo/citologia , Lobo Temporal/fisiologia
2.
Blood ; 134(1): 30-43, 2019 07 04.
Artigo em Inglês | MEDLINE | ID: mdl-31023703

RESUMO

The era of targeted therapies has seen significant improvements in depth of response, progression-free survival, and overall survival for patients with multiple myeloma. Despite these improvements in clinical outcome, patients inevitably relapse and require further treatment. Drug-resistant dormant myeloma cells that reside in specific niches within the skeleton are considered a basis of disease relapse but remain elusive and difficult to study. Here, we developed a method to sequence the transcriptome of individual dormant myeloma cells from the bones of tumor-bearing mice. Our analyses show that dormant myeloma cells express a distinct transcriptome signature enriched for immune genes and, unexpectedly, genes associated with myeloid cell differentiation. These genes were switched on by coculture with osteoblastic cells. Targeting AXL, a gene highly expressed by dormant cells, using small-molecule inhibitors released cells from dormancy and promoted their proliferation. Analysis of the expression of AXL and coregulated genes in human cohorts showed that healthy human controls and patients with monoclonal gammopathy of uncertain significance expressed higher levels of the dormancy signature genes than patients with multiple myeloma. Furthermore, in patients with multiple myeloma, the expression of this myeloid transcriptome signature translated into a twofold increase in overall survival, indicating that this dormancy signature may be a marker of disease progression. Thus, engagement of myeloma cells with the osteoblastic niche induces expression of a suite of myeloid genes that predicts disease progression and that comprises potential drug targets to eradicate dormant myeloma cells.


Assuntos
Mieloma Múltiplo/genética , Mieloma Múltiplo/patologia , Recidiva Local de Neoplasia/genética , Células-Tronco Neoplásicas/patologia , Nicho de Células-Tronco/genética , Animais , Humanos , Camundongos , Recidiva Local de Neoplasia/patologia , Proteínas Proto-Oncogênicas/genética , Receptores Proteína Tirosina Quinases/genética , Transcriptoma , Receptor Tirosina Quinase Axl
3.
PLoS One ; 13(12): e0209648, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30586455

RESUMO

Transcriptomic profiling of complex tissues by single-nucleus RNA-sequencing (snRNA-seq) affords some advantages over single-cell RNA-sequencing (scRNA-seq). snRNA-seq provides less biased cellular coverage, does not appear to suffer cell isolation-based transcriptional artifacts, and can be applied to archived frozen specimens. We used well-matched snRNA-seq and scRNA-seq datasets from mouse visual cortex to compare cell type detection. Although more transcripts are detected in individual whole cells (~11,000 genes) than nuclei (~7,000 genes), we demonstrate that closely related neuronal cell types can be similarly discriminated with both methods if intronic sequences are included in snRNA-seq analysis. We estimate that the nuclear proportion of total cellular mRNA varies from 20% to over 50% for large and small pyramidal neurons, respectively. Together, these results illustrate the high information content of nuclear RNA for characterization of cellular diversity in brain tissues.


Assuntos
Núcleo Celular/genética , Análise de Célula Única , Transcriptoma/genética , Córtex Visual/metabolismo , Animais , Linhagem da Célula/genética , Linhagem da Célula/fisiologia , Perfilação da Expressão Gênica/métodos , Camundongos , Neurônios/metabolismo , Análise de Sequência de RNA/métodos , Córtex Visual/fisiologia
4.
Nat Neurosci ; 21(9): 1185-1195, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-30150662

RESUMO

We describe convergent evidence from transcriptomics, morphology, and physiology for a specialized GABAergic neuron subtype in human cortex. Using unbiased single-nucleus RNA sequencing, we identify ten GABAergic interneuron subtypes with combinatorial gene signatures in human cortical layer 1 and characterize a group of human interneurons with anatomical features never described in rodents, having large 'rosehip'-like axonal boutons and compact arborization. These rosehip cells show an immunohistochemical profile (GAD1+CCK+, CNR1-SST-CALB2-PVALB-) matching a single transcriptomically defined cell type whose specific molecular marker signature is not seen in mouse cortex. Rosehip cells in layer 1 make homotypic gap junctions, predominantly target apical dendritic shafts of layer 3 pyramidal neurons, and inhibit backpropagating pyramidal action potentials in microdomains of the dendritic tuft. These cells are therefore positioned for potent local control of distal dendritic computation in cortical pyramidal neurons.


Assuntos
Córtex Cerebral/metabolismo , Córtex Cerebral/ultraestrutura , Neurônios GABAérgicos/metabolismo , Neurônios GABAérgicos/ultraestrutura , Transcriptoma , Adulto , Idoso , Axônios/ultraestrutura , Espinhas Dendríticas/metabolismo , Espinhas Dendríticas/ultraestrutura , Junções Comunicantes/metabolismo , Junções Comunicantes/ultraestrutura , Biblioteca Gênica , Humanos , Masculino , Reação em Cadeia da Polimerase , Terminações Pré-Sinápticas/metabolismo , Terminações Pré-Sinápticas/ultraestrutura , Células Piramidais/metabolismo , Células Piramidais/ultraestrutura , RNA/análise , RNA/genética , Análise de Sequência de RNA
5.
Nat Neurosci ; 21(7): 1016, 2018 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-29703932

RESUMO

In the version of this article initially published, NIH grant U01 MH106882 to F.H.G. was missing from the Acknowledgments. The error has been corrected in the HTML and PDF versions of the article.

6.
Hum Mol Genet ; 27(R1): R40-R47, 2018 05 01.
Artigo em Inglês | MEDLINE | ID: mdl-29590361

RESUMO

Cells are fundamental function units of multicellular organisms, with different cell types playing distinct physiological roles in the body. The recent advent of single-cell transcriptional profiling using RNA sequencing is producing 'big data', enabling the identification of novel human cell types at an unprecedented rate. In this review, we summarize recent work characterizing cell types in the human central nervous and immune systems using single-cell and single-nuclei RNA sequencing, and discuss the implications that these discoveries are having on the representation of cell types in the reference Cell Ontology (CL). We propose a method, based on random forest machine learning, for identifying sets of necessary and sufficient marker genes, which can be used to assemble consistent and reproducible cell type definitions for incorporation into the CL. The representation of defined cell type classes and their relationships in the CL using this strategy will make the cell type classes being identified by high-throughput/high-content technologies findable, accessible, interoperable and reusable (FAIR), allowing the CL to serve as a reference knowledgebase of information about the role that distinct cellular phenotypes play in human health and disease.


Assuntos
Big Data , Perfilação da Expressão Gênica/tendências , Análise de Sequência de RNA/tendências , Análise de Célula Única/tendências , Linhagem da Célula/genética , Humanos , Transcriptoma/genética
7.
Sci Rep ; 7(1): 12268, 2017 09 25.
Artigo em Inglês | MEDLINE | ID: mdl-28947747

RESUMO

Chest pain is a leading reason patients seek medical evaluation. While assays to detect myocyte death are used to diagnose a heart attack (acute myocardial infarction, AMI), there is no biomarker to indicate an impending cardiac event. Transcriptional patterns present in circulating endothelial cells (CEC) may provide a window into the plaque rupture process and identify a proximal biomarker for AMI. Thus, we aimed to identify a transcriptomic signature of AMI present in whole blood, but derived from CECs. Candidate genes indicative of AMI were nominated from microarray of enriched CEC samples, and then verified for detectability and predictive potential via qPCR in whole blood. This signature was validated in an independent cohort. Our findings suggest that a whole blood CEC-derived molecular signature identifies patients with AMI and sets the framework to potentially identify the earlier stages of an impending cardiac event when used in concert with clinical history and other diagnostics where conventional biomarkers indicative of myonecrosis remain undetected.


Assuntos
Biomarcadores/sangue , Células Endoteliais/patologia , Perfilação da Expressão Gênica , Infarto do Miocárdio/patologia , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Feminino , Humanos , Masculino , Análise em Microsséries , Pessoa de Meia-Idade , Reação em Cadeia da Polimerase em Tempo Real , Adulto Jovem
9.
BMC Genomics ; 18(1): 458, 2017 06 12.
Artigo em Inglês | MEDLINE | ID: mdl-28606096

RESUMO

BACKGROUND: Cancer research to date has largely focused on somatically acquired genetic aberrations. In contrast, the degree to which germline, or inherited, variation contributes to tumorigenesis remains unclear, possibly due to a lack of accessible germline variant data. Here we called germline variants on 9618 cases from The Cancer Genome Atlas (TCGA) database representing 31 cancer types. RESULTS: We identified batch effects affecting loss of function (LOF) variant calls that can be traced back to differences in the way the sequence data were generated both within and across cancer types. Overall, LOF indel calls were more sensitive to technical artifacts than LOF Single Nucleotide Variant (SNV) calls. In particular, whole genome amplification of DNA prior to sequencing led to an artificially increased burden of LOF indel calls, which confounded association analyses relating germline variants to tumor type despite stringent indel filtering strategies. The samples affected by these technical artifacts include all acute myeloid leukemia and practically all ovarian cancer samples. CONCLUSIONS: We demonstrate how technical artifacts induced by whole genome amplification of DNA can lead to false positive germline-tumor type associations and suggest TCGA whole genome amplified samples be used with caution. This study draws attention to the need to be sensitive to problems associated with a lack of uniformity in data generation in TCGA data.


Assuntos
Artefatos , Bases de Dados Genéticas , Genômica , Mutação em Linhagem Germinativa , Neoplasias/genética , Genoma Humano/genética , Humanos , Mutação com Perda de Função
10.
Environ Microbiol ; 19(7): 2769-2784, 2017 07.
Artigo em Inglês | MEDLINE | ID: mdl-28474498

RESUMO

Candidate phyla (CP) are broad phylogenetic clusters of organisms that lack cultured representatives. Included in this fraction is the candidate Parcubacteria superphylum. Specific characteristics that have been ascribed to the Parcubacteria include reduced genome size, limited metabolic potential and exclusive reliance on fermentation for energy acquisition. The study of new environmental niches, such as the marine versus terrestrial subsurface, often expands the understanding of the genetic potential of taxonomic groups. For this reason, we analyzed 12 Parcubacteria single amplified genomes (SAGs) from sediment samples collected within the Challenger Deep of the Mariana Trench, obtained during the Deepsea Challenge (DSC) Expedition. Many of these SAGs are closely related to environmental sequences obtained from deep-sea environments based on 16S rRNA gene similarity and BLAST matches to predicted proteins. DSC SAGs encode features not previously identified in Parcubacteria obtained from other habitats. These include adaptation to oxidative stress, polysaccharide modification and genes associated with respiratory nitrate reduction. The DSC SAGs are also distinguished by relative greater abundance of genes for nucleotide and amino acid biosynthesis, repair of alkylated DNA and the synthesis of mechanosensitive ion channels. These results present an expanded view of the Parcubacteria, among members residing in an ultra-deep hadal environment.


Assuntos
Bactérias/genética , Genoma Bacteriano/genética , Sedimentos Geológicos/microbiologia , Análise de Célula Única/métodos , Aminoácidos/biossíntese , Bactérias/metabolismo , Reparo do DNA/genética , Ecossistema , Meio Ambiente , Tamanho do Genoma/genética , Nitrato Redutases/genética , Nitratos/metabolismo , Oceanos e Mares , Filogenia , Polissacarídeos/metabolismo , RNA Ribossômico 16S/genética
11.
BMC Bioinformatics ; 18(Suppl 17): 559, 2017 12 21.
Artigo em Inglês | MEDLINE | ID: mdl-29322913

RESUMO

BACKGROUND: A fundamental characteristic of multicellular organisms is the specialization of functional cell types through the process of differentiation. These specialized cell types not only characterize the normal functioning of different organs and tissues, they can also be used as cellular biomarkers of a variety of different disease states and therapeutic/vaccine responses. In order to serve as a reference for cell type representation, the Cell Ontology has been developed to provide a standard nomenclature of defined cell types for comparative analysis and biomarker discovery. Historically, these cell types have been defined based on unique cellular shapes and structures, anatomic locations, and marker protein expression. However, we are now experiencing a revolution in cellular characterization resulting from the application of new high-throughput, high-content cytometry and sequencing technologies. The resulting explosion in the number of distinct cell types being identified is challenging the current paradigm for cell type definition in the Cell Ontology. RESULTS: In this paper, we provide examples of state-of-the-art cellular biomarker characterization using high-content cytometry and single cell RNA sequencing, and present strategies for standardized cell type representations based on the data outputs from these cutting-edge technologies, including "context annotations" in the form of standardized experiment metadata about the specimen source analyzed and marker genes that serve as the most useful features in machine learning-based cell type classification models. We also propose a statistical strategy for comparing new experiment data to these standardized cell type representations. CONCLUSION: The advent of high-throughput/high-content single cell technologies is leading to an explosion in the number of distinct cell types being identified. It will be critical for the bioinformatics community to develop and adopt data standard conventions that will be compatible with these new technologies and support the data representation needs of the research community. The proposals enumerated here will serve as a useful starting point to address these challenges.


Assuntos
Ontologias Biológicas , Biomarcadores/metabolismo , Células/classificação , Células/metabolismo , Biologia Computacional/métodos , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Humanos
12.
Pac Symp Biocomput ; 22: 564-575, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-27897007

RESUMO

Next generation sequencing of the RNA content of single cells or single nuclei (sc/nRNA-seq) has become a powerful approach to understand the cellular complexity and diversity of multicellular organisms and environmental ecosystems. However, the fact that the procedure begins with a relatively small amount of starting material, thereby pushing the limits of the laboratory procedures required, dictates that careful approaches for sample quality control (QC) are essential to reduce the impact of technical noise and sample bias in downstream analysis applications. Here we present a preliminary framework for sample level quality control that is based on the collection of a series of quantitative laboratory and data metrics that are used as features for the construction of QC classification models using random forest machine learning approaches. We've applied this initial framework to a dataset comprised of 2272 single nuclei RNA-seq results and determined that ~79% of samples were of high quality. Removal of the poor quality samples from downstream analysis was found to improve the cell type clustering results. In addition, this approach identified quantitative features related to the proportion of unique or duplicate reads and the proportion of reads remaining after quality trimming as useful features for pass/fail classification. The construction and use of classification models for the identification of poor quality samples provides for an objective and scalable approach to sc/nRNA-seq quality control.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/estatística & dados numéricos , Neocórtex/citologia , Neocórtex/metabolismo , RNA Nuclear/genética , Análise de Sequência de RNA/estatística & dados numéricos , Autopsia , Viés , Núcleo Celular/genética , Biologia Computacional , Bases de Dados de Ácidos Nucleicos , Árvores de Decisões , Sequenciamento de Nucleotídeos em Larga Escala/normas , Humanos , Aprendizado de Máquina , Controle de Qualidade , Análise de Sequência de RNA/normas , Análise de Célula Única , Software
13.
Nat Neurosci ; 19(12): 1583-1591, 2016 12.
Artigo em Inglês | MEDLINE | ID: mdl-27618310

RESUMO

The healthy human brain is a mosaic of varied genomes. Long interspersed element-1 (LINE-1 or L1) retrotransposition is known to create mosaicism by inserting L1 sequences into new locations of somatic cell genomes. Using a machine learning-based, single-cell sequencing approach, we discovered that somatic L1-associated variants (SLAVs) are composed of two classes: L1 retrotransposition insertions and retrotransposition-independent L1-associated variants. We demonstrate that a subset of SLAVs comprises somatic deletions generated by L1 endonuclease cutting activity. Retrotransposition-independent rearrangements in inherited L1s resulted in the deletion of proximal genomic regions. These rearrangements were resolved by microhomology-mediated repair, which suggests that L1-associated genomic regions are hotspots for somatic copy number variants in the brain and therefore a heritable genetic contributor to somatic mosaicism. We demonstrate that SLAVs are present in crucial neural genes, such as DLG2 (also called PSD93), and affect 44-63% of cells of the cells in the healthy brain.


Assuntos
Encéfalo/metabolismo , Elementos Nucleotídeos Longos e Dispersos/genética , Neurônios/metabolismo , Células Cultivadas , Dosagem de Genes , Estudo de Associação Genômica Ampla/métodos , Genômica/métodos , Humanos , Deleção de Sequência
14.
Nat Protoc ; 11(3): 499-524, 2016 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-26890679

RESUMO

A protocol is described for sequencing the transcriptome of a cell nucleus. Nuclei are isolated from specimens and sorted by FACS, cDNA libraries are constructed and RNA-seq is performed, followed by data analysis. Some steps follow published methods (Smart-seq2 for cDNA synthesis and Nextera XT barcoded library preparation) and are not described in detail here. Previous single-cell approaches for RNA-seq from tissues include cell dissociation using protease treatment at 30 °C, which is known to alter the transcriptome. We isolate nuclei at 4 °C from tissue homogenates, which cause minimal damage. Nuclear transcriptomes can be obtained from postmortem human brain tissue stored at -80 °C, making brain archives accessible for RNA-seq from individual neurons. The method also allows investigation of biological features unique to nuclei, such as enrichment of certain transcripts and precursors of some noncoding RNAs. By following this procedure, it takes about 4 d to construct cDNA libraries that are ready for sequencing.


Assuntos
Encéfalo/citologia , Nucléolo Celular/genética , Neurônios/metabolismo , RNA/genética , Análise de Sequência de RNA/métodos , Transcriptoma , Autopsia/métodos , Fracionamento Celular/métodos , DNA Complementar/genética , Perfilação da Expressão Gênica/métodos , Biblioteca Gênica , Humanos , Análise de Célula Única/métodos , Preservação de Tecido
15.
Appl Environ Microbiol ; 81(24): 8265-76, 2015 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-26386059

RESUMO

Hadal ecosystems are found at a depth of 6,000 m below sea level and below, occupying less than 1% of the total area of the ocean. The microbial communities and metabolic potential in these ecosystems are largely uncharacterized. Here, we present four single amplified genomes (SAGs) obtained from 8,219 m below the sea surface within the hadal ecosystem of the Puerto Rico Trench (PRT). These SAGs are derived from members of deep-sea clades, including the Thaumarchaeota and SAR11 clade, and two are related to previously isolated piezophilic (high-pressure-adapted) microorganisms. In order to identify genes that might play a role in adaptation to deep-sea environments, comparative analyses were performed with genomes from closely related shallow-water microbes. The archaeal SAG possesses genes associated with mixotrophy, including lipoylation and the glycine cleavage pathway. The SAR11 SAG encodes glycolytic enzymes previously reported to be missing from this abundant and cosmopolitan group. The other SAGs, which are related to piezophilic isolates, possess genes that may supplement energy demands through the oxidation of hydrogen or the reduction of nitrous oxide. We found evidence for potential trench-specific gene distributions, as several SAG genes were observed only in a PRT metagenome and not in shallower deep-sea metagenomes. These results illustrate new ecotype features that might perform important roles in the adaptation of microorganisms to life in hadal environments.


Assuntos
Archaea/classificação , Archaea/genética , Genoma Arqueal/genética , Metagenoma/genética , Água do Mar/microbiologia , Aclimatação , Archaea/isolamento & purificação , Sequência de Bases , DNA Arqueal/genética , Ecossistema , Metabolismo Energético/fisiologia , Ácidos Graxos/metabolismo , Lipídeos/biossíntese , Dados de Sequência Molecular , Oceanos e Mares , Porto Rico , RNA Ribossômico 16S/genética , Análise de Sequência de DNA , Enxofre/metabolismo , Microbiologia da Água
16.
Proc Natl Acad Sci U S A ; 112(31): E4236-45, 2015 Aug 04.
Artigo em Inglês | MEDLINE | ID: mdl-26199412

RESUMO

Dioxygenases of the TET (Ten-Eleven Translocation) family produce oxidized methylcytosines, intermediates in DNA demethylation, as well as new epigenetic marks. Here we show data suggesting that TET proteins maintain the consistency of gene transcription. Embryos lacking Tet1 and Tet3 (Tet1/3 DKO) displayed a strong loss of 5-hydroxymethylcytosine (5hmC) and a concurrent increase in 5-methylcytosine (5mC) at the eight-cell stage. Single cells from eight-cell embryos and individual embryonic day 3.5 blastocysts showed unexpectedly variable gene expression compared with controls, and this variability correlated in blastocysts with variably increased 5mC/5hmC in gene bodies and repetitive elements. Despite the variability, genes encoding regulators of cholesterol biosynthesis were reproducibly down-regulated in Tet1/3 DKO blastocysts, resulting in a characteristic phenotype of holoprosencephaly in the few embryos that survived to later stages. Thus, TET enzymes and DNA cytosine modifications could directly or indirectly modulate transcriptional noise, resulting in the selective susceptibility of certain intracellular pathways to regulation by TET proteins.


Assuntos
5-Metilcitosina/metabolismo , Proteínas de Ligação a DNA/metabolismo , Desenvolvimento Embrionário/genética , Deleção de Genes , Regulação da Expressão Gênica no Desenvolvimento , Proteínas Proto-Oncogênicas/metabolismo , Transcriptoma/genética , Animais , Biomarcadores/metabolismo , Blastocisto/metabolismo , Blastômeros/metabolismo , Blastômeros/patologia , Linhagem da Célula , Colesterol/biossíntese , DNA/metabolismo , Proteínas de Ligação a DNA/deficiência , Proteínas de Ligação a DNA/genética , Dioxigenases , Regulação para Baixo/genética , Perda do Embrião/metabolismo , Perda do Embrião/patologia , Embrião de Mamíferos/patologia , Impressão Genômica , Proteínas Hedgehog/metabolismo , Camundongos Endogâmicos C57BL , Camundongos Knockout , Proteínas Proto-Oncogênicas/deficiência , Proteínas Proto-Oncogênicas/genética , Sequências Repetitivas de Ácido Nucleico/genética , Análise de Sequência de RNA , Transdução de Sinais/genética
17.
BMC Bioinformatics ; 15: 357, 2014 Nov 19.
Artigo em Inglês | MEDLINE | ID: mdl-25407910

RESUMO

BACKGROUND: Deep shotgun sequencing on next generation sequencing (NGS) platforms has contributed significant amounts of data to enrich our understanding of genomes, transcriptomes, amplified single-cell genomes, and metagenomes. However, deep coverage variations in short-read data sets and high sequencing error rates of modern sequencers present new computational challenges in data interpretation, including mapping and de novo assembly. New lab techniques such as multiple displacement amplification (MDA) of single cells and sequence independent single primer amplification (SISPA) allow for sequencing of organisms that cannot be cultured, but generate highly variable coverage due to amplification biases. RESULTS: Here we introduce NeatFreq, a software tool that reduces a data set to more uniform coverage by clustering and selecting from reads binned by their median kmer frequency (RMKF) and uniqueness. Previous algorithms normalize read coverage based on RMKF, but do not include methods for the preferred selection of (1) extremely low coverage regions produced by extremely variable sequencing of random-primed products and (2) 2-sided paired-end sequences. The algorithm increases the incorporation of the most unique, lowest coverage, segments of a genome using an error-corrected data set. NeatFreq was applied to bacterial, viral plaque, and single-cell sequencing data. The algorithm showed an increase in the rate at which the most unique reads in a genome were included in the assembled consensus while also reducing the count of duplicative and erroneous contigs (strings of high confidence overlaps) in the deliverable consensus. The results obtained from conventional Overlap-Layout-Consensus (OLC) were compared to simulated multi-de Bruijn graph assembly alternatives trained for variable coverage input using sequence before and after normalization of coverage. Coverage reduction was shown to increase processing speed and reduce memory requirements when using conventional bacterial assembly algorithms. CONCLUSIONS: The normalization of deep coverage spikes, which would otherwise inhibit consensus resolution, enables High Throughput Sequencing (HTS) assembly projects to consistently run to completion with existing assembly software. The NeatFreq software package is free, open source and available at https://github.com/bioh4x/NeatFreq .


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Software , Algoritmos , Genômica
18.
Nat Rev Genet ; 15(9): 577-84, 2014 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-25091868

RESUMO

The vast majority of microbial species remain uncultivated and, until recently, about half of all known bacterial phyla were identified only from their 16S ribosomal RNA gene sequence. With the advent of single-cell sequencing, genomes of uncultivated species are rapidly filling in unsequenced branches of the microbial phylogenetic tree. The wealth of new insights gained from these previously inaccessible groups is providing a deeper understanding of their basic biology, taxonomy and evolution, as well as their diverse roles in environmental ecosystems and human health.


Assuntos
Bactérias/genética , Genoma Bacteriano/genética , Genômica , Microbiota , Análise de Sequência de DNA/métodos , Bactérias/classificação , DNA Bacteriano/química , DNA Bacteriano/genética , DNA Ribossômico/química , DNA Ribossômico/genética , Evolução Molecular , Variação Genética , Filogenia , RNA Ribossômico 16S/genética
19.
Genome Med ; 6(11): 108, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25621014

RESUMO

Genomic sequencing from single cells is a powerful tool in microbiology and holds great promise for infectious disease research. Vast numbers of uncultivable species and pathogens that persist at low abundance in environmental reservoirs are now accessible for genomic analysis.

20.
Science ; 342(6158): 632-7, 2013 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-24179226

RESUMO

We used single-cell genomic approaches to map DNA copy number variation (CNV) in neurons obtained from human induced pluripotent stem cell (hiPSC) lines and postmortem human brains. We identified aneuploid neurons, as well as numerous subchromosomal CNVs in euploid neurons. Neurotypic hiPSC-derived neurons had larger CNVs than fibroblasts, and several large deletions were found in hiPSC-derived neurons but not in matched neural progenitor cells. Single-cell sequencing of endogenous human frontal cortex neurons revealed that 13 to 41% of neurons have at least one megabase-scale de novo CNV, that deletions are twice as common as duplications, and that a subset of neurons have highly aberrant genomes marked by multiple alterations. Our results show that mosaic CNV is abundant in human neurons.


Assuntos
Variações do Número de Cópias de DNA , Lobo Frontal/citologia , Mosaicismo , Células-Tronco Neurais/citologia , Neurônios/citologia , Aneuploidia , Humanos , Células-Tronco Pluripotentes Induzidas/citologia , Masculino , Neurogênese , Análise de Sequência de DNA , Deleção de Sequência , Análise de Célula Única
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...