Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
bioRxiv ; 2024 Jan 07.
Artigo em Inglês | MEDLINE | ID: mdl-38260359

RESUMO

Direct nanopore-based RNA sequencing can be used to detect post-transcriptional base modifications, such as m6A methylation, based on the electric current signals produced by the distinct chemical structures of modified bases. A key challenge is the scarcity of adequate training data with known methylation modifications. We present Xron, a hybrid encoder-decoder framework that delivers a direct methylation-distinguishing basecaller by training on synthetic RNA data and immunoprecipitation-based experimental data in two steps. First, we generate data with more diverse modification combinations through in silico cross-linking. Second, we use this dataset to train an end-to-end neural network basecaller followed by fine-tuning on immunoprecipitation-based experimental data with label-smoothing. The trained neural network basecaller outperforms existing methylation detection methods on both read-level and site-level prediction scores. Xron is a standalone, end-to-end m6A-distinguishing basecaller capable of detecting methylated bases directly from raw sequencing signals, enabling de novo methylome assembly.

2.
Nat Genet ; 54(7): 1037-1050, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35789323

RESUMO

Zebrafish, a popular organism for studying embryonic development and for modeling human diseases, has so far lacked a systematic functional annotation program akin to those in other animal models. To address this, we formed the international DANIO-CODE consortium and created a central repository to store and process zebrafish developmental functional genomic data. Our data coordination center ( https://danio-code.zfin.org ) combines a total of 1,802 sets of unpublished and re-analyzed published genomic data, which we used to improve existing annotations and show its utility in experimental design. We identified over 140,000 cis-regulatory elements throughout development, including classes with distinct features dependent on their activity in time and space. We delineated the distinct distance topology and chromatin features between regulatory elements active during zygotic genome activation and those active during organogenesis. Finally, we matched regulatory elements and epigenomic landscapes between zebrafish and mouse and predicted functional relationships between them beyond sequence similarity, thus extending the utility of zebrafish developmental genomics to mammals.


Assuntos
Bases de Dados Genéticas , Regulação da Expressão Gênica no Desenvolvimento , Genoma , Genômica , Sequências Reguladoras de Ácido Nucleico , Proteínas de Peixe-Zebra , Peixe-Zebra , Animais , Cromatina/genética , Genoma/genética , Humanos , Camundongos , Anotação de Sequência Molecular , Organogênese/genética , Sequências Reguladoras de Ácido Nucleico/genética , Peixe-Zebra/embriologia , Peixe-Zebra/genética , Proteínas de Peixe-Zebra/genética
3.
Gigascience ; 9(3)2020 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-32170312

RESUMO

BACKGROUND: Over the past few years the variety of experimental designs and protocols for sequencing experiments increased greatly. To ensure the wide usability of the produced data beyond an individual project, rich and systematic annotation of the underlying experiments is crucial. FINDINGS: We first developed an annotation structure that captures the overall experimental design as well as the relevant details of the steps from the biological sample to the library preparation, the sequencing procedure, and the sequencing and processed files. Through various design features, such as controlled vocabularies and different field requirements, we ensured a high annotation quality, comparability, and ease of annotation. The structure can be easily adapted to a large variety of species. We then implemented the annotation strategy in a user-hosted web platform with data import, query, and export functionality. CONCLUSIONS: We present here an annotation structure and user-hosted platform for sequencing experiment data, suitable for lab-internal documentation, collaborations, and large-scale annotation efforts.


Assuntos
Anotação de Sequência Molecular/métodos , Análise de Sequência/métodos , Software , Anotação de Sequência Molecular/normas , Análise de Sequência/normas
4.
Mol Ecol ; 27(4): 886-897, 2018 02.
Artigo em Inglês | MEDLINE | ID: mdl-28746735

RESUMO

Natural habitats are exposed to an increasing number of environmental stressors that cause important ecological consequences. However, the multifarious nature of environmental change, the strength and the relative timing of each stressor largely limit our understanding of biological responses to environmental change. In particular, early response to unpredictable environmental change, critical to survival and fitness in later life stages, is largely uncharacterized. Here, we characterize the early transcriptional response of the keystone species Daphnia magna to twelve environmental perturbations, including biotic and abiotic stressors. We first perform a differential expression analysis aimed at identifying differential regulation of individual genes in response to stress. This preliminary analysis revealed that a few individual genes were responsive to environmental perturbations and they were modulated in a stressor and genotype-specific manner. Given the limited number of differentially regulated genes, we were unable to identify pathways involved in stress response. Hence, to gain a better understanding of the genetic and functional foundation of tolerance to multiple environmental stressors, we leveraged the correlative nature of networks and performed a weighted gene co-expression network analysis. We discovered that approximately one-third of the Daphnia genes, enriched for metabolism, cell signalling and general stress response, drives transcriptional early response to environmental stress and it is shared among genetic backgrounds. This initial response is followed by a genotype- and/or condition-specific transcriptional response with a strong genotype-by-environment interaction. Intriguingly, genotype- and condition-specific transcriptional response is found in genes not conserved beyond crustaceans, suggesting niche-specific adaptation.


Assuntos
Daphnia/genética , Redes Reguladoras de Genes , Transcrição Gênica , Animais , Sequência Conservada , Regulação da Expressão Gênica , Genoma , Genótipo , Família Multigênica
5.
Aging (Albany NY) ; 9(10): 2026-2051, 2017 10 09.
Artigo em Inglês | MEDLINE | ID: mdl-29016359

RESUMO

Luminal epithelial cells in the breast gradually alter gene and protein expression with age, appearing to lose lineage-specificity by acquiring myoepithelial-like characteristics. We hypothesize that the luminal lineage is particularly sensitive to microenvironment changes, and age-related microenvironment changes cause altered luminal cell phenotypes. To evaluate the effects of different microenvironments on the fidelity of epigenetically regulated luminal and myoepithelial gene expression, we generated a set of lineage-specific probes for genes that are controlled through DNA methylation. Culturing primary luminal cells under conditions that favor myoepithelial propogation led to their reprogramming at the level of gene methylation, and to a more myoepithelial-like expression profile. Primary luminal cells' lineage-specific gene expression could be maintained when they were cultured as bilayers with primary myoepithelial cells. Isogenic stromal fibroblast co-cultures were unable to maintain the luminal phenotype. Mixed-age luminal-myoepithelial bilayers revealed that luminal cells adopt transcription and methylation patterns consistent with the chronological age of the myoepithelial cells. We provide evidence that the luminal epithelial phenotype is exquisitely sensitive to microenvironment conditions, and that states of aging are cell non-autonomously communicated through microenvironment cues over at least one cell diameter.


Assuntos
Envelhecimento , Mama/citologia , Microambiente Celular , Células Epiteliais/citologia , Linhagem da Célula , Técnicas de Cocultura , Feminino , Humanos , Fenótipo , Transcriptoma
6.
G3 (Bethesda) ; 6(3): 683-94, 2016 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-26772746

RESUMO

Steroid hormones induce cascades of gene activation and repression with transformative effects on cell fate . Steroid transduction plays a major role in the development and physiology of nearly all metazoan species, and in the progression of the most common forms of cancer. Despite the paramount importance of steroids in developmental and translational biology, a complete map of transcriptional response has not been developed for any hormone . In the case of 20-hydroxyecdysone (ecdysone) in Drosophila melanogaster, these trajectories range from apoptosis to immortalization. We mapped the ecdysone transduction network in a cohort of 41 cell lines, the largest such atlas yet assembled. We found that the early transcriptional response mirrors the distinctiveness of physiological origins: genes respond in restricted patterns, conditional on the expression levels of dozens of transcription factors. Only a small cohort of genes is constitutively modulated independent of initial cell state. Ecdysone-responsive genes tend to organize into directional same-stranded units, with consecutive genes induced from the same strand. Here, we identify half of the ecdysone receptor heterodimer as the primary rate-limiting step in the response, and find that initial receptor isoform levels modulate the activated cohort of target transcription factors. This atlas of steroid response reveals organizing principles of gene regulation by a model type II nuclear receptor and lays the foundation for comprehensive and predictive understanding of the ecdysone transduction network in the fruit fly.


Assuntos
Drosophila/genética , Drosophila/metabolismo , Regulação da Expressão Gênica , Hormônios/metabolismo , Transdução de Sinais , Animais , Linhagem Celular , Análise por Conglomerados , Ecdisona/metabolismo , Ecdisona/farmacologia , Perfilação da Expressão Gênica , Regulação da Expressão Gênica/efeitos dos fármacos , Hormônios/farmacologia , Isoformas de Proteínas , Receptores de Esteroides/metabolismo , Esteroides/metabolismo , Fatores de Transcrição/metabolismo , Transcrição Gênica/efeitos dos fármacos , Transcriptoma
7.
Genome Res ; 25(11): 1692-702, 2015 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-26294687

RESUMO

In eukaryotic cells, RNAs exist as ribonucleoprotein particles (RNPs). Despite the importance of these complexes in many biological processes, including splicing, polyadenylation, stability, transportation, localization, and translation, their compositions are largely unknown. We affinity-purified 20 distinct RNA-binding proteins (RBPs) from cultured Drosophila melanogaster cells under native conditions and identified both the RNA and protein compositions of these RNP complexes. We identified "high occupancy target" (HOT) RNAs that interact with the majority of the RBPs we surveyed. HOT RNAs encode components of the nonsense-mediated decay and splicing machinery, as well as RNA-binding and translation initiation proteins. The RNP complexes contain proteins and mRNAs involved in RNA binding and post-transcriptional regulation. Genes with the capacity to produce hundreds of mRNA isoforms, ultracomplex genes, interact extensively with heterogeneous nuclear ribonuclear proteins (hnRNPs). Our data are consistent with a model in which subsets of RNPs include mRNA and protein products from the same gene, indicating the widespread existence of auto-regulatory RNPs. From the simultaneous acquisition and integrative analysis of protein and RNA constituents of RNPs, we identify extensive cross-regulatory and hierarchical interactions in post-transcriptional control.


Assuntos
Proteínas de Drosophila/metabolismo , Drosophila melanogaster/genética , Regulação da Expressão Gênica , Proteínas de Ligação a RNA/metabolismo , Animais , Proteínas de Drosophila/genética , Ribonucleoproteínas Nucleares Heterogêneas/genética , Ribonucleoproteínas Nucleares Heterogêneas/metabolismo , Splicing de RNA/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Proteínas de Ligação a RNA/genética , Análise de Sequência de RNA , Transfecção
8.
Nat Commun ; 6: 5897, 2015 Jan 29.
Artigo em Inglês | MEDLINE | ID: mdl-25631608

RESUMO

Fasting glucose and insulin are intermediate traits for type 2 diabetes. Here we explore the role of coding variation on these traits by analysis of variants on the HumanExome BeadChip in 60,564 non-diabetic individuals and in 16,491 T2D cases and 81,877 controls. We identify a novel association of a low-frequency nonsynonymous SNV in GLP1R (A316T; rs10305492; MAF=1.4%) with lower FG (ß=-0.09±0.01 mmol l(-1), P=3.4 × 10(-12)), T2D risk (OR[95%CI]=0.86[0.76-0.96], P=0.010), early insulin secretion (ß=-0.07±0.035 pmolinsulin mmolglucose(-1), P=0.048), but higher 2-h glucose (ß=0.16±0.05 mmol l(-1), P=4.3 × 10(-4)). We identify a gene-based association with FG at G6PC2 (pSKAT=6.8 × 10(-6)) driven by four rare protein-coding SNVs (H177Y, Y207S, R283X and S324P). We identify rs651007 (MAF=20%) in the first intron of ABO at the putative promoter of an antisense lncRNA, associating with higher FG (ß=0.02±0.004 mmol l(-1), P=1.3 × 10(-8)). Our approach identifies novel coding variant associations and extends the allelic spectrum of variation underlying diabetes-related quantitative traits and T2D susceptibility.


Assuntos
Glicemia/metabolismo , Diabetes Mellitus Tipo 2/genética , Exoma/genética , Jejum/sangue , Predisposição Genética para Doença , Variação Genética , Taxa de Mutação , Análise de Sequência com Séries de Oligonucleotídeos , População Negra/genética , Diabetes Mellitus Tipo 2/sangue , Estudos de Associação Genética , Loci Gênicos , Receptor do Peptídeo Semelhante ao Glucagon 1/genética , Glucose-6-Fosfatase/genética , Humanos , Insulina/sangue , Polimorfismo de Nucleotídeo Único/genética , População Branca/genética
9.
Nat Biotechnol ; 32(4): 341-6, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-24633242

RESUMO

The identification of full length transcripts entirely from short-read RNA sequencing data (RNA-seq) remains a challenge in the annotation of genomes. Here we describe an automated pipeline for genome annotation that integrates RNA-seq and gene-boundary data sets, which we call Generalized RNA Integration Tool, or GRIT. Applying GRIT to Drosophila melanogaster short-read RNA-seq, cap analysis of gene expression (CAGE) and poly(A)-site-seq data collected for the modENCODE project, we recovered the vast majority of previously annotated transcripts and doubled the total number of transcripts cataloged. We found that 20% of protein coding genes encode multiple protein-localization signals and that, in 20-d-old adult fly heads, genes with multiple polyadenylation sites are more common than genes with alternative splicing or alternative promoters. GRIT demonstrates 30% higher precision and recall than the most widely used transcript assembly tools. GRIT will facilitate the automated generation of high-quality genome annotations without the need for extensive manual annotation.


Assuntos
Mapeamento Cromossômico/métodos , Genômica/métodos , Anotação de Sequência Molecular/métodos , RNA/química , RNA/genética , Análise de Sequência de RNA/métodos , Animais , Drosophila melanogaster/genética , Genoma de Inseto/genética , RNA/análise
10.
Nature ; 512(7515): 393-9, 2014 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-24670639

RESUMO

Animal transcriptomes are dynamic, with each cell type, tissue and organ system expressing an ensemble of transcript isoforms that give rise to substantial diversity. Here we have identified new genes, transcripts and proteins using poly(A)+ RNA sequencing from Drosophila melanogaster in cultured cell lines, dissected organ systems and under environmental perturbations. We found that a small set of mostly neural-specific genes has the potential to encode thousands of transcripts each through extensive alternative promoter usage and RNA splicing. The magnitudes of splicing changes are larger between tissues than between developmental stages, and most sex-specific splicing is gonad-specific. Gonads express hundreds of previously unknown coding and long non-coding RNAs (lncRNAs), some of which are antisense to protein-coding genes and produce short regulatory RNAs. Furthermore, previously identified pervasive intergenic transcription occurs primarily within newly identified introns. The fly transcriptome is substantially more complex than previously recognized, with this complexity arising from combinatorial usage of promoters, splice sites and polyadenylation sites.


Assuntos
Drosophila melanogaster/genética , Perfilação da Expressão Gênica , Transcriptoma/genética , Processamento Alternativo/genética , Animais , Drosophila melanogaster/anatomia & histologia , Drosophila melanogaster/citologia , Feminino , Masculino , Anotação de Sequência Molecular , Tecido Nervoso/metabolismo , Especificidade de Órgãos , Poli A/genética , Poliadenilação , Regiões Promotoras Genéticas/genética , RNA Longo não Codificante/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Caracteres Sexuais , Estresse Fisiológico/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...