Results 1 - 20 of 66
1.
Nucleic Acids Res ; 52(D1): D154-D163, 2024 Jan 05.
Article in English | MEDLINE | ID: mdl-37971293

ABSTRACT

We present a major update of the HOCOMOCO collection that provides DNA binding specificity patterns of 949 human transcription factors and 720 mouse orthologs. To make this release, we performed motif discovery in peak sets that originated from 14 183 ChIP-Seq experiments and reads from 2554 HT-SELEX experiments yielding more than 400 thousand candidate motifs. The candidate motifs were annotated according to their similarity to known motifs and the hierarchy of DNA-binding domains of the respective transcription factors. Next, the motifs underwent human expert curation to stratify distinct motif subtypes and remove non-informative patterns and common artifacts. Finally, the curated subset of 100 thousand motifs was supplied to the automated benchmarking to select the best-performing motifs for each transcription factor. The resulting HOCOMOCO v12 core collection contains 1443 verified position weight matrices, including distinct subtypes of DNA binding motifs for particular transcription factors. In addition to the core collection, HOCOMOCO v12 provides motif sets optimized for the recognition of binding sites in vivo and in vitro, and for annotation of regulatory sequence variants. HOCOMOCO is available at https://hocomoco12.autosome.org and https://hocomoco.autosome.org.


Subjects
Databases, Genetic , Gene Expression Regulation , Protein Interaction Domains and Motifs , Transcription Factors , Animals , Humans , Mice , Binding Sites/genetics , Nucleotide Motifs , Transcription Factors/genetics , Transcription Factors/metabolism , Internet , Protein Interaction Domains and Motifs/genetics
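
For readers unfamiliar with position weight matrices (PWMs) such as those mentioned in the abstract above, the following is a minimal sketch of how a PWM can be built from nucleotide counts and used to scan a sequence. The count matrix, the pseudocount, and the uniform background are illustrative assumptions; none of the values come from HOCOMOCO, and the collection's own models and thresholds may be defined differently.

```python
import math

# Hypothetical 4-position count matrix (one dict per motif position).
# The counts are invented for illustration; the consensus is ACGT.
COUNTS = [
    {"A": 8, "C": 1, "G": 0, "T": 1},
    {"A": 0, "C": 9, "G": 1, "T": 0},
    {"A": 1, "C": 0, "G": 9, "T": 0},
    {"A": 0, "C": 1, "G": 0, "T": 9},
]


def counts_to_pwm(counts, pseudocount=1.0, background=0.25):
    """Convert counts to log2-odds weights against a uniform background."""
    pwm = []
    for column in counts:
        total = sum(column.values()) + 4 * pseudocount
        pwm.append({
            base: math.log2((column[base] + pseudocount) / total / background)
            for base in "ACGT"
        })
    return pwm


def score_window(pwm, window):
    """Sum the per-position weights for a window as long as the motif."""
    return sum(column[base] for column, base in zip(pwm, window))


def best_hit(pwm, sequence):
    """Return (score, start) of the best-scoring window, or None if too short."""
    width = len(pwm)
    hits = [
        (score_window(pwm, sequence[i:i + width]), i)
        for i in range(len(sequence) - width + 1)
    ]
    return max(hits) if hits else None


pwm = counts_to_pwm(COUNTS)
print(best_hit(pwm, "TTTACGTTT"))
```

The sketch scans one strand only; a practical scanner would also score the reverse complement and compare scores against a chosen threshold.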
3.
Cell Syst ; 14(4): 285-301.e4, 2023 04 19.
Article in English | MEDLINE | ID: mdl-37080163

ABSTRACT

Recent advances in spatial transcriptomics (ST) enable gene expression measurements from a tissue sample while retaining its spatial context. This technology enables unprecedented in situ resolution of the regulatory pathways that underlie the heterogeneity in the tumor as well as the tumor microenvironment (TME). The direct characterization of cellular co-localization with spatial technologies facilitates quantification of the molecular changes resulting from direct cell-cell interaction, as it occurs in tumor-immune interactions. We present SpaceMarkers, a bioinformatics algorithm to infer molecular changes from cell-cell interactions from latent space analysis of ST data. We apply this approach to infer the molecular changes from tumor-immune interactions in Visium spatial transcriptomics data of metastasis, invasive and precursor lesions, and immunotherapy treatment. Transfer learning in matched scRNA-seq data further enabled quantification of the specific cell types in which SpaceMarkers are enriched. Altogether, SpaceMarkers can identify the location and context-specific molecular interactions within the TME from ST data.


Subjects
Algorithms , Tumor Microenvironment , Cell Communication , Computational Biology , Gene Expression Profiling
4.
Cancers (Basel) ; 14(19), 2022 Sep 25.
Article in English | MEDLINE | ID: mdl-36230586

ABSTRACT

Polyunsaturated fatty acid (PUFA) metabolism is currently a focus in cancer research because PUFAs function as structural components of the membrane matrix, as fuel sources for energy production, and as sources of secondary messengers, so-called oxylipins, which are important players in inflammatory processes. Although breast cancer (BC) is the leading cause of cancer death among women worldwide, no systematic study of PUFA metabolism as a system of interrelated processes in this disease has been carried out. Here, we implemented a Boruta-based feature selection algorithm to determine the list of the most important PUFA metabolism genes altered in breast cancer tissues compared with normal tissues. A rank-based Random Forest (RF) model was built on the selected gene list (33 genes) and applied to predict the cancer phenotype to ascertain the PUFA genes involved in carcinogenesis. It showed high performance in dichotomous classification (balanced accuracy of 0.94, ROC AUC of 0.99). We also retrieved a list of the important PUFA genes (46 genes) that differed between breast cancer molecular subtypes. The balanced accuracy of the classification model built on the specified genes was 0.82, while the ROC AUC for the sensitivity analysis was 0.85. Specific patterns of PUFA metabolic changes were obtained for each molecular subtype of breast cancer. These results show evidence that (1) PUFA metabolism genes are critical for the pathogenesis of breast cancer; (2) BC subtypes differ in the expression of PUFA metabolism genes; and (3) the lists of genes selected in the models are enriched with genes involved in the metabolism of signaling lipids.
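
The abstract above reports balanced accuracy rather than plain accuracy. As a brief, hedged illustration of why that matters for imbalanced tumor/normal cohorts, the sketch below computes balanced accuracy as the mean of per-class recalls. The labels are invented toy data and do not reproduce the study's samples or its Random Forest model.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recalls; each class contributes equally."""
    recalls = []
    for cls in sorted(set(y_true)):
        indices = [i for i, label in enumerate(y_true) if label == cls]
        correct = sum(1 for i in indices if y_pred[i] == cls)
        recalls.append(correct / len(indices))
    return sum(recalls) / len(recalls)


def accuracy(y_true, y_pred):
    """Fraction of samples predicted correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Toy cohort (assumption): 90 tumor samples (1) and 10 normal samples (0),
# with a trivial classifier that always predicts "tumor".
y_true = [1] * 90 + [0] * 10
y_pred = [1] * 100

print(accuracy(y_true, y_pred))           # high despite ignoring normals
print(balanced_accuracy(y_true, y_pred))  # exposes the failure on normals
```

Because every class carries the same weight, a classifier that ignores the minority class cannot score well on this metric.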

5.
Front Immunol ; 13: 803229, 2022.
Article in English | MEDLINE | ID: mdl-36052064

ABSTRACT

Background: B lymphocytes play a pivotal regulatory role in the development of the immune response. It was previously shown that deficiency in B regulatory cells (Bregs) or a decrease in their anti-inflammatory activity can lead to immunological dysfunctions. However, the exact mechanisms of Breg development and functioning are only partially resolved. For instance, little is known about the structure of their B cell receptor (BCR) repertoires in autoimmune disorders, including multiple sclerosis (MS), a severe neuroinflammatory disease of as yet unknown etiology. Here, we elucidate specific properties of B regulatory cells in MS. Methods: We performed a prospective study of the transitional Breg (tBreg) subpopulations with the CD19+CD24highCD38high phenotype from MS patients and healthy donors by (i) measuring their content during two diverging courses of relapsing-remitting MS: benign multiple sclerosis (BMS) and highly active multiple sclerosis (HAMS); (ii) analyzing BCR repertoires of circulating B cells by high-throughput sequencing; and (iii) measuring the percentage of CD27+ cells in tBregs. Results: The tBregs from HAMS patients carry heavy chains with fewer hypermutations than tBregs from healthy donors. The percentage of transitional CD24highCD38high B cells was elevated, whereas the frequency of differentiated CD27+ cells in this transitional B cell subset was decreased in the MS patients as compared with healthy donors. Conclusions: Impaired maturation of regulatory B cells is associated with MS progression.


Subjects
B-Lymphocytes, Regulatory , Multiple Sclerosis , Humans , Interleukin-10 , Prospective Studies , Receptors, Antigen, B-Cell
6.
Nat Commun ; 12(1): 2751, 2021 05 12.
Article in English | MEDLINE | ID: mdl-33980847

ABSTRACT

Sequence variants in gene regulatory regions alter gene expression and contribute to phenotypes of individual cells and the whole organism, including disease susceptibility and progression. Single-nucleotide variants in enhancers or promoters may affect gene transcription by altering transcription factor binding sites. Differential transcription factor binding in heterozygous genomic loci provides a natural source of information on such regulatory variants. We present a novel approach to call the allele-specific transcription factor binding events at single-nucleotide variants in ChIP-Seq data, taking into account the joint contribution of aneuploidy and local copy number variation, which is estimated directly from variant calls. We have conducted a meta-analysis of more than 7 thousand ChIP-Seq experiments and assembled the database of allele-specific binding events listing more than half a million entries at nearly 270 thousand single-nucleotide polymorphisms for several hundred human transcription factors and cell types. These polymorphisms are enriched for associations with phenotypes of medical relevance and often overlap eQTLs, making them candidates for causality by linking variants with molecular mechanisms. Specifically, there is a special class of switching sites, where different transcription factors preferentially bind alternative alleles, thus revealing allele-specific rewiring of molecular circuitry.


Subjects
Alleles , Genome, Human , Regulatory Sequences, Nucleic Acid/genetics , Transcription Factors/metabolism , Chromatin/metabolism , Databases, Genetic , Gene Dosage , Gene Expression Regulation/genetics , Genome-Wide Association Study , Humans , Nucleotide Motifs , Phenotype , Polymorphism, Single Nucleotide , Protein Binding , Quantitative Trait Loci
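
The abstract above stresses that allelic imbalance must be judged against the local copy number of each allele. The sketch below illustrates that idea with a plain exact binomial test whose expected reference-allele fraction is set by assumed allele copy numbers. This is a deliberately simplified stand-in, not the statistical model used in the study; the function names and the read counts are hypothetical.

```python
from math import comb


def binom_pmf(k, n, p):
    """Probability of exactly k successes in n trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def two_sided_binom_p(k, n, p):
    """Exact two-sided p-value: sum of outcomes no more likely than k."""
    observed = binom_pmf(k, n, p)
    total = sum(
        pmf
        for pmf in (binom_pmf(i, n, p) for i in range(n + 1))
        if pmf <= observed * (1 + 1e-9)  # tolerance for float ties
    )
    return min(1.0, total)


def expected_ref_fraction(ref_copies, alt_copies):
    """Expected share of reference reads given allele copy numbers."""
    return ref_copies / (ref_copies + alt_copies)


# Hypothetical site: 20 reference reads out of 30 total.
ref_reads, total_reads = 20, 30

# Assuming a balanced diploid locus (1:1), the site looks mildly imbalanced.
print(two_sided_binom_p(ref_reads, total_reads, expected_ref_fraction(1, 1)))

# Assuming the reference allele is duplicated (2:1), 20/30 is what is expected.
print(two_sided_binom_p(ref_reads, total_reads, expected_ref_fraction(2, 1)))
```

The same read counts therefore support or contradict allele-specific binding depending on the assumed copy-number background, which is the point the abstract makes about aneuploidy and copy number variation.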
7.
Cancer Res ; 81(4): 1001-1013, 2021 02 15.
Article in English | MEDLINE | ID: mdl-33408119

ABSTRACT

Adenoid cystic carcinoma (ACC) is the second most common malignancy of the salivary gland. Although characterized as an indolent tumor, ACC often leads to incurable metastatic disease. Patients with ACC respond poorly to currently available therapeutic drugs, and the factors contributing to the limited response remain unknown. Determining the role of molecular alterations frequently occurring in ACC may clarify ACC tumorigenesis and advance the development of effective treatment strategies. Applying Splice Expression Variant Analysis and outlier statistics on RNA sequencing of primary ACC tumors and matched normal salivary gland tissues, we identified multiple alternative splicing events (ASE) of genes specific to ACC. In ACC cells and patient-derived xenografts, FGFR1 was a uniquely expressed ASE. Detailed PCR analysis identified three novel, truncated, intracellular domain-lacking FGFR1 variants (FGFR1v). Cloning and expression analysis suggest that the three FGFR1v are cell surface proteins, that expression of FGFR1v augmented pAKT activity, and that cells became more resistant to a pharmacologic FGFR1 inhibitor. FGFR1v-induced AKT activation was associated with AXL function, and inhibition of AXL activity in FGFR1v knockdown cells led to enhanced cytotoxicity in ACC. Moreover, the cell-killing effect was increased by dual inhibition of AXL and FGFR1 in ACC cells. This study demonstrates that these previously undescribed FGFR1v cooperate with AXL and desensitize cells to an FGFR1 inhibitor, which supports further investigation into combined FGFR1 and AXL inhibition as an effective ACC therapy.
SIGNIFICANCE: This study identifies several FGFR1 variants that function through the AXL/AKT signaling pathway independent of FGF/FGFR1, desensitizing cells to FGFR1 inhibitor, suggestive of a potential resistance mechanism in ACC.


Subjects
Carcinoma, Adenoid Cystic/genetics , Receptor, Fibroblast Growth Factor, Type 1/genetics , Receptor, Fibroblast Growth Factor, Type 1/metabolism , Salivary Gland Neoplasms/genetics , Animals , Carcinoma, Adenoid Cystic/metabolism , Carcinoma, Adenoid Cystic/pathology , Cell Line, Tumor , Female , Gene Expression Regulation, Neoplastic , Humans , Mice , Mice, Inbred NOD , Mice, Transgenic , Protein Isoforms/genetics , Protein Isoforms/isolation & purification , Protein Isoforms/metabolism , Proto-Oncogene Proteins/genetics , Proto-Oncogene Proteins/metabolism , Proto-Oncogene Proteins c-akt/genetics , Proto-Oncogene Proteins c-akt/metabolism , Receptor Cross-Talk/physiology , Receptor Protein-Tyrosine Kinases/genetics , Receptor Protein-Tyrosine Kinases/metabolism , Receptor, Fibroblast Growth Factor, Type 1/isolation & purification , Salivary Gland Neoplasms/metabolism , Salivary Gland Neoplasms/pathology , Salivary Glands/metabolism , Salivary Glands/pathology , Signal Transduction/genetics , Axl Receptor Tyrosine Kinase
8.
Elife ; 10, 2021 01 25.
Article in English | MEDLINE | ID: mdl-33491650

ABSTRACT

Determining the etiologic basis of the mutations that are responsible for cancer is one of the fundamental challenges in modern cancer research. Different mutational processes induce different types of DNA mutations, providing 'mutational signatures' that have led to key insights into cancer etiology. The most widely used signatures for assessing genomic data are based on unsupervised patterns that are then retrospectively correlated with certain features of cancer. We show here that supervised machine-learning techniques can identify signatures, called SuperSigs, that are more predictive than those currently available. Surprisingly, we found that aging yields different SuperSigs in different tissues, and the same is true for environmental exposures. We were able to discover SuperSigs associated with obesity, the most important lifestyle factor contributing to cancer in Western populations.


Subjects
Machine Learning , Mutation , Neoplasms/etiology , Obesity/genetics , Humans , Neoplasms/genetics
9.
F1000Res ; 10: 1260, 2021.
Article in English | MEDLINE | ID: mdl-36204675

ABSTRACT

A Molecular Features Set (MFS) is a result of a vast diversity of bioinformatics pipelines. The lack of a "gold standard" for most experimental data modalities makes it difficult to provide a valid estimate of a particular MFS's quality. Yet this goal can partially be achieved by analyzing inner-sample Distance Matrices (DM) and their power to distinguish between phenotypes. The quality of a DM can be assessed by summarizing its power to quantify the differences between inner-phenotype and outer-phenotype distances. This estimate of the DM quality can be construed as a measure of the MFS's quality. Here we propose Hobotnica, an approach to estimate the quality of MFSs by their ability to stratify data and to assign them significance scores, which allows collating various signatures and comparing their quality for contrasting groups.


Subjects
Computational Biology , Phenotype
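
The abstract above describes judging a distance matrix by how well it separates within-phenotype from between-phenotype distances. One simple way to express that idea, shown here only as an assumed stand-in and not as the Hobotnica statistic itself, is the fraction of (within, between) distance pairs in which the within-phenotype distance is the smaller one. The sample values and labels are toy data.

```python
from itertools import combinations


def separation_score(dist, labels):
    """Share of (within, between) pairs where the within distance is smaller.

    1.0 means every within-phenotype distance is below every
    between-phenotype distance; about 0.5 means no separation.
    """
    within, between = [], []
    for i, j in combinations(range(len(labels)), 2):
        target = within if labels[i] == labels[j] else between
        target.append(dist[i][j])
    if not within or not between:
        raise ValueError("need both within- and between-phenotype pairs")
    wins = sum(
        1.0 if w < b else 0.5 if w == b else 0.0
        for w in within
        for b in between
    )
    return wins / (len(within) * len(between))


# Toy one-dimensional "expression" values for four samples (assumption).
values = [0, 1, 10, 11]
dist = [[abs(a - b) for b in values] for a in values]

print(separation_score(dist, ["case", "case", "control", "control"]))
print(separation_score(dist, ["case", "control", "case", "control"]))
```

A significance score of the kind the abstract mentions could then be obtained by comparing the observed value with values from permuted labels, although that step is not shown here.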
10.
Cell Rep Methods ; 1(6): 100088, 2021 10 25.
Article in English | MEDLINE | ID: mdl-35474897

ABSTRACT

Molecular interactions at identical transcriptomic locations or at proximal but non-overlapping sites can mediate RNA modification and regulation, necessitating tools to uncover these spatial relationships. We present nearBynding, a flexible algorithm and software pipeline that models spatial correlation between transcriptome-wide tracks from diverse data types. nearBynding can process and correlate interval as well as continuous data and incorporate experimentally derived or in silico predicted transcriptomic tracks. nearBynding offers visualization functions for its statistics to identify colocalizations and adjacent features. We demonstrate the application of nearBynding to correlate RNA-binding protein (RBP) binding preferences with other RBPs, RNA structure, or RNA modification. By cross-correlating RBP binding and RNA structure data, we demonstrate that nearBynding recapitulates known RBP binding to structural motifs and provides biological insights into RBP binding preference of G-quadruplexes. nearBynding is available as an R/Bioconductor package and can run on a personal computer, making correlation of transcriptomic features broadly accessible.


Subjects
RNA-Binding Proteins , Transcriptome , Transcriptome/genetics , RNA-Binding Proteins/genetics , Binding Sites/genetics , RNA/genetics , Protein Binding
11.
Front Cell Dev Biol ; 8: 698, 2020.
Article in English | MEDLINE | ID: mdl-33015029

ABSTRACT

Head and neck squamous cell carcinoma (HNSCC) has a high recurrence and metastatic rate with an unknown mechanism of cancer spread. Tumor inflammation is one of the most critical processes of cancer onset, growth, and metastasis. We hypothesize that the release of extracellular vesicles (EVs) by tumor endothelial cells (TECs) induces reprogramming of immune cells as well as stromal cells to create an immunosuppressive microenvironment that favors tumor spread. We call this mechanism non-metastatic contagious carcinogenesis. Extracellular vesicles were collected from primary HNSCC-derived endothelial cells (TEC-EV) and were used for stimulation of peripheral blood mononuclear cells (PBMCs) and primary adipose mesenchymal stem cells (ASCs). Regulation of ASC gene expression was investigated by RNA sequencing and protein array. PBMCs stimulated with TEC-EV were analyzed by enzyme-linked immunosorbent assay and fluorescence-activated cell sorting. We validated in vitro the effects of TEC-EV on ASCs or PBMCs by measuring invasion, adhesion, and proliferation. We found and confirmed that TEC-EV were able to change the ASC inflammatory gene expression signature within 24-48 h. TEC-EV were also able to enhance the secretion of TGF-β1 and IL-10 by PBMCs and to increase T regulatory cell (Treg) expansion. TEC-EV carry specific proteins and RNAs that are responsible for Treg differentiation and immune suppression. ASCs and PBMCs treated with TEC-EV enhanced the proliferation, adhesion, and invasion of tumor cells. These data indicate that TEC-EV exhibit a mechanism of non-metastatic contagious carcinogenesis that regulates the tumor microenvironment and reprograms immune cells to sustain tumor growth and progression.

12.
Oncogene ; 39(40): 6327-6339, 2020 10.
Article in English | MEDLINE | ID: mdl-32848210

ABSTRACT

The dominant paradigm for HPV carcinogenesis includes integration into the host genome followed by expression of E6 and E7 (E6/E7). We explored an alternative carcinogenic pathway characterized by episomal E2, E4, and E5 (E2/E4/E5) expression. Half of HPV-positive cervical and pharyngeal cancers comprised a subtype with increased expression of E2/E4/E5 that was also associated with a lack of integration into the host genome. Models of E2/E4/E5 carcinogenesis show p53-dependent enhanced proliferation in vitro, as well as increased susceptibility to induction of cancer in vivo. In whole-genome expression analysis, the E2/E4/E5 pharyngeal cancer subtype is defined by activation of the fibroblast growth factor receptor (FGFR) pathway, and this subtype is susceptible to combined FGFR and mTOR inhibition, with implications for targeted therapy.


Subjects
Carcinogenesis/genetics , Oncogene Proteins, Viral/genetics , Papillomavirus Infections/genetics , Pharyngeal Neoplasms/genetics , Squamous Cell Carcinoma of Head and Neck/genetics , Uterine Cervical Neoplasms/genetics , Animals , Antineoplastic Combined Chemotherapy Protocols/pharmacology , Antineoplastic Combined Chemotherapy Protocols/therapeutic use , Carcinogenesis/drug effects , Cell Line, Tumor , Cell Proliferation/genetics , Datasets as Topic , Disease Models, Animal , Disease-Free Survival , Female , Gene Expression Regulation, Neoplastic/drug effects , Gene Expression Regulation, Viral/drug effects , Host-Pathogen Interactions/genetics , Human papillomavirus 16/genetics , Human papillomavirus 16/pathogenicity , Humans , Mice , Mice, Transgenic , Papillomavirus Infections/drug therapy , Papillomavirus Infections/mortality , Papillomavirus Infections/virology , Pharyngeal Neoplasms/drug therapy , Pharyngeal Neoplasms/mortality , Pharyngeal Neoplasms/virology , Primary Cell Culture , Receptors, Fibroblast Growth Factor/antagonists & inhibitors , Receptors, Fibroblast Growth Factor/metabolism , Signal Transduction/drug effects , Signal Transduction/genetics , Squamous Cell Carcinoma of Head and Neck/drug therapy , Squamous Cell Carcinoma of Head and Neck/pathology , Squamous Cell Carcinoma of Head and Neck/virology , TOR Serine-Threonine Kinases/antagonists & inhibitors , TOR Serine-Threonine Kinases/metabolism , Tumor Suppressor Protein p53/metabolism , Uterine Cervical Neoplasms/drug therapy , Uterine Cervical Neoplasms/mortality , Uterine Cervical Neoplasms/virology
13.
Genome Res ; 30(7): 1060-1072, 2020 07.
Article in English | MEDLINE | ID: mdl-32718982

ABSTRACT

Long noncoding RNAs (lncRNAs) constitute the majority of transcripts in mammalian genomes, and yet their functions remain largely unknown. As part of the FANTOM6 project, we systematically knocked down the expression of 285 lncRNAs in human dermal fibroblasts and quantified cellular growth, morphological changes, and transcriptomic responses using Cap Analysis of Gene Expression (CAGE). Antisense oligonucleotides targeting the same lncRNAs exhibited global concordance, and the molecular phenotype, measured by CAGE, recapitulated the observed cellular phenotypes while providing additional insights on the affected genes and pathways. Here, we disseminate the largest-to-date lncRNA knockdown data set with molecular phenotyping (over 1000 CAGE deep-sequencing libraries) for further exploration and highlight functional roles for ZNF213-AS1 and lnc-KHDC3L-2.


Subjects
RNA, Long Noncoding/physiology , Cell Growth Processes/genetics , Cell Movement/genetics , Fibroblasts/cytology , Fibroblasts/metabolism , Humans , KCNQ Potassium Channels/metabolism , Molecular Sequence Annotation , Oligonucleotides, Antisense , RNA, Long Noncoding/antagonists & inhibitors , RNA, Long Noncoding/metabolism , RNA, Small Interfering
14.
Nucleic Acids Res ; 48(12): e68, 2020 07 09.
Article in English | MEDLINE | ID: mdl-32392348

ABSTRACT

While the methods available for single-cell ATAC-seq analysis are well optimized for clustering cell types, the question of how to integrate multiple scATAC-seq data sets and/or sequencing modalities is still open. We present an analysis framework that enables such integration across scATAC-seq data sets by applying the CoGAPS Matrix Factorization algorithm and the projectR transfer learning program to identify common regulatory patterns across scATAC-seq data sets. We additionally integrate our analysis with scRNA-seq data to identify orthogonal evidence for transcriptional regulators predicted by scATAC-seq analysis. Using publicly available scATAC-seq data, we find patterns that accurately characterize cell types both within and across data sets. Furthermore, we demonstrate that these patterns are both consistent with current biological understanding and reflective of novel regulatory biology.


Subjects
Algorithms , Chromatin Immunoprecipitation Sequencing/methods , Gene Expression Profiling/methods , Single-Cell Analysis/methods , Animals , Chromatin/genetics , Datasets as Topic , Humans , Machine Learning
15.
Epigenetics ; 15(9): 959-971, 2020 09.
Article in English | MEDLINE | ID: mdl-32164487

ABSTRACT

Human papillomavirus-related oropharyngeal squamous cell carcinoma (HPV+ OPSCC) represents a unique disease entity within head and neck cancer with rising incidence. Previous work has shown that alternative splicing events (ASEs) are prevalent in HPV+ OPSCC, but further validation is needed to understand the regulation of this process and its role in these tumours. In this study, eleven ASEs (GIT2, CTNNB1, MKNK2, MRPL33, SIPA1L3, SNHG6, SYCP2, TPRG1, ZHX2, ZNF331, and ELOVL1) were selected for validation from 109 previously published candidate ASEs to elucidate the post-transcriptional mechanisms of oncogenesis in HPV+ disease. In vitro qRT-PCR confirmed differential expression of 9 of 11 ASE candidates, and in silico analysis within the TCGA cohort confirmed 8 of 11 candidates. Six ASEs (MRPL33, SIPA1L3, SNHG6, TPRG1, ZHX2, and ELOVL1) showed significant differential expression across both methods. Further evaluation of chromatin modification revealed that ASEs strongly correlated with cancer-specific distribution of acetylated lysine 27 of histone 3 (H3K27ac). Subsequent epigenetic treatment of HPV+ HNSCC cell lines (UM-SCC-047 and UPCI-SCC-090) with JQ1 not only induced downregulation of cancer-specific ASE isoforms, but also growth inhibition in both cell lines. The UPCI-SCC-090 cell line, with greater ASE expression, also showed more significant growth inhibition after JQ1 treatment. This study confirms several novel cancer-specific ASEs in HPV+ OPSCC and provides evidence for the role of chromatin modifications in regulation of alternative splicing in HPV+ OPSCC. This highlights the role of epigenetic changes in the oncogenesis of HPV+ OPSCC, which represents a unique, unexplored target for therapeutics that can alter the global post-transcriptional landscape.


Subjects
Alternative Splicing , Carcinoma, Squamous Cell/genetics , Chromatin Assembly and Disassembly , Gene Expression Regulation, Neoplastic , Oropharyngeal Neoplasms/genetics , Alphapapillomavirus/pathogenicity , Carcinoma, Squamous Cell/metabolism , Carcinoma, Squamous Cell/virology , Cell Line, Tumor , Epigenesis, Genetic , Genetic Loci , Histone Code , Histones/chemistry , Histones/metabolism , Humans , Oropharyngeal Neoplasms/metabolism , Oropharyngeal Neoplasms/virology
16.
Genome Res ; 30(7): 1073-1081, 2020 07.
Article in English | MEDLINE | ID: mdl-32079618

ABSTRACT

Long noncoding RNAs (lncRNAs) have emerged as key coordinators of biological and cellular processes. Characterizing lncRNA expression across cells and tissues is key to understanding their role in determining phenotypes, including human diseases. We present here FC-R2, a comprehensive expression atlas across a broadly defined human transcriptome, inclusive of over 109,000 coding and noncoding genes, as described in the FANTOM CAGE-Associated Transcriptome (FANTOM-CAT) study. This atlas greatly extends the gene annotation used in the original recount2 resource. We demonstrate the utility of the FC-R2 atlas by reproducing key findings from published large studies and by generating new results across normal and diseased human samples. In particular, we (a) identify tissue-specific transcription profiles for distinct classes of coding and noncoding genes, (b) perform differential expression analysis across thirteen cancer types, identifying novel noncoding genes potentially involved in tumor pathogenesis and progression, and (c) confirm the prognostic value of the expression of several enhancer lncRNAs in cancer. Our resource is instrumental for the systematic molecular characterization of lncRNAs by the FANTOM6 Consortium. In conclusion, comprising over 70,000 samples, the FC-R2 atlas will empower other researchers to investigate functions and biological roles of both known coding genes and novel lncRNAs.


Subjects
Transcriptome , Databases, Genetic , Enhancer Elements, Genetic , Gene Expression Profiling , Genome, Human , Humans , Neoplasms/genetics , Organ Specificity , Prognosis , RNA, Long Noncoding/genetics , RNA, Long Noncoding/metabolism , RNA, Messenger/metabolism
17.
Head Neck ; 42(4): 688-697, 2020 04.
Article in English | MEDLINE | ID: mdl-31850594

ABSTRACT

BACKGROUND: We aimed to use genomic data for optimizing polymerase chain reaction (PCR) primer/probe sets for detection of human papillomavirus (HPV)-16 in body fluids of patients with HPV-related head and neck squamous cell carcinoma (HPV-HNSCC). METHODS: We used genomic HPV-HNSCC sequencing data from a single-institution cohort and a TCGA cohort. Optimized primer/probe sets were designed and tested for analytical performance in the CaSki HPV-16 genome and confirmed in salivary rinse samples from patients with HPV-HNSCC. RESULTS: The highest read density was observed between the E5 and L2 regions. The E1 region contained a segment that was universally present. Among the candidate PCR primer/probe sets created, six reliably detected 30 HPV-16 copies. In a CLIA-certified laboratory setting, the combination of two novel primer/probe sets with E7 sets improved performance in salivary rinse samples, with a sensitivity of 96% and a specificity of 100%. CONCLUSIONS: PCR-based detection of HPV-16 DNA in HPV-HNSCC can be improved using rational genomic design.


Subjects
Head and Neck Neoplasms , Papillomavirus Infections , DNA, Viral/genetics , Genomics , Human papillomavirus 16/genetics , Humans , Papillomaviridae/genetics , Papillomavirus Infections/diagnosis , Squamous Cell Carcinoma of Head and Neck/genetics
18.
Front Genet ; 10: 1078, 2019.
Article in English | MEDLINE | ID: mdl-31737053

ABSTRACT

Many problems of modern genetics and functional genomics require the assessment of functional effects of sequence variants, including gene expression changes. Machine learning is considered to be a promising approach for solving this task, but its practical applications remain a challenge due to the insufficient volume and diversity of training data. A promising source of valuable data is a saturation mutagenesis massively parallel reporter assay, which quantitatively measures changes in transcription activity caused by sequence variants. Here, we explore the computational predictions of the effects of individual single-nucleotide variants on gene transcription measured in the massively parallel reporter assays, based on the data from the recent "Regulation Saturation" Critical Assessment of Genome Interpretation challenge. We show that the estimated prediction quality strongly depends on the structure of the training and validation data. Particularly, training on the sequence segments located next to the validation data results in the "information leakage" caused by the local context. This information leakage allows reproducing the prediction quality of the best CAGI challenge submissions with a fairly simple machine learning approach, and even obtaining notably better-than-random predictions using irrelevant genomic regions. Validation scenarios preventing such information leakage dramatically reduce the measured prediction quality. The performance at independent regulatory regions entirely excluded from the training set appears to be much lower than needed for practical applications, and even the performance estimation will become reliable only in the future with richer data from multiple reporters. The source code and data are available at https://bitbucket.org/autosomeru_cagi2018/cagi2018_regsat and https://genomeinterpretation.org/content/expression-variants.
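
The leakage described in the abstract above arises when variants from the same regulatory region land in both the training and the validation set. A minimal sketch of one way to avoid that, assuming each variant record carries a region identifier, is to hold out whole regions. The field names, region labels, and split fraction are hypothetical and are not taken from the CAGI challenge data or the authors' code.

```python
import random


def split_by_region(variants, holdout_fraction=0.25, seed=0):
    """Hold out entire regions so no region appears in both sets."""
    regions = sorted({v["region"] for v in variants})
    rng = random.Random(seed)
    rng.shuffle(regions)
    n_holdout = max(1, round(len(regions) * holdout_fraction))
    holdout = set(regions[:n_holdout])
    train = [v for v in variants if v["region"] not in holdout]
    valid = [v for v in variants if v["region"] in holdout]
    return train, valid


# Toy data (assumption): four regions with five variants each.
variants = [
    {"region": f"region_{name}", "position": pos, "effect": 0.0}
    for name in "ABCD"
    for pos in range(5)
]

train, valid = split_by_region(variants)
train_regions = {v["region"] for v in train}
valid_regions = {v["region"] for v in valid}
print(len(train), len(valid), train_regions & valid_regions)
```

A random per-variant split of the same data would scatter neighbouring positions across both sets, which is the scenario the abstract identifies as inflating the measured prediction quality.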

19.
Sci Rep ; 9(1): 15034, 2019 10 21.
Article in English | MEDLINE | ID: mdl-31636280

ABSTRACT

Current literature suggests that epigenetically regulated super-enhancers (SEs) are drivers of aberrant gene expression in cancers. Many tumor types are still missing chromatin data to define cancer-specific SEs and their role in carcinogenesis. In this work, we develop a simple pipeline, which can utilize chromatin data from etiologically similar tumors to discover tissue-specific SEs and their target genes using gene expression and DNA methylation data. As an example, we applied our pipeline to human papillomavirus-related oropharyngeal squamous cell carcinoma (HPV+ OPSCC). This tumor type is characterized by abundant gene expression changes, which cannot be explained by genetic alterations alone. Chromatin data are still limited for this disease, so we used 3627 SE elements from public domain data for closely related tissues, including normal and tumor lung, and cervical cancer cell lines. We integrated the available DNA methylation and gene expression data for HPV+ OPSCC samples to filter the candidate SEs to identify functional SEs and their affected targets, which are essential for cancer development. Overall, we found 159 differentially methylated SEs, including 87 SEs that actively regulate expression of 150 nearby genes (211 SE-gene pairs) in HPV+ OPSCC. Of these, 132 SE-gene pairs were validated in a related TCGA cohort. Pathway analysis revealed that the SE-regulated genes were associated with pathways known to regulate nasopharyngeal, breast, melanoma, and bladder carcinogenesis and are regulated by the epigenetic landscape in those cancers. Thus, we propose that gene expression in HPV+ OPSCC may be controlled by epigenetic alterations in SE elements, which are common between related tissues. Our pipeline can utilize a diversity of data inputs and can be further adapted to SE analysis of diseased and non-diseased tissues from different organisms.


Subjects
Carcinoma, Squamous Cell/genetics , DNA Methylation/genetics , Enhancer Elements, Genetic/genetics , Gene Expression Regulation, Neoplastic , Head and Neck Neoplasms/genetics , Carcinoma, Squamous Cell/virology , Head and Neck Neoplasms/virology , Humans , Papillomaviridae/physiology , Promoter Regions, Genetic/genetics , Reproducibility of Results
20.
Proc Natl Acad Sci U S A ; 116(42): 21104-21112, 2019 10 15.
Article in English | MEDLINE | ID: mdl-31578251

ABSTRACT

Influenza A virus (IAV) is a major public health problem and a pandemic threat. Its evolution is largely driven by diversifying positive selection so that relative fitness of different amino acid variants changes with time due to changes in herd immunity or genomic context, and novel amino acid variants attain fitness advantage. Here, we hypothesize that diversifying selection also has another manifestation: the fitness associated with a particular amino acid variant should decline with time since its origin, as the herd immunity adapts to it. By tracing the evolution of antigenic sites at IAV surface proteins, we show that an amino acid variant becomes progressively more likely to become replaced by another variant with time since its origin, a phenomenon we call "senescence." Senescence is particularly pronounced at experimentally validated antigenic sites, implying that it is largely driven by host immunity. By contrast, at internal sites, existing variants become more favorable with time, probably due to arising contingent mutations at other epistatically interacting sites. Our findings reveal a previously undescribed facet of adaptive evolution and suggest approaches for prediction of evolutionary dynamics of pathogens.


Subjects
Amino Acids/genetics , Influenza A virus/genetics , Membrane Proteins/genetics , Viral Proteins/genetics , Alleles , Amino Acids/immunology , Antigens, Viral/genetics , Antigens, Viral/immunology , Evolution, Molecular , Genetic Variation/genetics , Genetic Variation/immunology , Influenza A virus/immunology , Membrane Proteins/immunology , Pandemics , Viral Proteins/immunology