Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
Add more filters










Publication year range
1.
Database (Oxford) ; 20232023 03 04.
Article in English | MEDLINE | ID: mdl-36869839

ABSTRACT

Long non-coding ribonucleic acids (lncRNAs) account for the largest group of non-coding RNAs. However, knowledge about their function and regulation is limited. lncHUB2 is a web server database that provides known and inferred knowledge about the function of 18 705 human and 11 274 mouse lncRNAs. lncHUB2 produces reports that contain the secondary structure fold of the lncRNA, related publications, the most correlated coding genes, the most correlated lncRNAs, a network that visualizes the most correlated genes, predicted mouse phenotypes, predicted membership in biological processes and pathways, predicted upstream transcription factor regulators, and predicted disease associations. In addition, the reports include subcellular localization information; expression across tissues, cell types, and cell lines, and predicted small molecules and CRISPR knockout (CRISPR-KO) genes prioritized based on their likelihood to up- or downregulate the expression of the lncRNA. Overall, lncHUB2 is a database with rich information about human and mouse lncRNAs and as such it can facilitate hypothesis generation for many future studies. The lncHUB2 database is available at https://maayanlab.cloud/lncHUB2. Database URL: https://maayanlab.cloud/lncHUB2.


Subject(s)
RNA, Long Noncoding , Humans , Animals , Mice , Cell Line , Clustered Regularly Interspaced Short Palindromic Repeats , Databases, Factual , Knowledge
2.
BMC Bioinformatics ; 23(1): 374, 2022 Sep 13.
Article in English | MEDLINE | ID: mdl-36100892

ABSTRACT

The L1000 technology, a cost-effective high-throughput transcriptomics technology, has been applied to profile a collection of human cell lines for their gene expression response to > 30,000 chemical and genetic perturbations. In total, there are currently over 3 million available L1000 profiles. Such a dataset is invaluable for the discovery of drug and target candidates and for inferring mechanisms of action for small molecules. The L1000 assay only measures the mRNA expression of 978 landmark genes while 11,350 additional genes are computationally reliably inferred. The lack of full genome coverage limits knowledge discovery for half of the human protein coding genes, and the potential for integration with other transcriptomics profiling data. Here we present a Deep Learning two-step model that transforms L1000 profiles to RNA-seq-like profiles. The input to the model are the measured 978 landmark genes while the output is a vector of 23,614 RNA-seq-like gene expression profiles. The model first transforms the landmark genes into RNA-seq-like 978 gene profiles using a modified CycleGAN model applied to unpaired data. The transformed 978 RNA-seq-like landmark genes are then extrapolated into the full genome space with a fully connected neural network model. The two-step model achieves 0.914 Pearson's correlation coefficients and 1.167 root mean square errors when tested on a published paired L1000/RNA-seq dataset produced by the LINCS and GTEx programs. The processed RNA-seq-like profiles are made available for download, signature search, and gene centric reverse search with unique case studies.


Subject(s)
Deep Learning , Gene Expression Profiling , Humans , RNA-Seq , Transcriptome
3.
Curr Protoc ; 2(7): e487, 2022 Jul.
Article in English | MEDLINE | ID: mdl-35876555

ABSTRACT

The Library of Integrated Network-based Cellular Signatures (LINCS) was an NIH Common Fund program that aimed to expand our knowledge about human cellular responses to chemical, genetic, and microenvironment perturbations. Responses to perturbations were measured by transcriptomics, proteomics, cellular imaging, and other high content assays. The second phase of the LINCS program, which lasted 7 years, involved the engagement of six data and signature generation centers (DSGCs) and one data coordination and integration center (DCIC). The DSGCs and the DCIC developed several digital resources, including tools, databases, and workflows that aim to facilitate the use of the LINCS data and integrate this data with other publicly available data. The digital resources developed by the DSGCs and the DCIC can be used to gain new biological and pharmacological insights that can lead to the development of novel therapeutics. This protocol provides step-by-step instructions for processing the LINCS data into signatures, and utilizing the digital resources developed by the LINCS consortia for hypothesis generation and knowledge discovery. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Navigating L1000 tools and data in CLUE.io Basic Protocol 2: Computing signatures from the L1000 data with the CD method Basic Protocol 3: Analyzing lists of differentially expressed genes and querying them against the L1000 data with BioJupies and the Bulk RNA-seq Appyter Basic Protocol 4: Utilizing the L1000FWD resource for drug discovery Basic Protocol 5: KINOMEscan and the KINOMEscan Appyter Basic Protocol 6: LINCS P100 and GCP Proteomics Assays Basic Protocol 7: The LINCS Joint Project (LJP) Basic Protocol 8: The LINCS Data Portals and SigCom LINCS Basic Protocol 9: Creating and analyzing signatures with iLINCS.


Subject(s)
Drug Discovery , Proteomics , Databases, Factual , Drug Discovery/methods , Gene Library , Humans , Transcriptome
4.
Nucleic Acids Res ; 50(W1): W697-W709, 2022 07 05.
Article in English | MEDLINE | ID: mdl-35524556

ABSTRACT

Millions of transcriptome samples were generated by the Library of Integrated Network-based Cellular Signatures (LINCS) program. When these data are processed into searchable signatures along with signatures extracted from Genotype-Tissue Expression (GTEx) and Gene Expression Omnibus (GEO), connections between drugs, genes, pathways and diseases can be illuminated. SigCom LINCS is a webserver that serves over a million gene expression signatures processed, analyzed, and visualized from LINCS, GTEx, and GEO. SigCom LINCS is built with Signature Commons, a cloud-agnostic skeleton Data Commons with a focus on serving searchable signatures. SigCom LINCS provides a rapid signature similarity search for mimickers and reversers given sets of up and down genes, a gene set, a single gene, or any search term. Additionally, users of SigCom LINCS can perform a metadata search to find and analyze subsets of signatures and find information about genes and drugs. SigCom LINCS is findable, accessible, interoperable, and reusable (FAIR) with metadata linked to standard ontologies and vocabularies. In addition, all the data and signatures within SigCom LINCS are available via a well-documented API. In summary, SigCom LINCS, available at https://maayanlab.cloud/sigcom-lincs, is a rich webserver resource for accelerating drug and target discovery in systems pharmacology.


Subject(s)
Metadata , Transcriptome , Transcriptome/genetics , Search Engine
5.
Patterns (N Y) ; 2(3): 100213, 2021 Mar 12.
Article in English | MEDLINE | ID: mdl-33748796

ABSTRACT

Jupyter Notebooks have transformed the communication of data analysis pipelines by facilitating a modular structure that brings together code, markdown text, and interactive visualizations. Here, we extended Jupyter Notebooks to broaden their accessibility with Appyters. Appyters turn Jupyter Notebooks into fully functional standalone web-based bioinformatics applications. Appyters present to users an entry form enabling them to upload their data and set various parameters for a multitude of data analysis workflows. Once the form is filled, the Appyter executes the corresponding notebook in the cloud, producing the output without requiring the user to interact directly with the code. Appyters were used to create many bioinformatics web-based reusable workflows, including applications to build customized machine learning pipelines, analyze omics data, and produce publishable figures. These Appyters are served in the Appyters Catalog at https://appyters.maayanlab.cloud. In summary, Appyters enable the rapid development of interactive web-based bioinformatics applications.

6.
Front Immunol ; 12: 636289, 2021.
Article in English | MEDLINE | ID: mdl-33763080

ABSTRACT

Although widely prevalent, Lyme disease is still under-diagnosed and misunderstood. Here we followed 73 acute Lyme disease patients and uninfected controls over a period of a year. At each visit, RNA-sequencing was applied to profile patients' peripheral blood mononuclear cells in addition to extensive clinical phenotyping. Based on the projection of the RNA-seq data into lower dimensions, we observe that the cases are separated from controls, and almost all cases never return to cluster with the controls over time. Enrichment analysis of the differentially expressed genes between clusters identifies up-regulation of immune response genes. This observation is also supported by deconvolution analysis to identify the changes in cell type composition due to Lyme disease infection. Importantly, we developed several machine learning classifiers that attempt to perform various Lyme disease classifications. We show that Lyme patients can be distinguished from the controls as well as from COVID-19 patients, but classification was not successful in distinguishing those patients with early Lyme disease cases that would advance to develop post-treatment persistent symptoms.


Subject(s)
Leukocytes, Mononuclear/immunology , Lyme Disease/genetics , Adult , COVID-19/genetics , COVID-19/immunology , Cytokines/genetics , Cytokines/immunology , Female , Follow-Up Studies , Humans , Leukocytes, Mononuclear/chemistry , Lyme Disease/blood , Lyme Disease/immunology , Machine Learning , Male , Middle Aged , Prospective Studies , RNA-Seq
7.
Curr Protoc ; 1(3): e90, 2021 Mar.
Article in English | MEDLINE | ID: mdl-33780170

ABSTRACT

Profiling samples from patients, tissues, and cells with genomics, transcriptomics, epigenomics, proteomics, and metabolomics ultimately produces lists of genes and proteins that need to be further analyzed and integrated in the context of known biology. Enrichr (Chen et al., 2013; Kuleshov et al., 2016) is a gene set search engine that enables the querying of hundreds of thousands of annotated gene sets. Enrichr uniquely integrates knowledge from many high-profile projects to provide synthesized information about mammalian genes and gene sets. The platform provides various methods to compute gene set enrichment, and the results are visualized in several interactive ways. This protocol provides a summary of the key features of Enrichr, which include using Enrichr programmatically and embedding an Enrichr button on any website. © 2021 Wiley Periodicals LLC. Basic Protocol 1: Analyzing lists of differentially expressed genes from transcriptomics, proteomics and phosphoproteomics, GWAS studies, or other experimental studies Basic Protocol 2: Searching Enrichr by a single gene or key search term Basic Protocol 3: Preparing raw or processed RNA-seq data through BioJupies in preparation for Enrichr analysis Basic Protocol 4: Analyzing gene sets for model organisms using modEnrichr Basic Protocol 5: Using Enrichr in Geneshot Basic Protocol 6: Using Enrichr in ARCHS4 Basic Protocol 7: Using the enrichment analysis visualization Appyter to visualize Enrichr results Basic Protocol 8: Using the Enrichr API Basic Protocol 9: Adding an Enrichr button to a website.


Subject(s)
Knowledge Discovery , Software , Animals , Computational Biology , Genomics , Humans , RNA-Seq
8.
Diabetes ; 70(2): 589-602, 2021 02.
Article in English | MEDLINE | ID: mdl-33067313

ABSTRACT

Diabetic kidney disease (DKD) remains the most common cause of kidney failure, and the treatment options are insufficient. Here, we used a connectivity mapping approach to first collect 15 gene expression signatures from 11 DKD-related published independent studies. Then, by querying the Library of Integrated Network-based Cellular Signatures (LINCS) L1000 data set, we identified drugs and other bioactive small molecules that are predicted to reverse these gene signatures in the diabetic kidney. Among the top consensus candidates, we selected a PLK1 inhibitor (BI-2536) for further experimental validation. We found that PLK1 expression was increased in the glomeruli of both human and mouse diabetic kidneys and localized largely in mesangial cells. We also found that BI-2536 inhibited mesangial cell proliferation and extracellular matrix in vitro and ameliorated proteinuria and kidney injury in DKD mice. Further pathway analysis of the genes predicted to be reversed by the PLK1 inhibitor was of members of the TNF-α/NF-κB, JAK/STAT, and TGF-ß/Smad3 pathways. In vitro, either BI-2536 treatment or knockdown of PLK1 dampened the NF-κB and Smad3 signal transduction and transcriptional activation. Together, these results suggest that the PLK1 inhibitor BI-2536 should be further investigated as a novel therapy for DKD.


Subject(s)
Diabetes Mellitus , Diabetic Nephropathies , Pharmaceutical Preparations , Animals , Diabetic Nephropathies/drug therapy , Kidney , Mice , Pteridines , Transcriptome
9.
Glia ; 68(10): 2148-2166, 2020 10.
Article in English | MEDLINE | ID: mdl-32639068

ABSTRACT

Glioblastoma (GBM) is the most aggressive primary brain tumor. In addition to being genetically heterogeneous, GBMs are also immunologically heterogeneous. However, whether the differences in immune microenvironment are driven by genetic driver mutation is unexplored. By leveraging the versatile RCAS/tv-a somatic gene transfer system, we establish a mouse model for Classical GBM by introducing EGFRvIII expression in Nestin-positive neural stem/progenitor cells in adult mice. Along with our previously published Nf1-silenced and PDGFB-overexpressing models, we investigate the immune microenvironments of the three models of human GBM subtypes by unbiased multiplex profiling. We demonstrate that both the quantity and composition of the microenvironmental myeloid cells are dictated by the genetic driver mutations, closely mimicking what was observed in human GBM subtypes. These myeloid cells express high levels of the immune checkpoint protein PD-L1; however, PD-L1 targeted therapies alone or in combination with irradiation are unable to increase the survival time of tumor-bearing mice regardless of the driver mutations, reflecting the outcomes of recent human trials. Together, these results highlight the critical utility of immunocompetent mouse models for preclinical studies of GBM, making these models indispensable tools for understanding the resistance mechanisms of immune checkpoint blockade in GBM and immune cell-targeting drug discovery.


Subject(s)
Brain Neoplasms/genetics , Brain Neoplasms/immunology , Glioblastoma/genetics , Glioblastoma/immunology , Immune Checkpoint Inhibitors/therapeutic use , Mutation/physiology , Animals , Brain Neoplasms/drug therapy , Brain Neoplasms/pathology , Female , Glioblastoma/drug therapy , Glioblastoma/pathology , Humans , Male , Mice , Mice, Inbred C57BL , Mice, Transgenic , Tumor Cells, Cultured
11.
Cell Syst ; 9(5): 417-421, 2019 11 27.
Article in English | MEDLINE | ID: mdl-31677972

ABSTRACT

As more digital resources are produced by the research community, it is becoming increasingly important to harmonize and organize them for synergistic utilization. The findable, accessible, interoperable, and reusable (FAIR) guiding principles have prompted many stakeholders to consider strategies for tackling this challenge. The FAIRshake toolkit was developed to enable the establishment of community-driven FAIR metrics and rubrics paired with manual and automated FAIR assessments. FAIR assessments are visualized as an insignia that can be embedded within digital-resources-hosting websites. Using FAIRshake, a variety of biomedical digital resources were manually and automatically evaluated for their level of FAIRness.


Subject(s)
Information Dissemination/methods , Internet/trends , Online Systems/standards , Health Resources/standards , Humans
12.
Nucleic Acids Res ; 47(W1): W183-W190, 2019 07 02.
Article in English | MEDLINE | ID: mdl-31069376

ABSTRACT

High-throughput experiments produce increasingly large datasets that are difficult to analyze and integrate. While most data integration approaches focus on aligning metadata, data integration can be achieved by abstracting experimental results into gene sets. Such gene sets can be made available for reuse through gene set enrichment analysis tools such as Enrichr. Enrichr currently only supports gene sets compiled from human and mouse, limiting accessibility for investigators that study other model organisms. modEnrichr is an expansion of Enrichr for four model organisms: fish, fly, worm and yeast. The gene set libraries within FishEnrichr, FlyEnrichr, WormEnrichr and YeastEnrichr are created from the Gene Ontology, mRNA expression profiles, GeneRIF, pathway databases, protein domain databases and other organism-specific resources. Additionally, libraries were created by predicting gene function from RNA-seq co-expression data processed uniformly from the gene expression omnibus for each organism. The modEnrichr suite of tools provides the ability to convert gene lists across species using an ortholog conversion tool that automatically detects the species. For complex analyses, modEnrichr provides API access that enables submitting batch queries. In summary, modEnrichr leverages existing model organism databases and other resources to facilitate comprehensive hypothesis generation through data integration.


Subject(s)
Databases, Genetic , Gene Expression/genetics , Gene Library , Genomic Library , Software , Animals , Computational Biology , Gene Ontology , Humans , Metadata
13.
Nucleic Acids Res ; 47(W1): W571-W577, 2019 07 02.
Article in English | MEDLINE | ID: mdl-31114885

ABSTRACT

The frequency by which genes are studied correlates with the prior knowledge accumulated about them. This leads to an imbalance in research attention where some genes are highly investigated while others are ignored. Geneshot is a search engine developed to illuminate this gap and to promote attention to the under-studied genome. Through a simple web interface, Geneshot enables researchers to enter arbitrary search terms, to receive ranked lists of genes relevant to the search terms. Returned ranked gene lists contain genes that were previously published in association with the search terms, as well as genes predicted to be associated with the terms based on data integration from multiple sources. The search results are presented with interactive visualizations. To predict gene function, Geneshot utilizes gene-gene similarity matrices from processed RNA-seq data, or from gene-gene co-occurrence data obtained from multiple sources. In addition, Geneshot can be used to analyze the novelty of gene sets and augment gene sets with additional relevant genes. The Geneshot web-server and API are freely and openly available from https://amp.pharm.mssm.edu/geneshot.


Subject(s)
Genes , Software , Data Mining , Gene Expression , Internet , Publications , RNA-Seq , Research Personnel , User-Computer Interface
14.
Nucleic Acids Res ; 47(W1): W212-W224, 2019 07 02.
Article in English | MEDLINE | ID: mdl-31114921

ABSTRACT

Identifying the transcription factors (TFs) responsible for observed changes in gene expression is an important step in understanding gene regulatory networks. ChIP-X Enrichment Analysis 3 (ChEA3) is a transcription factor enrichment analysis tool that ranks TFs associated with user-submitted gene sets. The ChEA3 background database contains a collection of gene set libraries generated from multiple sources including TF-gene co-expression from RNA-seq studies, TF-target associations from ChIP-seq experiments, and TF-gene co-occurrence computed from crowd-submitted gene lists. Enrichment results from these distinct sources are integrated to generate a composite rank that improves the prediction of the correct upstream TF compared to ranks produced by individual libraries. We compare ChEA3 with existing TF prediction tools and show that ChEA3 performs better. By integrating the ChEA3 libraries, we illuminate general transcription factor properties such as whether the TF behaves as an activator or a repressor. The ChEA3 web-server is available from https://amp.pharm.mssm.edu/ChEA3.


Subject(s)
Computational Biology/methods , Databases, Genetic , Gene Library , Transcription Factors/genetics , Chromatin Immunoprecipitation Sequencing/methods , Datasets as Topic , Gene Expression Regulation/genetics , Gene Regulatory Networks/genetics , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...