Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 57
Filter
1.
PLoS One ; 18(11): e0284232, 2023.
Article in English | MEDLINE | ID: mdl-37910468

ABSTRACT

Pancreatic ductal adenocarcinoma (PDAC) is a poor prognosis cancer with an aggressive growth profile that is often diagnosed at late stage and that has few curative or therapeutic options. PDAC growth has been linked to alterations in the pancreas microbiome, which could include the presence of the fungus Malassezia. We used RNA-sequencing to compare 14 matched tumor and normal (tumor adjacent) pancreatic cancer samples and found Malassezia RNA in both the PDAC and normal tissues. Although the presence of Malassezia was not correlated with tumor growth, a set of immune- and inflammatory-related genes were up-regulated in the PDAC compared to the normal samples, suggesting that they are involved in tumor progression. Gene set enrichment analysis suggests that activation of the complement cascade pathway and inflammation could be involved in pro PDAC growth.


Subject(s)
Carcinoma, Pancreatic Ductal , Pancreatic Neoplasms , Humans , Pancreatic Neoplasms/pathology , Carcinoma, Pancreatic Ductal/pathology , Pancreas/pathology , RNA/metabolism , Prognosis , Gene Expression Regulation, Neoplastic
2.
J Transl Med ; 21(1): 783, 2023 11 04.
Article in English | MEDLINE | ID: mdl-37925448

ABSTRACT

Prior research has shown that the deconvolution of cell-free RNA can uncover the tissue origin. The conventional deconvolution approaches rely on constructing a reference tissue-specific gene panel, which cannot capture the inherent variation present in actual data. To address this, we have developed a novel method that utilizes a neural network framework to leverage the entire training dataset. Our approach involved training a model that incorporated 15 distinct tissue types. Through one semi-independent and two complete independent validations, including deconvolution using a semi in silico dataset, deconvolution with a custom normal tissue mixture RNA-seq data, and deconvolution of longitudinal circulating tumor cell RNA-seq (ctcRNA) data from a cancer patient with metastatic tumors, we demonstrate the efficacy and advantages of the deep-learning approach which were exerted by effectively capturing the inherent variability present in the dataset, thus leading to enhanced accuracy. Sensitivity analyses reveal that neural network models are less susceptible to the presence of missing data, making them more suitable for real-world applications. Moreover, by leveraging the concept of organotropism, we applied our approach to trace the migration of circulating tumor cell-derived RNA (ctcRNA) in a cancer patient with metastatic tumors, thereby highlighting the potential clinical significance of early detection of cancer metastasis.


Subject(s)
Neoplastic Cells, Circulating , RNA , Humans , Neural Networks, Computer , RNA-Seq , Sequence Analysis, RNA
3.
Bioinformatics ; 39(9)2023 09 02.
Article in English | MEDLINE | ID: mdl-37624931

ABSTRACT

MOTIVATION: As an important player in transcriptome regulation, microRNAs may effectively diffuse somatic mutation impacts to broad cellular processes and ultimately manifest disease and dictate prognosis. Previous studies that tried to correlate mutation with gene expression dysregulation neglected to adjust for the disparate multitudes of false positives associated with unequal sample sizes and uneven class balancing scenarios. RESULTS: To properly address this issue, we developed a statistical framework to rigorously assess the extent of mutation impact on microRNAs in relation to a permutation-based null distribution of a matching sample structure. Carrying out the framework in a pan-cancer study, we ascertained 9008 protein-coding genes with statistically significant mutation impacts on miRNAs. Of these, the collective miRNA expression for 83 genes showed significant prognostic power in nine cancer types. For example, in lower-grade glioma, 10 genes' mutations broadly impacted miRNAs, all of which showed prognostic value with the corresponding miRNA expression. Our framework was further validated with functional analysis and augmented with rich features including the ability to analyze miRNA isoforms; aggregative prognostic analysis; advanced annotations such as mutation type, regulator alteration, somatic motif, and disease association; and instructive visualization such as mutation OncoPrint, Ideogram, and interactive mRNA-miRNA network. AVAILABILITY AND IMPLEMENTATION: The data underlying this article are available in MutMix, at http://innovebioinfo.com/Database/TmiEx/MutMix.php.


Subject(s)
Glioma , MicroRNAs , Humans , Diffusion , MicroRNAs/genetics , Mutation , RNA, Messenger
4.
Cancer Res Commun ; 3(2): 309-324, 2023 02.
Article in English | MEDLINE | ID: mdl-36860657

ABSTRACT

The importance of the immune microenvironment in ovarian cancer progression, metastasis, and response to therapies has become increasingly clear, especially with the new emphasis on immunotherapies. To leverage the power of patient-derived xenograft (PDX) models within a humanized immune microenvironment, three ovarian cancer PDXs were grown in humanized NBSGW (huNBSGW) mice engrafted with human CD34+ cord blood-derived hematopoietic stem cells. Analysis of cytokine levels in the ascites fluid and identification of infiltrating immune cells in the tumors demonstrated that these humanized PDX (huPDX) established an immune tumor microenvironment similar to what has been reported for patients with ovarian cancer. The lack of human myeloid cell differentiation has been a major setback for humanized mouse models, but our analysis shows that PDX engraftment increases the human myeloid population in the peripheral blood. Analysis of cytokines within the ascites fluid of huPDX revealed high levels of human M-CSF, a key myeloid differentiation factor as well as other elevated cytokines that have previously been identified in ovarian cancer patient ascites fluid including those involved in immune cell differentiation and recruitment. Human tumor-associated macrophages and tumor-infiltrating lymphocytes were detected within the tumors of humanized mice, demonstrating immune cell recruitment to tumors. Comparison of the three huPDX revealed certain differences in cytokine signatures and in the extent of immune cell recruitment. Our studies show that huNBSGW PDX models reconstitute important aspects of the ovarian cancer immune tumor microenvironment, which may recommend these models for preclinical therapeutic trials. Significance: huPDX models are ideal preclinical models for testing novel therapies. They reflect the genetic heterogeneity of the patient population, enhance human myeloid differentiation, and recruit immune cells to the tumor microenvironment.


Subject(s)
Ovarian Neoplasms , Peritoneal Cavity , Humans , Mice , Animals , Female , Heterografts , Ascites , Ovarian Neoplasms/therapy , Cytokines , Tumor Microenvironment
5.
Cancers (Basel) ; 15(5)2023 Feb 22.
Article in English | MEDLINE | ID: mdl-36900183

ABSTRACT

Adenoid cystic carcinoma (ACC) is an aggressive malignancy that most often arises in salivary or lacrimal glands but can also occur in other tissues. We used optimized RNA-sequencing to analyze the transcriptomes of 113 ACC tumor samples from salivary gland, lacrimal gland, breast or skin. ACC tumors from different organs displayed remarkedly similar transcription profiles, and most harbored translocations in the MYB or MYBL1 genes, which encode oncogenic transcription factors that may induce dramatic genetic and epigenetic changes leading to a dominant 'ACC phenotype'. Further analysis of the 56 salivary gland ACC tumors led to the identification of three distinct groups of patients, based on gene expression profiles, including one group with worse survival. We tested whether this new cohort could be used to validate a biomarker developed previously with a different set of 68 ACC tumor samples. Indeed, a 49-gene classifier developed with the earlier cohort correctly identified 98% of the poor survival patients from the new set, and a 14-gene classifier was almost as accurate. These validated biomarkers form a platform to identify and stratify high-risk ACC patients into clinical trials of targeted therapies for sustained clinical response.

6.
Genome Res ; 32(10): 1930-1940, 2022 10.
Article in English | MEDLINE | ID: mdl-36100435

ABSTRACT

Mutation density patterns reveal unique biological properties of specific genomic regions and shed light on the mechanisms of carcinogenesis. Although previous studies reported insightful mutation density patterns associated with certain genomic regions such as transcription start sites and DNA replication origins, a tool that can systematically investigate mutational spatial patterns is still lacking. Thus, we developed MutDens, a bioinformatic tool for comprehensive analysis of mutation density patterns around genomic features, namely, genomic positions, in humans and model species. By scanning the bidirectional vicinity regions of given positions, MutDens systematically characterizes the mutation density for single-base substitution mutational classes after adjusting for total mutation burden and local nucleotide proportion. Analysis results using MutDens not only verified the previously reported transcriptional strand bias around transcription start sites and replicative strand bias around DNA replication origins, but also identified novel mutation density patterns around other genomics features, such as enhancers and retrotransposon insertion polymorphism sites. To our knowledge, MutDens is the first tool that systematically calculates, examines, and compares mutation density patterns, thus providing a valuable avenue for investigating the mutational landscapes associated with important genomic features.


Subject(s)
Genomics , Replication Origin , Humans , Mutation , Transcription Initiation Site , DNA
7.
Elife ; 112022 07 05.
Article in English | MEDLINE | ID: mdl-35787784

ABSTRACT

Background: Lymphatic malformations (LMs) often pose treatment challenges due to a large size or a critical location that could lead to disfigurement, and there are no standardized treatment approaches for either refractory or unresectable cases. Methods: We examined the genomic landscape of a patient cohort of LMs (n = 30 cases) that underwent comprehensive genomic profiling using a large-panel next-generation sequencing assay. Immunohistochemical analyses were completed in parallel. Results: These LMs had low mutational burden with hotspot PIK3CA mutations (n = 20) and NRAS (n = 5) mutations being most frequent, and mutually exclusive. All LM cases with Kaposi sarcoma-like (kaposiform) histology had NRAS mutations. One index patient presented with subacute abdominal pain and was diagnosed with a large retroperitoneal LM harboring a somatic PIK3CA gain-of-function mutation (H1047R). The patient achieved a rapid and durable radiologic complete response, as defined in RECIST1.1, to the PI3Kα inhibitor alpelisib within the context of a personalized N-of-1 clinical trial (NCT03941782). In translational correlative studies, canonical PI3Kα pathway activation was confirmed by immunohistochemistry and human LM-derived lymphatic endothelial cells carrying an allele with an activating mutation at the same locus were sensitive to alpelisib treatment in vitro, which was demonstrated by a concentration-dependent drop in measurable impedance, an assessment of cell status. Conclusions: Our findings establish that LM patients with conventional or kaposiform histology have distinct, yet targetable, driver mutations. Funding: R.P. and W.A. are supported by awards from the Levy-Longenbaugh Fund. S.G. is supported by awards from the Hugs for Brady Foundation. This work has been funded in part by the NCI Cancer Center Support Grants (CCSG; P30) to the University of Arizona Cancer Center (CA023074), the University of New Mexico Comprehensive Cancer Center (CA118100), and the Rutgers Cancer Institute of New Jersey (CA072720). B.K.M. was supported by National Science Foundation via Graduate Research Fellowship DGE-1143953. Clinical trial number: NCT03941782.


Subject(s)
Antineoplastic Agents , Class I Phosphatidylinositol 3-Kinases , GTP Phosphohydrolases , Lymphangioma , Lymphatic Abnormalities , Membrane Proteins , Thiazoles , Antineoplastic Agents/pharmacology , Antineoplastic Agents/therapeutic use , Class I Phosphatidylinositol 3-Kinases/antagonists & inhibitors , Class I Phosphatidylinositol 3-Kinases/genetics , Class I Phosphatidylinositol 3-Kinases/metabolism , Class Ia Phosphatidylinositol 3-Kinase/metabolism , Endothelial Cells/drug effects , Endothelial Cells/metabolism , GTP Phosphohydrolases/genetics , Genomics , High-Throughput Nucleotide Sequencing , Humans , Immunohistochemistry , Lymphangioma/drug therapy , Lymphangioma/genetics , Lymphatic Abnormalities/drug therapy , Lymphatic Abnormalities/genetics , Membrane Proteins/genetics , Mutation , Sequence Analysis, DNA , Thiazoles/pharmacology , Thiazoles/therapeutic use
8.
Cancers (Basel) ; 14(2)2022 Jan 14.
Article in English | MEDLINE | ID: mdl-35053577

ABSTRACT

Somatic mutations are one of the most important factors in tumorigenesis and are the focus of most cancer-sequencing efforts. The co-occurrence of multiple mutations in one tumor has gained increasing attention as a means of identifying cooperating mutations or pathways that contribute to cancer. Using multi-omics, phenotypical, and clinical data from 29,559 cancer subjects and 1747 cancer cell lines covering 78 distinct cancer types, we show that co-mutations are associated with prognosis, drug sensitivity, and disparities in sex, age, and race. Some co-mutation combinations displayed stronger effects than their corresponding single mutations. For example, co-mutation TP53:KRAS in pancreatic adenocarcinoma is significantly associated with disease specific survival (hazard ratio = 2.87, adjusted p-value = 0.0003) and its prognostic predictive power is greater than either TP53 or KRAS as individually mutated genes. Functional analyses revealed that co-mutations with higher prognostic values have higher potential impact and cause greater dysregulation of gene expression. Furthermore, many of the prognostically significant co-mutations caused gains or losses of binding sequences of RNA binding proteins or micro RNAs with known cancer associations. Thus, detailed analyses of co-mutations can identify mechanisms that cooperate in tumorigenesis.

9.
Nucleic Acids Res ; 50(1): e4, 2022 01 11.
Article in English | MEDLINE | ID: mdl-34606615

ABSTRACT

Efficient annotation of alterations in binding sequences of molecular regulators can help identify novel candidates for mechanisms study and offer original therapeutic hypotheses. In this work, we developed Somatic Binding Sequence Annotator (SBSA) as a full-capacity online tool to annotate altered binding motifs/sequences, addressing diverse types of genomic variants and molecular regulators. The genomic variants can be somatic mutation, single nucleotide polymorphism, RNA editing, etc. The binding motifs/sequences involve transcription factors (TFs), RNA-binding proteins, miRNA seeds, miRNA-mRNA 3'-UTR binding target, or can be any custom motifs/sequences. Compared to similar tools, SBSA is the first to support miRNA seeds and miRNA-mRNA 3'-UTR binding target, and it unprecedentedly implements a personalized genome approach that accommodates joint adjacent variants. SBSA is empowered to support an indefinite species, including preloaded reference genomes for SARS-Cov-2 and 25 other common organisms. We demonstrated SBSA by annotating multi-omics data from over 30,890 human subjects. Of the millions of somatic binding sequences identified, many are with known severe biological repercussions, such as the somatic mutation in TERT promoter region which causes a gained binding sequence for E26 transformation-specific factor (ETS1). We further validated the function of this TERT mutation using experimental data in cancer cells. Availability:http://innovebioinfo.com/Annotation/SBSA/SBSA.php.


Subject(s)
COVID-19/virology , Computational Biology/instrumentation , Genomics/instrumentation , Mutation , Proteomics/instrumentation , SARS-CoV-2 , 3' Untranslated Regions , Algorithms , Amino Acid Motifs , COVID-19/metabolism , Computational Biology/methods , Computers , Genetic Techniques , Genome, Human , Genomics/methods , Humans , Internet , MicroRNAs/metabolism , Phenotype , Promoter Regions, Genetic , Protein Binding , Proteomics/methods , Proto-Oncogene Protein c-ets-1/genetics , Proto-Oncogene Protein c-ets-1/metabolism , RNA-Binding Proteins/metabolism , Telomerase/metabolism
10.
Genomics ; 113(6): 3864-3871, 2021 11.
Article in English | MEDLINE | ID: mdl-34562567

ABSTRACT

RNA editing exerts critical impacts on numerous biological processes. While millions of RNA editings have been identified in humans, much more are expected to be discovered. In this work, we constructed Convolutional Neural Network (CNN) models to predict human RNA editing events in both Alu regions and non-Alu regions. With a validation dataset resulting from CRISPR/Cas9 knockout of the ADAR1 enzyme, the validation accuracies reached 99.5% and 93.6% for Alu and non-Alu regions, respectively. We ported our CNN models in a web service named EditPredict. EditPredict not only works on reference genome sequences but can also take into consideration single nucleotide variants in personal genomes. In addition to the human genome, EditPredict tackles other model organisms including bumblebee, fruitfly, mouse, and squid genomes. EditPredict can be used stand-alone to predict novel RNA editing and it can be used to assist in filtering for candidate RNA editing detected from RNA-Seq data.


Subject(s)
Neural Networks, Computer , RNA Editing , Animals , Genome , RNA , RNA-Seq
11.
PLoS Comput Biol ; 17(5): e1008976, 2021 05.
Article in English | MEDLINE | ID: mdl-33945541

ABSTRACT

Analyses of gene set differential coexpression may shed light on molecular mechanisms underlying phenotypes and diseases. However, differential coexpression analyses of conceptually similar individual studies are often inconsistent and underpowered to provide definitive results. Researchers can greatly benefit from an open-source application facilitating the aggregation of evidence of differential coexpression across studies and the estimation of more robust common effects. We developed Meta Gene Set Coexpression Analysis (MetaGSCA), an analytical tool to systematically assess differential coexpression of an a priori defined gene set by aggregating evidence across studies to provide a definitive result. In the kernel, a nonparametric approach that accounts for the gene-gene correlation structure is used to test whether the gene set is differentially coexpressed between two comparative conditions, from which a permutation test p-statistic is computed for each individual study. A meta-analysis is then performed to combine individual study results with one of two options: a random-intercept logistic regression model or the inverse variance method. We demonstrated MetaGSCA in case studies investigating two human diseases and identified pathways highly relevant to each disease across studies. We further applied MetaGSCA in a pan-cancer analysis with hundreds of major cellular pathways in 11 cancer types. The results indicated that a majority of the pathways identified were dysregulated in the pan-cancer scenario, many of which have been previously reported in the cancer literature. Our analysis with randomly generated gene sets showed excellent specificity, indicating that the significant pathways/gene sets identified by MetaGSCA are unlikely false positives. MetaGSCA is a user-friendly tool implemented in both forms of a Web-based application and an R package "MetaGSCA". It enables comprehensive meta-analyses of gene set differential coexpression data, with an optional module of post hoc pathway crosstalk network analysis to identify and visualize pathways having similar coexpression profiles.


Subject(s)
Gene Expression Regulation , Algorithms , Computational Biology/methods , Gene Regulatory Networks , Humans , Neoplasms/genetics
12.
Nat Protoc ; 16(4): 2190-2212, 2021 04.
Article in English | MEDLINE | ID: mdl-33731963

ABSTRACT

UV radiation may lead to melanoma and nonmelanoma skin cancers by causing helix-distorting DNA damage such as cyclobutane pyrimidine dimers (CPDs). These DNA lesions, if located in important genes and not repaired promptly, are mutagenic and may eventually result in carcinogenesis. Examining CPD formation and repair processes across the genome can shed light on the mutagenesis mechanisms associated with UV damage in relevant cancers. We recently developed CPD-Seq, a high-throughput and single-nucleotide resolution sequencing technique that can specifically capture UV-induced CPD lesions across the genome. This novel technique has been increasingly used in studies of UV damage and can be adapted to sequence other clinically relevant DNA lesions. Although the library preparation protocol has been established, a systematic protocol to analyze CPD-Seq data has not been described yet. To streamline the various general or specific analysis steps, we developed a protocol named CPDSeqer to assist researchers with CPD-Seq data processing. CPDSeqer can accommodate both a single- and multiple-sample experimental design, and it allows both genome-wide analyses and regional scrutiny (such as of suspected UV damage hotspots). The runtime of CPDSeqer scales with raw data size and takes roughly 4 h per sample with the possibility of acceleration by parallel computing. Various guiding graphics are generated to help diagnose the performance of the experiment and inform regional enrichment of CPD formation. UV damage comparison analyses are set forth in three analysis scenarios, and the resulting HTML pages report damage directional trends and statistical significance. CPDSeqer can be accessed at https://github.com/shengqh/cpdseqer .


Subject(s)
Pyrimidine Dimers/genetics , Sequence Analysis, DNA/methods , Databases, Genetic , Gene Expression Regulation , Genome , Humans , Nucleosomes/metabolism , Quality Control , Ultraviolet Rays
13.
Cancers (Basel) ; 12(12)2020 Dec 04.
Article in English | MEDLINE | ID: mdl-33291726

ABSTRACT

Global autozygosity quantifies the genome-wide levels of homozygous and heterozygous variants. It is the signature of non-random reproduction, though it can also be driven by other factors, and has been used to assess risk in various diseases. However, the association between global autozygosity and cancer risk has not been studied. From 4057 cancer subjects and 1668 healthy controls, we found strong associations between global autozygosity and risk in ten different cancer types. For example, the heterozygosity ratio was found to be significantly associated with breast invasive carcinoma in Blacks and with male skin cutaneous melanoma in Caucasians. We also discovered eleven associations between global autozygosity and mutational signatures which can explain a portion of the etiology. Furthermore, four significant associations for heterozygosity ratio were revealed in disease-specific survival analyses. This study demonstrates that global autozygosity is effective for cancer risk assessment.

14.
PLoS One ; 15(10): e0232646, 2020.
Article in English | MEDLINE | ID: mdl-33035235

ABSTRACT

Changes in gene expression can correlate with poor disease outcomes in two ways: through changes in relative transcript levels or through alternative RNA splicing leading to changes in relative abundance of individual transcript isoforms. The objective of this research is to develop new statistical methods in detecting and analyzing both differentially expressed and spliced isoforms, which appropriately account for the dependence between isoforms and multiple testing corrections for the multi-dimensional structure of at both the gene- and isoform- level. We developed a linear mixed effects model-based approach for analyzing the complex alternative RNA splicing regulation patterns detected by whole-transcriptome RNA-sequencing technologies. This approach thoroughly characterizes and differentiates three types of genes related to alternative RNA splicing events with distinct differential expression/splicing patterns. We applied the concept of appropriately controlling for the gene-level overall false discovery rate (OFDR) in this multi-dimensional alternative RNA splicing analysis utilizing a two-step hierarchical hypothesis testing framework. In the initial screening test we identify genes that have differentially expressed or spliced isoforms; in the subsequent confirmatory testing stage we examine only the isoforms for genes that have passed the screening tests. Comparisons with other methods through application to a whole transcriptome RNA-Seq study of adenoid cystic carcinoma and extensive simulation studies have demonstrated the advantages and improved performances of our method. Our proposed method appropriately controls the gene-level OFDR, maintains statistical power, and is flexible to incorporate advanced experimental designs.


Subject(s)
Alternative Splicing , Carcinoma, Adenoid Cystic/genetics , Gene Expression Profiling/methods , Gene Regulatory Networks , Leukemia, Myeloid, Acute/genetics , Algorithms , Child , Gene Expression Regulation, Neoplastic , Humans , Linear Models , Models, Genetic , RNA, Messenger/genetics , Sequence Analysis, RNA/methods , Exome Sequencing
15.
NAR Cancer ; 2(4): zcaa030, 2020 Dec.
Article in English | MEDLINE | ID: mdl-33094288

ABSTRACT

Binding motifs for transcription factors, RNA-binding proteins, microRNAs (miRNAs), etc. are vital for proper gene transcription and translation regulation. Sequence alteration mechanisms including single nucleotide mutations, insertion, deletion, RNA editing and single nucleotide polymorphism can lead to gains and losses of binding motifs; such consequentially emerged or vanished binding motifs are termed 'somatic motifs' by us. Somatic motifs have been studied sporadically but have never been curated into a comprehensive resource. By analyzing various types of sequence altering data from large consortiums, we successfully identified millions of somatic motifs, including those for important transcription factors, RNA-binding proteins, miRNA seeds and miRNA-mRNA 3'-UTR target motifs. While a few of these somatic motifs have been well studied, our results contain many novel somatic motifs that occur at high frequency and are thus likely to cause important biological repercussions. Genes targeted by these altered motifs are excellent candidates for further mechanism studies. Here, we present the first database that hosts millions of somatic motifs ascribed to a variety of sequence alteration mechanisms.

16.
Cancers (Basel) ; 12(9)2020 Aug 27.
Article in English | MEDLINE | ID: mdl-32867110

ABSTRACT

Acinic cell carcinoma (AcCC) is a morphologically distinctive salivary gland malignancy often associated with chromosome rearrangements leading to overexpression of the NR4A3 transcription factor. However, little is known about how NR4A3 contributes to AcCC biology. Detailed RNA-sequencing of 21 archived AcCC samples revealed fusion reads arising from recurrent t(4;9), t(9;12), t(8;9) or t(2;4) chromosomal translocations, which positioned highly active enhancers adjacent to the promoter of the NR4A3 gene or the closely related NR4A2 gene, resulting in their aberrant overexpression. Transcriptome analyses revealed several distinct subgroups of AcCC tumors, including a subgroup that overexpressed both NR4A3 and MSANTD3. A poor survival subset of the tumors with high-grade transformation expressed NR4A3 and POMC as well as MYB, an oncogene that is the major driver in a different type of salivary gland tumor, adenoid cystic carcinoma. The combination of NR4A3 and MYB showed cooperativity in regulating a distinct set of genes. In addition, the ligand binding domain of NR4A3 directly bound the Myb DNA binding domain. Transformation assays indicated that, while overexpressed NR4A3 was sufficient to generate transformed colonies, the combination of NR4A3 plus Myb was more potent, leading to anchorage-independent growth and increased cellular invasiveness. The results confirm that NR4A3 and NR4A2 are the main driver genes of AcCC and suggest that concurrent overexpression of NR4A3 and MYB defines a subset of AcCC patients with high-grade transformation that display exceptionally poor outcome.

17.
Trends Genet ; 36(11): 857-867, 2020 11.
Article in English | MEDLINE | ID: mdl-32773169

ABSTRACT

One of the forerunners that pioneered the revolution of high-throughput genomic technologies is the genotyping microarray technology, which can genotype millions of single-nucleotide variants simultaneously. Owing to apparent benefits, such as high speed, low cost, and high throughput, the genotyping array has gained lasting applications in genome-wide association studies (GWAS) and thus accumulated an enormous amount of data. Empowered by continuous manufactural upgrades and analytical innovation, unconventional applications of genotyping array data have emerged to address more diverse genetic problems, holding promise of boosting genetic research into human diseases through the re-mining of the rich accumulated data. Here, we review several unconventional genotyping array analysis techniques that have been built on the idea of large-scale multivariant analysis and provide empirical application examples. These unconventional outcomes of genotyping arrays include polygenic score, runs of homozygosity (ROH)/heterozygosity ratio, distant pedigree computation, and mitochondrial DNA (mtDNA) copy number inference.


Subject(s)
Computational Biology/methods , Genome-Wide Association Study , Genome , Genotyping Techniques/methods , Oligonucleotide Array Sequence Analysis/methods , Polymorphism, Single Nucleotide , Animals , Genomics , Genotype , Humans
18.
PLoS Comput Biol ; 16(6): e1007968, 2020 06.
Article in English | MEDLINE | ID: mdl-32511223

ABSTRACT

Very short tandem repeats bear substantial genetic, evolutional, and pathological significance in genome analyses. Here, we compiled a census of tandem mono-nucleotide/di-nucleotide/tri-nucleotide repeats (MNRs/DNRs/TNRs) in GRCh38, which we term "polytracts" in general. Of the human genome, 144.4 million nucleotides (4.7%) are occupied by polytracts, and 0.47 million single nucleotides are identified as polytract hinges, i.e., break-points of tandem polytracts. Preliminary exploration of the census suggested polytract hinge sites and boundaries of AAC polytracts may bear a higher mapping error rate than other polytract regions. Further, we revealed landscapes of polytract enrichment with respect to nearly a hundred genomic features. We found MNRs, DNRs, and TNRs displayed noticeable difference in terms of locational enrichment for miscellaneous genomic features, especially RNA editing events. Non-canonical and C-to-U RNA-editing events are enriched inside and/or adjacent to MNRs, while all categories of RNA-editing events are under-represented in DNRs. A-to-I RNA-editing events are generally under-represented in polytracts. The selective enrichment of non-canonical RNA-editing events within MNR adjacency provides a negative evidence against their authenticity. To enable similar locational enrichment analyses in relation to polytracts, we developed a software Polytrap which can handle 11 reference genomes. Additionally, we compiled polytracts of four model organisms into a Track Hub which can be integrated into USCS Genome Browser as an official track for convenient visualization of polytracts.


Subject(s)
DNA/genetics , Genome, Human , Microsatellite Repeats/genetics , RNA/genetics , Humans , RNA Editing , Software
19.
Front Genet ; 11: 162, 2020.
Article in English | MEDLINE | ID: mdl-32161619

ABSTRACT

[This corrects the article DOI: 10.3389/fgene.2019.00211.].

20.
Bioinformatics ; 36(9): 2899-2901, 2020 05 01.
Article in English | MEDLINE | ID: mdl-31930398

ABSTRACT

MOTIVATION: Genome annotation is an important step for all in-depth bioinformatics analysis. It is imperative to augment quantity and diversity of genome-wide annotation data for the latest reference genome to promote its adoption by ongoing and future impactful studies. RESULTS: We developed a python toolkit AnnoGen, which at the first time, allows the annotation of three pragmatic genomic features for the GRCh38 genome in enormous base-wise quantities. The three features are chemical binding Energy, sequence information Entropy and Homology Score. The Homology Score is an exceptional feature that captures the genome-wide homology through single-base-offset tiling windows of 100 continual nucleotide bases. AnnoGen is capable of annotating the proprietary pragmatic features for variable user-interested genomic regions and optionally comparing two parallel sets of genomic regions. AnnoGen is characterized with simple utility modes and succinct HTML report of informative statistical tables and plots. AVAILABILITY AND IMPLEMENTATION: https://github.com/shengqh/annogen.


Subject(s)
Genome , Software , Computational Biology , Genomics
SELECTION OF CITATIONS
SEARCH DETAIL
...