Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 67
Filter
1.
J Biol Chem ; : 107368, 2024 May 13.
Article in English | MEDLINE | ID: mdl-38750793

ABSTRACT

Activating signal co-integrator complex 1 (ASCC1) acts with ASCC-ALKBH3 complex in alkylation damage responses. ASCC1 uniquely combines two evolutionarily ancient domains: nucleotide-binding K-Homology (KH) (associated with regulating splicing, transcriptional, and translation) and two-histidine phosphodiesterase (PDE) (associated with hydrolysis of cyclic nucleotide phosphate bonds). Germline mutations link loss of ASCC1 function to spinal muscular atrophy with congenital bone fractures 2 (SMABF2). Herein analysis of The Cancer Genome Atlas (TCGA) suggests ASCC1 RNA overexpression in certain tumors correlates with poor survival, Signatures 29 and 3 mutations, and genetic instability markers. We determined crystal structures of Alvinella pompejana (Ap) ASCC1 and Human (Hs) PDE domain revealing high resolution details and features conserved over 500 million years of evolution. Extending understanding of the KH domain Gly-X-X-Gly sequence motif, we define a novel structural Helix-Clasp-Helix (HCH) nucleotide binding motif and show ASCC1 sequence-specific binding to CGCG-containing RNA. The V-shaped PDE nucleotide binding channel has two His-Φ-Ser/Thr-Φ (HXT) motifs (Φ being hydrophobic) positioned to initiate cyclic phosphate bond hydrolysis. A conserved atypical active-site histidine torsion angle implies a novel PDE substrate. Flexible active site loop and arginine-rich domain linker appear regulatory. Small angle X-ray scattering (SAXS) revealed aligned KH-PDE RNA binding sites with limited flexibility in solution. Quantitative evolutionary bioinformatic analyses of disease and cancer-associated mutations support implied functional roles for RNA binding, phosphodiesterase activity, and regulation. Collective results inform ASCC1 roles in transactivation and alkylation damage responses, its targeting by structure-based inhibitors, and how ASCC1 mutations may impact inherited disease and cancer.

2.
bioRxiv ; 2024 May 19.
Article in English | MEDLINE | ID: mdl-38798479

ABSTRACT

Continued advances in variant effect prediction are necessary to demonstrate the ability of machine learning methods to accurately determine the clinical impact of variants of unknown significance (VUS). Towards this goal, the ARSA Critical Assessment of Genome Interpretation (CAGI) challenge was designed to characterize progress by utilizing 219 experimentally assayed missense VUS in the Arylsulfa-tase A ( ARSA ) gene to assess the performance of community-submitted predictions of variant functional effects. The challenge involved 15 teams, and evaluated additional predictions from established and recently released models. Notably, a model developed by participants of a genetics and coding bootcamp, trained with standard machine-learning tools in Python, demonstrated superior performance among sub-missions. Furthermore, the study observed that state-of-the-art deep learning methods provided small but statistically significant improvement in predictive performance compared to less elaborate techniques. These findings underscore the utility of variant effect prediction, and the potential for models trained with modest resources to accurately classify VUS in genetic and clinical research.

3.
Hum Genomics ; 18(1): 44, 2024 Apr 29.
Article in English | MEDLINE | ID: mdl-38685113

ABSTRACT

BACKGROUND: A major obstacle faced by families with rare diseases is obtaining a genetic diagnosis. The average "diagnostic odyssey" lasts over five years and causal variants are identified in under 50%, even when capturing variants genome-wide. To aid in the interpretation and prioritization of the vast number of variants detected, computational methods are proliferating. Knowing which tools are most effective remains unclear. To evaluate the performance of computational methods, and to encourage innovation in method development, we designed a Critical Assessment of Genome Interpretation (CAGI) community challenge to place variant prioritization models head-to-head in a real-life clinical diagnostic setting. METHODS: We utilized genome sequencing (GS) data from families sequenced in the Rare Genomes Project (RGP), a direct-to-participant research study on the utility of GS for rare disease diagnosis and gene discovery. Challenge predictors were provided with a dataset of variant calls and phenotype terms from 175 RGP individuals (65 families), including 35 solved training set families with causal variants specified, and 30 unlabeled test set families (14 solved, 16 unsolved). We tasked teams to identify causal variants in as many families as possible. Predictors submitted variant predictions with estimated probability of causal relationship (EPCR) values. Model performance was determined by two metrics, a weighted score based on the rank position of causal variants, and the maximum F-measure, based on precision and recall of causal variants across all EPCR values. RESULTS: Sixteen teams submitted predictions from 52 models, some with manual review incorporated. Top performers recalled causal variants in up to 13 of 14 solved families within the top 5 ranked variants. Newly discovered diagnostic variants were returned to two previously unsolved families following confirmatory RNA sequencing, and two novel disease gene candidates were entered into Matchmaker Exchange. In one example, RNA sequencing demonstrated aberrant splicing due to a deep intronic indel in ASNS, identified in trans with a frameshift variant in an unsolved proband with phenotypes consistent with asparagine synthetase deficiency. CONCLUSIONS: Model methodology and performance was highly variable. Models weighing call quality, allele frequency, predicted deleteriousness, segregation, and phenotype were effective in identifying causal variants, and models open to phenotype expansion and non-coding variants were able to capture more difficult diagnoses and discover new diagnoses. Overall, computational models can significantly aid variant prioritization. For use in diagnostics, detailed review and conservative assessment of prioritized variants against established criteria is needed.


Subject(s)
Rare Diseases , Humans , Rare Diseases/genetics , Rare Diseases/diagnosis , Genome, Human/genetics , Genetic Variation/genetics , Computational Biology/methods , Phenotype
4.
Am J Hum Genet ; 111(3): 487-508, 2024 Mar 07.
Article in English | MEDLINE | ID: mdl-38325380

ABSTRACT

Pathogenic variants in multiple genes on the X chromosome have been implicated in syndromic and non-syndromic intellectual disability disorders. ZFX on Xp22.11 encodes a transcription factor that has been linked to diverse processes including oncogenesis and development, but germline variants have not been characterized in association with disease. Here, we present clinical and molecular characterization of 18 individuals with germline ZFX variants. Exome or genome sequencing revealed 11 variants in 18 subjects (14 males and 4 females) from 16 unrelated families. Four missense variants were identified in 11 subjects, with seven truncation variants in the remaining individuals. Clinical findings included developmental delay/intellectual disability, behavioral abnormalities, hypotonia, and congenital anomalies. Overlapping and recurrent facial features were identified in all subjects, including thickening and medial broadening of eyebrows, variations in the shape of the face, external eye abnormalities, smooth and/or long philtrum, and ear abnormalities. Hyperparathyroidism was found in four families with missense variants, and enrichment of different tumor types was observed. In molecular studies, DNA-binding domain variants elicited differential expression of a small set of target genes relative to wild-type ZFX in cultured cells, suggesting a gain or loss of transcriptional activity. Additionally, a zebrafish model of ZFX loss displayed an altered behavioral phenotype, providing additional evidence for the functional significance of ZFX. Our clinical and experimental data support that variants in ZFX are associated with an X-linked intellectual disability syndrome characterized by a recurrent facial gestalt, neurocognitive and behavioral abnormalities, and an increased risk for congenital anomalies and hyperparathyroidism.


Subject(s)
Hyperparathyroidism , Intellectual Disability , Neurodevelopmental Disorders , Male , Female , Animals , Humans , Intellectual Disability/pathology , Zebrafish/genetics , Mutation, Missense/genetics , Transcription Factors/genetics , Phenotype , Neurodevelopmental Disorders/genetics
5.
Hum Genet ; 2024 Jan 03.
Article in English | MEDLINE | ID: mdl-38170232

ABSTRACT

Variants which disrupt splicing are a frequent cause of rare disease that have been under-ascertained clinically. Accurate and efficient methods to predict a variant's impact on splicing are needed to interpret the growing number of variants of unknown significance (VUS) identified by exome and genome sequencing. Here, we present the results of the CAGI6 Splicing VUS challenge, which invited predictions of the splicing impact of 56 variants ascertained clinically and functionally validated to determine splicing impact. The performance of 12 prediction methods, along with SpliceAI and CADD, was compared on the 56 functionally validated variants. The maximum accuracy achieved was 82% from two different approaches, one weighting SpliceAI scores by minor allele frequency, and one applying the recently published Splicing Prediction Pipeline (SPiP). SPiP performed optimally in terms of sensitivity, while an ensemble method combining multiple prediction tools and information from databases exceeded all others for specificity. Several challenge methods equalled or exceeded the performance of SpliceAI, with ultimate choice of prediction method likely to depend on experimental or clinical aims. One quarter of the variants were incorrectly predicted by at least 50% of the methods, highlighting the need for further improvements to splicing prediction methods for successful clinical application.

6.
Nat Metab ; 5(10): 1673-1684, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37709961

ABSTRACT

The glucagon-like peptide 1 receptor (GLP1R) is a major drug target with several agonists being prescribed in individuals with type 2 diabetes and obesity1,2. The impact of genetic variability of GLP1R on receptor function and its association with metabolic traits are unclear with conflicting reports. Here, we show an unexpected diversity of phenotypes ranging from defective cell surface expression to complete or pathway-specific gain of function (GoF) and loss of function (LoF), after performing a functional profiling of 60 GLP1R variants across four signalling pathways. The defective insulin secretion of GLP1R LoF variants is rescued by allosteric GLP1R ligands or high concentrations of exendin-4/semaglutide in INS-1 823/3 cells. Genetic association studies in 200,000 participants from the UK Biobank show that impaired GLP1R cell surface expression contributes to poor glucose control and increased adiposity with increased glycated haemoglobin A1c and body mass index. This study defines impaired GLP1R cell surface expression as a risk factor for traits associated with type 2 diabetes and obesity and provides potential treatment options for GLP1R LoF variant carriers.


Subject(s)
Blood Glucose , Diabetes Mellitus, Type 2 , Humans , Insulin/metabolism , Diabetes Mellitus, Type 2/genetics , Adiposity/genetics , Obesity/genetics
7.
Res Sq ; 2023 Aug 02.
Article in English | MEDLINE | ID: mdl-37577579

ABSTRACT

In the context of the Critical Assessment of the Genome Interpretation, 6th edition (CAGI6), the Genetics of Neurodevelopmental Disorders Lab in Padua proposed a new ID-challenge to give the opportunity of developing computational methods for predicting patient's phenotype and the causal variants. Eight research teams and 30 models had access to the phenotype details and real genetic data, based on the sequences of 74 genes (VCF format) in 415 pediatric patients affected by Neurodevelopmental Disorders (NDDs). NDDs are clinically and genetically heterogeneous conditions, with onset in infant age. In this study we evaluate the ability and accuracy of computational methods to predict comorbid phenotypes based on clinical features described in each patient and causal variants. Finally, we asked to develop a method to find new possible genetic causes for patients without a genetic diagnosis. As already done for the CAGI5, seven clinical features (ID, ASD, ataxia, epilepsy, microcephaly, macrocephaly, hypotonia), and variants (causative, putative pathogenic and contributing factors) were provided. Considering the overall clinical manifestation of our cohort, we give out the variant data and phenotypic traits of the 150 patients from CAGI5 ID-Challenge as training and validation for the prediction methods development.

8.
medRxiv ; 2023 Aug 04.
Article in English | MEDLINE | ID: mdl-37577678

ABSTRACT

Background: A major obstacle faced by rare disease families is obtaining a genetic diagnosis. The average "diagnostic odyssey" lasts over five years, and causal variants are identified in under 50%. The Rare Genomes Project (RGP) is a direct-to-participant research study on the utility of genome sequencing (GS) for diagnosis and gene discovery. Families are consented for sharing of sequence and phenotype data with researchers, allowing development of a Critical Assessment of Genome Interpretation (CAGI) community challenge, placing variant prioritization models head-to-head in a real-life clinical diagnostic setting. Methods: Predictors were provided a dataset of phenotype terms and variant calls from GS of 175 RGP individuals (65 families), including 35 solved training set families, with causal variants specified, and 30 test set families (14 solved, 16 unsolved). The challenge tasked teams with identifying the causal variants in as many test set families as possible. Ranked variant predictions were submitted with estimated probability of causal relationship (EPCR) values. Model performance was determined by two metrics, a weighted score based on rank position of true positive causal variants and maximum F-measure, based on precision and recall of causal variants across EPCR thresholds. Results: Sixteen teams submitted predictions from 52 models, some with manual review incorporated. Top performing teams recalled the causal variants in up to 13 of 14 solved families by prioritizing high quality variant calls that were rare, predicted deleterious, segregating correctly, and consistent with reported phenotype. In unsolved families, newly discovered diagnostic variants were returned to two families following confirmatory RNA sequencing, and two prioritized novel disease gene candidates were entered into Matchmaker Exchange. In one example, RNA sequencing demonstrated aberrant splicing due to a deep intronic indel in ASNS, identified in trans with a frameshift variant, in an unsolved proband with phenotype overlap with asparagine synthetase deficiency. Conclusions: By objective assessment of variant predictions, we provide insights into current state-of-the-art algorithms and platforms for genome sequencing analysis for rare disease diagnosis and explore areas for future optimization. Identification of diagnostic variants in unsolved families promotes synergy between researchers with clinical and computational expertise as a means of advancing the field of clinical genome interpretation.

9.
Bioinformatics ; 39(8)2023 08 01.
Article in English | MEDLINE | ID: mdl-37522889

ABSTRACT

SUMMARY: In any population under selective pressure, a central challenge is to distinguish the genes that drive adaptation from others which, subject to population variation, harbor many neutral mutations de novo. We recently showed that such genes could be identified by supplementing information on mutational frequency with an evolutionary analysis of the likely functional impact of coding variants. This approach improved the discovery of driver genes in both lab-evolved and environmental Escherichia coli strains. To facilitate general adoption, we now developed ShinyBioHEAT, an R Shiny web-based application that enables identification of phenotype driving gene in two commonly used model bacteria, E.coli and Bacillus subtilis, with no specific computational skill requirements. ShinyBioHEAT not only supports transparent and interactive analysis of lab evolution data in E.coli and B.subtilis, but it also creates dynamic visualizations of mutational impact on protein structures, which add orthogonal checks on predicted drivers. AVAILABILITY AND IMPLEMENTATION: Code for ShinyBioHEAT is available at https://github.com/LichtargeLab/ShinyBioHEAT. The Shiny application is additionally hosted at http://bioheat.lichtargelab.org/.


Subject(s)
Escherichia coli , Mobile Applications , Escherichia coli/genetics , Software , Mutation , Data Interpretation, Statistical , Mutation Rate
10.
Nat Commun ; 14(1): 2765, 2023 05 13.
Article in English | MEDLINE | ID: mdl-37179358

ABSTRACT

The incidence of Alzheimer's Disease in females is almost double that of males. To search for sex-specific gene associations, we build a machine learning approach focused on functionally impactful coding variants. This method can detect differences between sequenced cases and controls in small cohorts. In the Alzheimer's Disease Sequencing Project with mixed sexes, this approach identified genes enriched for immune response pathways. After sex-separation, genes become specifically enriched for stress-response pathways in male and cell-cycle pathways in female. These genes improve disease risk prediction in silico and modulate Drosophila neurodegeneration in vivo. Thus, a general approach for machine learning on functionally impactful variants can uncover sex-specific candidates towards diagnostic biomarkers and therapeutic targets.


Subject(s)
Alzheimer Disease , Sex Factors , Female , Humans , Male , Alzheimer Disease/genetics , Alzheimer Disease/metabolism
11.
Br J Cancer ; 128(11): 2013-2024, 2023 06.
Article in English | MEDLINE | ID: mdl-37012319

ABSTRACT

BACKGROUND: Cisplatin (CDDP) is a mainstay treatment for advanced head and neck squamous cell carcinomas (HNSCC) despite a high frequency of innate and acquired resistance. We hypothesised that tumours acquire CDDP resistance through an enhanced reductive state dependent on metabolic rewiring. METHODS: To validate this model and understand how an adaptive metabolic programme might be imprinted, we performed an integrated analysis of CDDP-resistant HNSCC clones from multiple genomic backgrounds by whole-exome sequencing, RNA-seq, mass spectrometry, steady state and flux metabolomics. RESULTS: Inactivating KEAP1 mutations or reductions in KEAP1 RNA correlated with Nrf2 activation in CDDP-resistant cells, which functionally contributed to resistance. Proteomics identified elevation of downstream Nrf2 targets and the enrichment of enzymes involved in generation of biomass and reducing equivalents, metabolism of glucose, glutathione, NAD(P), and oxoacids. This was accompanied by biochemical and metabolic evidence of an enhanced reductive state dependent on coordinated glucose and glutamine catabolism, associated with reduced energy production and proliferation, despite normal mitochondrial structure and function. CONCLUSIONS: Our analysis identified coordinated metabolic changes associated with CDDP resistance that may provide new therapeutic avenues through targeting of these convergent pathways.


Subject(s)
Antineoplastic Agents , Head and Neck Neoplasms , Humans , Cisplatin/metabolism , Squamous Cell Carcinoma of Head and Neck , Kelch-Like ECH-Associated Protein 1/genetics , NF-E2-Related Factor 2/genetics , Drug Resistance, Neoplasm/genetics , Cell Line, Tumor , Glucose , Antineoplastic Agents/pharmacology
12.
Nat Commun ; 13(1): 3189, 2022 06 09.
Article in English | MEDLINE | ID: mdl-35680894

ABSTRACT

Since antibiotic development lags, we search for potential drug targets through directed evolution experiments. A challenge is that many resistance genes hide in a noisy mutational background as mutator clones emerge in the adaptive population. Here, to overcome this noise, we quantify the impact of mutations through evolutionary action (EA). After sequencing ciprofloxacin or colistin resistance strains grown under different mutational regimes, we find that an elevated sum of the evolutionary action of mutations in a gene identifies known resistance drivers. This EA integration approach also suggests new antibiotic resistance genes which are then shown to provide a fitness advantage in competition experiments. Moreover, EA integration analysis of clinical and environmental isolates of antibiotic resistant of E. coli identifies gene drivers of resistance where a standard approach fails. Together these results inform the genetic basis of de novo colistin resistance and support the robust discovery of phenotype-driving genes via the evolutionary action of genetic perturbations in fitness landscapes.


Subject(s)
Anti-Bacterial Agents , Drug Resistance, Bacterial , Escherichia coli Proteins , Escherichia coli , Anti-Bacterial Agents/pharmacology , Ciprofloxacin/pharmacology , Colistin/pharmacology , Drug Resistance, Bacterial/genetics , Escherichia coli/drug effects , Escherichia coli/genetics , Escherichia coli Proteins/genetics , Microbial Sensitivity Tests , Mutation
13.
Hum Genet ; 141(10): 1549-1577, 2022 Oct.
Article in English | MEDLINE | ID: mdl-35488922

ABSTRACT

Estimating the effects of variants found in disease driver genes opens the door to personalized therapeutic opportunities. Clinical associations and laboratory experiments can only characterize a tiny fraction of all the available variants, leaving the majority as variants of unknown significance (VUS). In silico methods bridge this gap by providing instant estimates on a large scale, most often based on the numerous genetic differences between species. Despite concerns that these methods may lack reliability in individual subjects, their numerous practical applications over cohorts suggest they are already helpful and have a role to play in genome interpretation when used at the proper scale and context. In this review, we aim to gain insights into the training and validation of these variant effect predicting methods and illustrate representative types of experimental and clinical applications. Objective performance assessments using various datasets that are not yet published indicate the strengths and limitations of each method. These show that cautious use of in silico variant impact predictors is essential for addressing genome interpretation challenges.


Subject(s)
Genetic Testing , Genome, Human , Computational Biology , Genetic Variation , Humans , Reproducibility of Results
14.
Genome Res ; 32(5): 916-929, 2022 05.
Article in English | MEDLINE | ID: mdl-35301263

ABSTRACT

Genetic variants drive the evolution of traits and diseases. We previously modeled these variants as small displacements in fitness landscapes and estimated their functional impact by differentiating the evolutionary relationship between genotype and phenotype. Conversely, here we integrate these derivatives to identify genes steering specific traits. Over cancer cohorts, integration identified 460 likely tumor-driving genes. Many have literature and experimental support but had eluded prior genomic searches for positive selection in tumors. Beyond providing cancer insights, these results introduce a general calculus of evolution to quantify the genotype-phenotype relationship and discover genes associated with complex traits and diseases.


Subject(s)
Calculi , Neoplasms , Biological Evolution , Genetic Fitness , Genotype , Humans , Models, Genetic , Neoplasms/genetics , Phenotype , Selection, Genetic
15.
Front Mol Biosci ; 8: 791792, 2021.
Article in English | MEDLINE | ID: mdl-34966786

ABSTRACT

All tumors have DNA mutations, and a predictive understanding of those mutations could inform clinical treatments. However, 40% of the mutations are variants of unknown significance (VUS), with the challenge being to objectively predict whether a VUS is pathogenic and supports the tumor or whether it is benign. To objectively decode VUS, we mapped cancer sequence data and evolutionary trace (ET) scores onto crystallography and cryo-electron microscopy structures with variant impacts quantitated by evolutionary action (EA) measures. As tumors depend on helicases and nucleases to deal with transcription/replication stress, we targeted helicase-nuclease-RPA complexes: (1) XPB-XPD (within TFIIH), XPF-ERCC1, XPG, and RPA for transcription and nucleotide excision repair pathways and (2) BLM, EXO5, and RPA plus DNA2 for stalled replication fork restart. As validation, EA scoring predicts severe effects for most disease mutations, but disease mutants with low ET scores not only are likely destabilizing but also disrupt sophisticated allosteric mechanisms. For sites of disease mutations and VUS predicted to be severe, we found strong co-localization to ordered regions. Rare discrepancies highlighted the different survival requirements between disease and tumor mutations, as well as the value of examining proteins within complexes. In a genome-wide analysis of 33 cancer types, we found correlation between the number of mutations in each tumor and which pathways or functional processes in which the mutations occur, revealing different mutagenic routes to tumorigenesis. We also found upregulation of ancient genes including BLM, which supports a non-random and concerted cancer process: reversion to a unicellular, proliferation-uncontrolled, status by breaking multicellular constraints on cell division. Together, these genes and global analyses challenge the binary "driver" and "passenger" mutation paradigm, support a gradient impact as revealed by EA scoring from moderate to severe at a single gene level, and indicate reduced regulation as well as activity. The objective quantitative assessment of VUS scoring and gene overexpression in the context of functional interactions and pathways provides insights for biology, oncology, and precision medicine.

16.
Methods Enzymol ; 661: 407-431, 2021.
Article in English | MEDLINE | ID: mdl-34776222

ABSTRACT

We present a Chemistry and Structure Screen Integrated Efficiently (CASSIE) approach (named for Greek prophet Cassandra) to design inhibitors for cancer biology and pathogenesis. CASSIE provides an effective path to target master keys to control the repair-replication interface for cancer cells and SARS CoV-2 pathogenesis as exemplified here by specific targeting of Poly(ADP-ribose) glycohydrolase (PARG) and ADP-ribose glycohydrolase ARH3 macrodomains plus SARS CoV-2 nonstructural protein 3 (Nsp3) Macrodomain 1 (Mac1) and Nsp15 nuclease. As opposed to the classical massive effort employing libraries with large numbers of compounds against single proteins, we make inhibitor design for multiple targets efficient. Our compact, chemically diverse, 5000 compound Goldilocks (GL) library has an intermediate number of compounds sized between fragments and drugs with predicted favorable ADME (absorption, distribution, metabolism, and excretion) and toxicological profiles. Amalgamating our core GL library with an approved drug (AD) library, we employ a combined GLAD library virtual screen, enabling an effective and efficient design cycle of ranked computer docking, top hit biophysical and cell validations, and defined bound structures using human proteins or their avatars. As new drug design is increasingly pathway directed as well as molecular and mechanism based, our CASSIE approach facilitates testing multiple related targets by efficiently turning a set of interacting drug discovery problems into a tractable medicinal chemistry engineering problem of optimizing affinity and ADME properties based upon early co-crystal structures. Optimization efforts are made efficient by a computationally-focused iterative chemistry and structure screen. Thus, we herein describe and apply CASSIE to define prototypic, specific inhibitors for PARG vs distinct inhibitors for the related macrodomains of ARH3 and SARS CoV-2 Nsp3 plus the SARS CoV-2 Nsp15 RNA nuclease.


Subject(s)
COVID-19 , Nucleic Acids , Severe Acute Respiratory Syndrome , DNA Repair , Humans , Molecular Docking Simulation , SARS-CoV-2
17.
Cancers (Basel) ; 13(16)2021 Aug 07.
Article in English | MEDLINE | ID: mdl-34439147

ABSTRACT

BACKGROUND: PPARγ (peroxisome proliferator-activated receptor gamma) is involved in the pathology of numerous diseases, including UM and other types of cancer. Emerging evidence suggests that an interaction between PPARγ and DNMTs (DNA methyltransferase) plays a role in cancer that is yet to be defined. METHODS: The configuration of the repeating elements was performed with CAP3 and MAFFT, and the structural modelling was conducted with HDOCK. An evolutionary action scores algorithm was used to identify oncogenic variants. A systematic bioinformatic appraisal of PPARγ and DNMT1 was performed across 29 tumor types and UM available in The Cancer Genome Atlas (TCGA). RESULTS: PPAR-responsive elements (PPREs) enriched with Alu repeats are associated with different genomic regions, particularly the promotor region of DNMT1. PPARγ-DNMT1 co-expression is significantly associated with several cancers. C-terminals of PPARγ and DNMT1 appear to be the potential protein-protein interaction sites where disease-specific mutations may directly impair the respective protein functions. Furthermore, PPARγ expression could be identified as an additional prognostic marker for UM. CONCLUSIONS: We hypothesize that the function of PPARγ requires an additional contribution of Alu repeats which may directly influence the DNMT1 network. Regarding UM, PPARγ appears to be an additional discriminatory prognostic marker, in particular in disomy 3 tumors.

18.
Comput Struct Biotechnol J ; 19: 2307-2317, 2021.
Article in English | MEDLINE | ID: mdl-33995922

ABSTRACT

Our understanding of enzymes with high substrate ambiguity remains limited because their large active sites allow substrate docking freedom to an extent that seems incompatible with stereospecificity. One possibility is that some of these enzymes evolved a set of evolutionarily fitted sequence positions that stringently allow switching substrate ambiguity and chiral specificity. To explore this hypothesis, we targeted for mutation a serine ester hydrolase (EH3) that exhibits an impressive 71-substrate repertoire but is not stereospecific (e.e. 50%). We used structural actions and the computational evolutionary trace method to explore specificity-swapping sequence positions and hypothesized that position I244 was critical. Driven by evolutionary action analysis, this position was substituted to leucine, which together with isoleucine appears to be the amino acid most commonly present in the closest homologous sequences (max. identity, ca. 67.1%), and to phenylalanine, which appears in distant homologues. While the I244L mutation did not have any functional consequences, the I244F mutation allowed the esterase to maintain a remarkable 53-substrate range while gaining stereospecificity properties (e.e. 99.99%). These data support the possibility that some enzymes evolve sequence positions that control the substrate scope and stereospecificity. Such residues, which can be evolutionarily screened, may serve as starting points for further designing substrate-ambiguous, yet chiral-specific, enzymes that are greatly appreciated in biotechnology and synthetic chemistry.

19.
Bioinformatics ; 37(22): 4033-4040, 2021 11 18.
Article in English | MEDLINE | ID: mdl-34043002

ABSTRACT

MOTIVATION: Since the first recognized case of COVID-19, more than 100 million people have been infected worldwide. Global efforts in drug and vaccine development to fight the disease have yielded vaccines and drug candidates to cure COVID-19. However, the spread of SARS-CoV-2 variants threatens the continued efficacy of these treatments. In order to address this, we interrogate the evolutionary history of the entire SARS-CoV-2 proteome to identify evolutionarily conserved functional sites that can inform the search for treatments with broader coverage across the coronavirus family. RESULTS: Combining coronavirus family sequence information with the mutations observed in the current COVID-19 outbreak, we systematically and comprehensively define evolutionarily stable sites that may provide useful drug and vaccine targets and which are less likely to be compromised by the emergence of new virus strains. Several experimentally validated effective drugs interact with these proposed target sites. In addition, the same evolutionary information can prioritize cross reactive antigens that are useful in directing multi-epitope vaccine strategies to illicit broadly neutralizing immune responses to the betacoronavirus family. Although the results are focused on SARS-CoV-2, these approaches stem from evolutionary principles that are agnostic to the organism or infective agent. AVAILABILITY AND IMPLEMENTATION: The results of this work are made interactively available at http://cov.lichtargelab.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
COVID-19 , Viral Vaccines , Humans , SARS-CoV-2/genetics , Proteome , COVID-19 Vaccines , Viral Vaccines/genetics
20.
Sci Transl Med ; 13(594)2021 05 19.
Article in English | MEDLINE | ID: mdl-34011629

ABSTRACT

Genotype-phenotype relationships shape health and population fitness but remain difficult to predict and interpret. Here, we apply an evolutionary action method to de novo missense variants in whole-exome sequences of individuals with autism spectrum disorder (ASD) to unravel genes and pathways connected to ASD. Evolutionary action predicts the impact of missense variants on protein function by measuring the fitness effect based on phylogenetic distances and substitution odds in homologous gene sequences. By examining de novo missense variants in 2384 individuals with ASD (probands) compared to matched siblings without ASD, we found missense variants in 398 genes representing 23 pathways that were biased toward higher evolutionary action scores than expected by random chance; these pathways were involved in axonogenesis, synaptic transmission, and neurodevelopment. The predicted fitness impact of de novo and inherited missense variants in candidate genes correlated with the IQ of individuals with ASD, even for new gene candidates. Taking an evolutionary action method, we detected those missense variants most likely to contribute to ASD pathogenesis and elucidated their phenotypic impact. This approach could be applied to integrate missense variants across a patient cohort to identify genes contributing to a shared phenotype in other complex diseases.


Subject(s)
Autism Spectrum Disorder , Autistic Disorder , Autism Spectrum Disorder/genetics , Autistic Disorder/genetics , Exome/genetics , Genetic Predisposition to Disease , Humans , Phenotype , Phylogeny
SELECTION OF CITATIONS
SEARCH DETAIL
...