Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
Add more filters










Database
Language
Publication year range
1.
Alzheimers Dement ; 2024 Jun 26.
Article in English | MEDLINE | ID: mdl-38923692

ABSTRACT

INTRODUCTION: Variants of uncertain significance (VUS) surged with affordable genetic testing, posing challenges for determining pathogenicity. We examine the pathogenicity of a novel VUS P93S in Annexin A11 (ANXA11) - an amyotrophic lateral sclerosis/frontotemporal dementia-associated gene - in a corticobasal syndrome kindred. Established ANXA11 mutations cause ANXA11 aggregation, altered lysosomal-RNA granule co-trafficking, and transactive response DNA binding protein of 43 kDa (TDP-43) mis-localization. METHODS: We described the clinical presentation and explored the phenotypic diversity of ANXA11 variants. P93S's effect on ANXA11 function and TDP-43 biology was characterized in induced pluripotent stem cell-derived neurons alongside multiomic neuronal and microglial profiling. RESULTS: ANXA11 mutations were linked to corticobasal syndrome cases. P93S led to decreased lysosome colocalization, neuritic RNA, and nuclear TDP-43 with cryptic exon expression. Multiomic microglial signatures implicated immune dysregulation and interferon signaling pathways. DISCUSSION: This study establishes ANXA11 P93S pathogenicity, broadens the phenotypic spectrum of ANXA11 mutations, underscores neuronal and microglial dysfunction in ANXA11 pathophysiology, and demonstrates the potential of cellular models to determine variant pathogenicity. HIGHLIGHTS: ANXA11 P93S is a pathogenic variant. Corticobasal syndrome is part of the ANXA11 phenotypic spectrum. Hybridization chain reaction fluorescence in situ hybridization (HCR FISH) is a new tool for the detection of cryptic exons due to TDP-43-related loss of splicing regulation. Microglial ANXA11 and related immune pathways are important drivers of disease. Cellular models are powerful tools for adjudicating variants of uncertain significance.

2.
Res Sq ; 2024 Jan 04.
Article in English | MEDLINE | ID: mdl-38260496

ABSTRACT

Identifying causal mutations accelerates genetic disease diagnosis, and therapeutic development. Missense variants present a bottleneck in genetic diagnoses as their effects are less straightforward than truncations or nonsense mutations. While computational prediction methods are increasingly successful at prediction for variants in known disease genes, they do not generalize well to other genes as the scores are not calibrated across the proteome1-6. To address this, we developed a deep generative model, popEVE, that combines evolutionary information with population sequence data7 and achieves state-of-the-art performance at ranking variants by severity to distinguish patients with severe developmental disorders8 from potentially healthy individuals9. popEVE identifies 442 genes in patients this developmental disorder cohort, including evidence of 123 novel genetic disorders, many without the need for gene-level enrichment and without overestimating the prevalence of pathogenic variants in the population. A majority of these variants are close to interacting partners in 3D complexes. Preliminary analyses on child exomes indicate that popEVE can identify candidate variants without the need for inheritance labels. By placing variants on a unified scale, our model offers a comprehensive perspective on the distribution of fitness effects across the entire proteome and the broader human population. popEVE provides compelling evidence for genetic diagnoses even in exceptionally rare single-patient disorders where conventional techniques relying on repeated observations may not be applicable.

3.
medRxiv ; 2023 Nov 28.
Article in English | MEDLINE | ID: mdl-38076790

ABSTRACT

Identifying causal mutations accelerates genetic disease diagnosis, and therapeutic development. Missense variants present a bottleneck in genetic diagnoses as their effects are less straightforward than truncations or nonsense mutations. While computational prediction methods are increasingly successful at prediction for variants in known disease genes, they do not generalize well to other genes as the scores are not calibrated across the proteome. To address this, we developed a deep generative model, popEVE, that combines evolutionary information with population sequence data and achieves state-of-the-art performance at ranking variants by severity to distinguish patients with severe developmental disorders from potentially healthy individuals. popEVE identifies 442 genes in a cohort of developmental disorder cases, including evidence of 119 novel genetic disorders without the need for gene-level enrichment and without overestimating the prevalence of pathogenic variants in the population. By placing variants on a unified scale, our model offers a comprehensive perspective on the distribution of fitness effects across the entire proteome and the broader human population. popEVE provides compelling evidence for genetic diagnoses even in exceptionally rare single-patient disorders where conventional techniques relying on repeated observations may not be applicable. Interactive web viewer and downloads available at pop.evemodel.org.

4.
bioRxiv ; 2023 Dec 08.
Article in English | MEDLINE | ID: mdl-38106144

ABSTRACT

Predicting the effects of mutations in proteins is critical to many applications, from understanding genetic disease to designing novel proteins that can address our most pressing challenges in climate, agriculture and healthcare. Despite a surge in machine learning-based protein models to tackle these questions, an assessment of their respective benefits is challenging due to the use of distinct, often contrived, experimental datasets, and the variable performance of models across different protein families. Addressing these challenges requires scale. To that end we introduce ProteinGym, a large-scale and holistic set of benchmarks specifically designed for protein fitness prediction and design. It encompasses both a broad collection of over 250 standardized deep mutational scanning assays, spanning millions of mutated sequences, as well as curated clinical datasets providing high-quality expert annotations about mutation effects. We devise a robust evaluation framework that combines metrics for both fitness prediction and design, factors in known limitations of the underlying experimental methods, and covers both zero-shot and supervised settings. We report the performance of a diverse set of over 70 high-performing models from various subfields (eg., alignment-based, inverse folding) into a unified benchmark suite. We open source the corresponding codebase, datasets, MSAs, structures, model predictions and develop a user-friendly website that facilitates data access and analysis.

5.
Res Sq ; 2023 Oct 19.
Article in English | MEDLINE | ID: mdl-37886540

ABSTRACT

As genetic testing has become more accessible and affordable, variants of uncertain significance (VUS) are increasingly identified, and determining whether these variants play causal roles in disease is a major challenge. The known disease-associated Annexin A11 (ANXA11) mutations result in ANXA11 aggregation, alterations in lysosomal-RNA granule co-trafficking, and TDP-43 mis-localization and present as amyotrophic lateral sclerosis or frontotemporal dementia. We identified a novel VUS in ANXA11 (P93S) in a kindred with corticobasal syndrome and unique radiographic features that segregated with disease. We then queried neurodegenerative disorder clinic databases to identify the phenotypic spread of ANXA11 mutations. Multi-modal computational analysis of this variant was performed and the effect of this VUS on ANXA11 function and TDP-43 biology was characterized in iPSC-derived neurons. Single-cell sequencing and proteomic analysis of iPSC-derived neurons and microglia were used to determine the multiomic signature of this VUS. Mutations in ANXA11 were found in association with clinically diagnosed corticobasal syndrome, thereby establishing corticobasal syndrome as part of ANXA11 clinical spectrum. In iPSC-derived neurons expressing mutant ANXA11, we found decreased colocalization of lysosomes and decreased neuritic RNA as well as decreased nuclear TDP-43 and increased formation of cryptic exons compared to controls. Multiomic assessment of the P93S variant in iPSC-derived neurons and microglia indicates that the pathogenic omic signature in neurons is modest compared to microglia. Additionally, omic studies reveal that immune dysregulation and interferon signaling pathways in microglia are central to disease. Collectively, these findings identify a new pathogenic variant in ANXA11, expand the range of clinical syndromes caused by ANXA11 mutations, and implicate both neuronal and microglia dysfunction in ANXA11 pathophysiology. This work illustrates the potential for iPSC-derived cellular models to revolutionize the variant annotation process and provides a generalizable approach to determining causality of novel variants across genes.

6.
Nat Commun ; 12(1): 2403, 2021 04 23.
Article in English | MEDLINE | ID: mdl-33893299

ABSTRACT

The ability to design functional sequences and predict effects of variation is central to protein engineering and biotherapeutics. State-of-art computational methods rely on models that leverage evolutionary information but are inadequate for important applications where multiple sequence alignments are not robust. Such applications include the prediction of variant effects of indels, disordered proteins, and the design of proteins such as antibodies due to the highly variable complementarity determining regions. We introduce a deep generative model adapted from natural language processing for prediction and design of diverse functional sequences without the need for alignments. The model performs state-of-art prediction of missense and indel effects and we successfully design and test a diverse 105-nanobody library that shows better expression than a 1000-fold larger synthetic library. Our results demonstrate the power of the alignment-free autoregressive model in generalizing to regions of sequence space traditionally considered beyond the reach of prediction and design.


Subject(s)
Algorithms , Computational Biology/methods , Neural Networks, Computer , Protein Engineering/methods , Amino Acid Sequence , Antibodies/genetics , Antibodies/immunology , Antibodies/metabolism , Antigens/immunology , Genotype , Humans , Mutation , Phenotype , Proteins/genetics , Proteins/immunology , Proteins/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...