1.
bioRxiv ; 2024 Feb 27.
Article in English | MEDLINE | ID: mdl-38464325

ABSTRACT

Prediction of RNA structure from sequence remains an unsolved problem, and progress has been slowed by a paucity of experimental data. Here, we present Ribonanza, a dataset of chemical mapping measurements on two million diverse RNA sequences collected through Eterna and other crowdsourced initiatives. Ribonanza measurements enabled solicitation, training, and prospective evaluation of diverse deep neural networks through a Kaggle challenge, followed by distillation into a single, self-contained model called RibonanzaNet. When fine-tuned on auxiliary datasets, RibonanzaNet achieves state-of-the-art performance in modeling experimental sequence dropout, RNA hydrolytic degradation, and RNA secondary structure, with implications for modeling RNA tertiary structure.

2.
bioRxiv ; 2024 Jan 16.
Article in English | MEDLINE | ID: mdl-38260323

ABSTRACT

Designing single molecules that compute general functions of input molecular partners represents a major unsolved challenge in molecular design. Here, we demonstrate that high-throughput, iterative experimental testing of diverse RNA designs crowdsourced from Eterna yields sensors of increasingly complex functions of input oligonucleotide concentrations. After designing single-input RNA sensors with activation ratios beyond our detection limits, we created logic gates, including challenging XOR and XNOR gates, and sensors that respond to the ratio of two inputs. Finally, we describe the OpenTB challenge, which elicited 85-nucleotide sensors that compute a score for diagnosing active tuberculosis, based on the ratio of products of three gene segments. Building on OpenTB design strategies, we created an algorithm, Nucleologic, that produces similarly compact sensors for the three-gene score based on RNA and DNA. These results open new avenues for diverse applications of compact, single-molecule sensors previously limited by design complexity.

3.
PLoS Comput Biol ; 18(6): e1010271, 2022 06.
Article in English | MEDLINE | ID: mdl-35759518

ABSTRACT

While deep learning models have seen increasing applications in protein science, few have been implemented for protein backbone generation, an important task in structure-based problems such as active site and interface design. We present a new approach to building class-specific backbones, using a variational auto-encoder to directly generate the 3D coordinates of immunoglobulins. Our model is torsion- and distance-aware, learns a high-resolution embedding of the dataset, and generates novel, high-quality structures compatible with existing design tools. We show that the Ig-VAE can be used with Rosetta to create a computational model of a SARS-CoV2-RBD binder via latent space sampling. We further demonstrate that the model's generative prior is a powerful tool for guiding computational protein design, motivating a new paradigm under which backbone design is solved as a constrained optimization problem in the latent space of a generative model.


Subject(s)
COVID-19 , RNA, Viral , Humans , Immunoglobulins , Proteins/chemistry , SARS-CoV-2
4.
Nat Commun ; 13(1): 1536, 2022 03 22.
Article in English | MEDLINE | ID: mdl-35318324

ABSTRACT

Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop an RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that highly structured "superfolder" mRNAs can be designed to improve both stability and expression with further enhancement through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.


Subject(s)
COVID-19 , RNA , COVID-19/therapy , Humans , Pseudouridine/metabolism , RNA Stability/genetics , RNA, Messenger/metabolism
5.
FEBS J ; 289(12): 3505-3520, 2022 06.
Article in English | MEDLINE | ID: mdl-35030303

ABSTRACT

Staphylococcus aureus expresses several hemolytic pore-forming toxins (PFTs), which are all commonly composed of three domains: cap, rim and stem. PFTs are expressed as soluble monomers and assemble to form a transmembrane β-barrel pore in the erythrocyte cell membrane. The stem domain undergoes dramatic conformational changes to form a pore. Staphylococcal PFTs are classified into two groups: one-component α-hemolysin (α-HL) and two-component γ-hemolysin (γ-HL). The α-HL forms a homo-heptamer, whereas γ-HL is an octamer composed of F-component (LukF) and S-component (Hlg2). Because PFTs are used as materials for nanopore-based sensors, knowledge of the functional properties of PFTs is used to develop new, engineered PFTs. However, it remains challenging to design PFTs with a β-barrel pore because their formation as transmembrane protein assemblies requires large conformational changes. In the present study, aiming to investigate the design principles of the β-barrel formed as a consequence of the conformational change, chimeric mutants composed of the cap/rim domains of α-HL and the stem of LukF or Hlg2 were prepared. Biochemical characterization and electron microscopy showed that one of them assembles as a heptameric one-component PFT, whereas another participates as both a heptameric one- and heptameric/octameric two-component PFT. All chimeric mutants intrinsically assemble into SDS-resistant oligomers. Based on these observations, the role of the stem domain of these PFTs is discussed. These findings provide clues for the engineering of staphylococcal PFT β-barrels for use in further promising applications.


Subject(s)
Bacterial Toxins , Hemolysin Proteins , Bacterial Toxins/metabolism , Hemolysin Proteins/metabolism , Hemolysis , Leukocidins/chemistry , Leukocidins/metabolism , Staphylococcus aureus/genetics , Staphylococcus aureus/metabolism
7.
Nucleic Acids Res ; 49(18): 10604-10617, 2021 10 11.
Article in English | MEDLINE | ID: mdl-34520542

ABSTRACT

RNA hydrolysis presents problems in manufacturing, long-term storage, world-wide delivery and in vivo stability of messenger RNA (mRNA)-based vaccines and therapeutics. A largely unexplored strategy to reduce mRNA hydrolysis is to redesign RNAs to form double-stranded regions, which are protected from in-line cleavage and enzymatic degradation, while coding for the same proteins. The amount of stabilization that this strategy can deliver and the most effective algorithmic approach to achieve stabilization remain poorly understood. Here, we present simple calculations for estimating RNA stability against hydrolysis, and a model that links the average unpaired probability of an mRNA, or AUP, to its overall hydrolysis rate. To characterize the stabilization achievable through structure design, we compare AUP optimization by conventional mRNA design methods to results from more computationally sophisticated algorithms and crowdsourcing through the OpenVaccine challenge on the Eterna platform. We find that rational design on Eterna and the more sophisticated algorithms lead to constructs with low AUP, which we term 'superfolder' mRNAs. These designs exhibit a wide diversity of sequence and structure features that may be desirable for translation, biophysical size, and immunogenicity. Furthermore, their folding is robust to temperature, computer modeling method, choice of flanking untranslated regions, and changes in target protein sequence, as illustrated by rapid redesign of superfolder mRNAs for B.1.351, P.1 and B.1.1.7 variants of the prefusion-stabilized SARS-CoV-2 spike protein. Increases in in vitro mRNA half-life by at least two-fold appear immediately achievable.


Subject(s)
Algorithms , RNA, Double-Stranded/chemistry , RNA, Messenger/chemistry , RNA, Viral/chemistry , SARS-CoV-2/genetics , Spike Glycoprotein, Coronavirus/genetics , Base Pairing , Base Sequence , COVID-19/prevention & control , Humans , Hydrolysis , RNA Stability , RNA, Double-Stranded/genetics , RNA, Double-Stranded/immunology , RNA, Messenger/genetics , RNA, Messenger/immunology , RNA, Viral/genetics , RNA, Viral/immunology , SARS-CoV-2/immunology , Spike Glycoprotein, Coronavirus/immunology , Thermodynamics
8.
bioRxiv ; 2021 Mar 30.
Article in English | MEDLINE | ID: mdl-33821271

ABSTRACT

Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop a new RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that "superfolder" mRNAs can be designed to improve both stability and expression that are further enhanced through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.

9.
bioRxiv ; 2021 Feb 19.
Article in English | MEDLINE | ID: mdl-32869022

ABSTRACT

RNA hydrolysis presents problems in manufacturing, long-term storage, world-wide delivery, and in vivo stability of messenger RNA (mRNA)-based vaccines and therapeutics. A largely unexplored strategy to reduce mRNA hydrolysis is to redesign RNAs to form double-stranded regions, which are protected from in-line cleavage and enzymatic degradation, while coding for the same proteins. The amount of stabilization that this strategy can deliver and the most effective algorithmic approach to achieve stabilization remain poorly understood. Here, we present simple calculations for estimating RNA stability against hydrolysis, and a model that links the average unpaired probability of an mRNA, or AUP, to its overall hydrolysis rate. To characterize the stabilization achievable through structure design, we compare AUP optimization by conventional mRNA design methods to results from more computationally sophisticated algorithms and crowdsourcing through the OpenVaccine challenge on the Eterna platform. These computational tests were carried out on both model mRNAs and COVID-19 mRNA vaccine candidates. We find that rational design on Eterna and the more sophisticated algorithms lead to constructs with low AUP, which we term 'superfolder' mRNAs. These designs exhibit wide diversity of sequence and structure features that may be desirable for translation, biophysical size, and immunogenicity, and their folding is robust to temperature, choice of flanking untranslated regions, and changes in target protein sequence, as illustrated by rapid redesign of superfolder mRNAs for B.1.351, P.1, and B.1.1.7 variants of the prefusion-stabilized SARS-CoV-2 spike protein. Increases in in vitro mRNA half-life by at least two-fold appear immediately achievable.

10.
Nat Commun ; 10(1): 4121, 2019 09 11.
Article in English | MEDLINE | ID: mdl-31511508

ABSTRACT

The functionality of most secreted proteins depends on their assembly into a defined quaternary structure. Despite this, it remains unclear how cells discriminate unassembled proteins en route to the native state from misfolded ones that need to be degraded. Here we show how chaperones can regulate and control assembly of heterodimeric proteins, using interleukin 23 (IL-23) as a model. We find that the IL-23 α-subunit remains partially unstructured until assembly with its β-subunit occurs and identify a major site of incomplete folding. Incomplete folding is recognized by different chaperones along the secretory pathway, realizing reliable assembly control by sequential checkpoints. Structural optimization of the chaperone recognition site allows it to bypass quality control checkpoints and provides a secretion-competent IL-23α subunit, which can still form functional heterodimeric IL-23. Thus, locally-restricted incomplete folding within single-domain proteins can be used to regulate and control their assembly.


Subject(s)
Interleukin-23/metabolism , Molecular Chaperones/metabolism , Animals , COS Cells , Chlorocebus aethiops , Cysteine/metabolism , Endoplasmic Reticulum/metabolism , Half-Life , Humans , Interleukin-23/chemistry , Models, Biological , Protein Folding , Protein Stability , Protein Structure, Secondary