Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
Mais filtros











Base de dados
Intervalo de ano de publicação
1.
bioRxiv ; 2024 Jun 11.
Artigo em Inglês | MEDLINE | ID: mdl-38464325

RESUMO

Prediction of RNA structure from sequence remains an unsolved problem, and progress has been slowed by a paucity of experimental data. Here, we present Ribonanza, a dataset of chemical mapping measurements on two million diverse RNA sequences collected through Eterna and other crowdsourced initiatives. Ribonanza measurements enabled solicitation, training, and prospective evaluation of diverse deep neural networks through a Kaggle challenge, followed by distillation into a single, self-contained model called RibonanzaNet. When fine tuned on auxiliary datasets, RibonanzaNet achieves state-of-the-art performance in modeling experimental sequence dropout, RNA hydrolytic degradation, and RNA secondary structure, with implications for modeling RNA tertiary structure.

2.
bioRxiv ; 2024 Jan 16.
Artigo em Inglês | MEDLINE | ID: mdl-38260323

RESUMO

Designing single molecules that compute general functions of input molecular partners represents a major unsolved challenge in molecular design. Here, we demonstrate that high-throughput, iterative experimental testing of diverse RNA designs crowdsourced from Eterna yields sensors of increasingly complex functions of input oligonucleotide concentrations. After designing single-input RNA sensors with activation ratios beyond our detection limits, we created logic gates, including challenging XOR and XNOR gates, and sensors that respond to the ratio of two inputs. Finally, we describe the OpenTB challenge, which elicited 85-nucleotide sensors that compute a score for diagnosing active tuberculosis, based on the ratio of products of three gene segments. Building on OpenTB design strategies, we created an algorithm Nucleologic that produces similarly compact sensors for the three-gene score based on RNA and DNA. These results open new avenues for diverse applications of compact, single molecule sensors previously limited by design complexity.

3.
Nat Mach Intell ; 4(12): 1174-1184, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36567960

RESUMO

Medicines based on messenger RNA (mRNA) hold immense potential, as evidenced by their rapid deployment as COVID-19 vaccines. However, worldwide distribution of mRNA molecules has been limited by their thermostability, which is fundamentally limited by the intrinsic instability of RNA molecules to a chemical degradation reaction called in-line hydrolysis. Predicting the degradation of an RNA molecule is a key task in designing more stable RNA-based therapeutics. Here, we describe a crowdsourced machine learning competition ('Stanford OpenVaccine') on Kaggle, involving single-nucleotide resolution measurements on 6,043 diverse 102-130-nucleotide RNA constructs that were themselves solicited through crowdsourcing on the RNA design platform Eterna. The entire experiment was completed in less than 6 months, and 41% of nucleotide-level predictions from the winning model were within experimental error of the ground truth measurement. Furthermore, these models generalized to blindly predicting orthogonal degradation data on much longer mRNA molecules (504-1,588 nucleotides) with improved accuracy compared with previously published models. These results indicate that such models can represent in-line hydrolysis with excellent accuracy, supporting their use for designing stabilized messenger RNAs. The integration of two crowdsourcing platforms, one for dataset creation and another for machine learning, may be fruitful for other urgent problems that demand scientific discovery on rapid timescales.

4.
Nat Commun ; 13(1): 1536, 2022 03 22.
Artigo em Inglês | MEDLINE | ID: mdl-35318324

RESUMO

Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop an RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that highly structured "superfolder" mRNAs can be designed to improve both stability and expression with further enhancement through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.


Assuntos
COVID-19 , RNA , COVID-19/terapia , Humanos , Pseudouridina/metabolismo , Estabilidade de RNA/genética , RNA Mensageiro/metabolismo
5.
ArXiv ; 2021 Oct 14.
Artigo em Inglês | MEDLINE | ID: mdl-34671698

RESUMO

Messenger RNA-based medicines hold immense potential, as evidenced by their rapid deployment as COVID-19 vaccines. However, worldwide distribution of mRNA molecules has been limited by their thermostability, which is fundamentally limited by the intrinsic instability of RNA molecules to a chemical degradation reaction called in-line hydrolysis. Predicting the degradation of an RNA molecule is a key task in designing more stable RNA-based therapeutics. Here, we describe a crowdsourced machine learning competition ("Stanford OpenVaccine") on Kaggle, involving single-nucleotide resolution measurements on 6043 102-130-nucleotide diverse RNA constructs that were themselves solicited through crowdsourcing on the RNA design platform Eterna. The entire experiment was completed in less than 6 months, and 41% of nucleotide-level predictions from the winning model were within experimental error of the ground truth measurement. Furthermore, these models generalized to blindly predicting orthogonal degradation data on much longer mRNA molecules (504-1588 nucleotides) with improved accuracy compared to previously published models. Top teams integrated natural language processing architectures and data augmentation techniques with predictions from previous dynamic programming models for RNA secondary structure. These results indicate that such models are capable of representing in-line hydrolysis with excellent accuracy, supporting their use for designing stabilized messenger RNAs. The integration of two crowdsourcing platforms, one for data set creation and another for machine learning, may be fruitful for other urgent problems that demand scientific discovery on rapid timescales.

7.
Nucleic Acids Res ; 49(18): 10604-10617, 2021 10 11.
Artigo em Inglês | MEDLINE | ID: mdl-34520542

RESUMO

RNA hydrolysis presents problems in manufacturing, long-term storage, world-wide delivery and in vivo stability of messenger RNA (mRNA)-based vaccines and therapeutics. A largely unexplored strategy to reduce mRNA hydrolysis is to redesign RNAs to form double-stranded regions, which are protected from in-line cleavage and enzymatic degradation, while coding for the same proteins. The amount of stabilization that this strategy can deliver and the most effective algorithmic approach to achieve stabilization remain poorly understood. Here, we present simple calculations for estimating RNA stability against hydrolysis, and a model that links the average unpaired probability of an mRNA, or AUP, to its overall hydrolysis rate. To characterize the stabilization achievable through structure design, we compare AUP optimization by conventional mRNA design methods to results from more computationally sophisticated algorithms and crowdsourcing through the OpenVaccine challenge on the Eterna platform. We find that rational design on Eterna and the more sophisticated algorithms lead to constructs with low AUP, which we term 'superfolder' mRNAs. These designs exhibit a wide diversity of sequence and structure features that may be desirable for translation, biophysical size, and immunogenicity. Furthermore, their folding is robust to temperature, computer modeling method, choice of flanking untranslated regions, and changes in target protein sequence, as illustrated by rapid redesign of superfolder mRNAs for B.1.351, P.1 and B.1.1.7 variants of the prefusion-stabilized SARS-CoV-2 spike protein. Increases in in vitro mRNA half-life by at least two-fold appear immediately achievable.


Assuntos
Algoritmos , RNA de Cadeia Dupla/química , RNA Mensageiro/química , RNA Viral/química , SARS-CoV-2/genética , Glicoproteína da Espícula de Coronavírus/genética , Pareamento de Bases , Sequência de Bases , COVID-19/prevenção & controle , Humanos , Hidrólise , Estabilidade de RNA , RNA de Cadeia Dupla/genética , RNA de Cadeia Dupla/imunologia , RNA Mensageiro/genética , RNA Mensageiro/imunologia , RNA Viral/genética , RNA Viral/imunologia , SARS-CoV-2/imunologia , Glicoproteína da Espícula de Coronavírus/imunologia , Termodinâmica
8.
bioRxiv ; 2021 Mar 30.
Artigo em Inglês | MEDLINE | ID: mdl-33821271

RESUMO

Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop a new RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that "superfolder" mRNAs can be designed to improve both stability and expression that are further enhanced through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.

9.
bioRxiv ; 2021 Feb 19.
Artigo em Inglês | MEDLINE | ID: mdl-32869022

RESUMO

RNA hydrolysis presents problems in manufacturing, long-term storage, world-wide delivery, and in vivo stability of messenger RNA (mRNA)-based vaccines and therapeutics. A largely unexplored strategy to reduce mRNA hydrolysis is to redesign RNAs to form double-stranded regions, which are protected from in-line cleavage and enzymatic degradation, while coding for the same proteins. The amount of stabilization that this strategy can deliver and the most effective algorithmic approach to achieve stabilization remain poorly understood. Here, we present simple calculations for estimating RNA stability against hydrolysis, and a model that links the average unpaired probability of an mRNA, or AUP, to its overall hydrolysis rate. To characterize the stabilization achievable through structure design, we compare AUP optimization by conventional mRNA design methods to results from more computationally sophisticated algorithms and crowdsourcing through the OpenVaccine challenge on the Eterna platform. These computational tests were carried out on both model mRNAs and COVID-19 mRNA vaccine candidates. We find that rational design on Eterna and the more sophisticated algorithms lead to constructs with low AUP, which we term 'superfolder' mRNAs. These designs exhibit wide diversity of sequence and structure features that may be desirable for translation, biophysical size, and immunogenicity, and their folding is robust to temperature, choice of flanking untranslated regions, and changes in target protein sequence, as illustrated by rapid redesign of superfolder mRNAs for B.1.351, P.1, and B.1.1.7 variants of the prefusion-stabilized SARS-CoV-2 spike protein. Increases in in vitro mRNA half-life by at least two-fold appear immediately achievable.

10.
Educ Prim Care ; 29(6): 327-335, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30346250

RESUMO

The recent rise to prominence of healthcare leadership worldwide has prompted those involved in medical education to consider how to facilitate learning to lead effectively. Research has focused on formal curriculum activities. Curricular theory suggests that trainee doctors may also learn through the informal curriculum but there is a lack of medical education literature on this. We aimed to explore how GP trainees learn about leadership in their GP training practices. Epistemologically grounded in social constructionism, this research involved 15 semi-structured interviews with GP trainees about to complete their training. Interviews were conducted using an online video conferencing method, audio-taped, transcribed and analysed using thematic framework analysis. We identified three learning processes contributing to leadership development; evaluating leadership, formulating views on leadership and constructing a personal leadership identity. Other factors operating within the informal curriculum included leadership terminology, and the quality of relationships and networks. Paradoxically, a role model's fallibility could positively influence leadership learning. Based on our findings, we present a model for the informal leadership learning process. This may enhance the facilitation of leadership learning by trainers and the wider clinical team, and positively influence the delivery and content of formal leadership courses.


Assuntos
Medicina Geral/educação , Liderança , Estudantes de Medicina , Currículo , Humanos , Relações Interpessoais , Entrevistas como Assunto , Pesquisa Qualitativa
17.
Bioinformatics ; 25(20): 2730-1, 2009 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-19654113

RESUMO

UNLABELLED: Experimental techniques that survey an entire genome demand flexible, highly interactive visualization tools that can display new data alongside foundation datasets, such as reference gene annotations. The Integrated Genome Browser (IGB) aims to meet this need. IGB is an open source, desktop graphical display tool implemented in Java that supports real-time zooming and panning through a genome; layout of genomic features and datasets in moveable, adjustable tiers; incremental or genome-scale data loading from remote web servers or local files; and dynamic manipulation of quantitative data via genome graphs. AVAILABILITY: The application and source code are available from http://igb.bioviz.org and http://genoviz.sourceforge.net.


Assuntos
Biologia Computacional/métodos , Genoma , Genômica/métodos , Armazenamento e Recuperação da Informação/métodos , Software , Animais , Bases de Dados Genéticas , Humanos
18.
BMC Bioinformatics ; 10: 266, 2009 Aug 25.
Artigo em Inglês | MEDLINE | ID: mdl-19706180

RESUMO

BACKGROUND: Visualization software can expose previously undiscovered patterns in genomic data and advance biological science. RESULTS: The Genoviz Software Development Kit (SDK) is an open source, Java-based framework designed for rapid assembly of visualization software applications for genomics. The Genoviz SDK framework provides a mechanism for incorporating adaptive, dynamic zooming into applications, a desirable feature of genome viewers. Visualization capabilities of the Genoviz SDK include automated layout of features along genetic or genomic axes; support for user interactions with graphical elements (Glyphs) in a map; a variety of Glyph sub-classes that promote experimentation with new ways of representing data in graphical formats; and support for adaptive, semantic zooming, whereby objects change their appearance depending on zoom level and zooming rate adapts to the current scale. Freely available demonstration and production quality applications, including the Integrated Genome Browser, illustrate Genoviz SDK capabilities. CONCLUSION: Separation between graphics components and genomic data models makes it easy for developers to add visualization capability to pre-existing applications or build new applications using third-party data models. Source code, documentation, sample applications, and tutorials are available at http://genoviz.sourceforge.net/.


Assuntos
Genômica/métodos , Interpretação de Imagem Assistida por Computador/métodos , Linguagens de Programação , Software , Gráficos por Computador , Bases de Dados Factuais , Armazenamento e Recuperação da Informação/métodos , Interface Usuário-Computador
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA