Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Biophys J ; 123(1): 101-113, 2024 01 02.
Artigo em Inglês | MEDLINE | ID: mdl-38053335

RESUMO

Intrinsically disordered proteins are characterized by a conformational ensemble. While computational approaches such as molecular dynamics simulations have been used to generate such ensembles, their computational costs can be prohibitive. An alternative approach is to learn from data and train machine-learning models to generate conformational ensembles of disordered proteins. This has been a relatively unexplored approach, and in this work we demonstrate a proof-of-principle approach to do so. Specifically, we devised a two-stage computational pipeline: in the first stage, we employed supervised machine-learning models to predict ensemble-derived two-dimensional (2D) properties of a sequence, given the conformational ensemble of a closely related sequence. In the second stage, we used denoising diffusion models to generate three-dimensional (3D) coarse-grained conformational ensembles, given the two-dimensional predictions outputted by the first stage. We trained our models on a data set of coarse-grained molecular dynamics simulations of thousands of rationally designed synthetic sequences. The accuracy of our 2D and 3D predictions was validated across multiple metrics, and our work demonstrates the applicability of machine-learning techniques to predicting higher-dimensional properties of disordered proteins.


Assuntos
Proteínas Intrinsicamente Desordenadas , Simulação de Dinâmica Molecular , Conformação Proteica , Proteínas Intrinsicamente Desordenadas/metabolismo , Aprendizado de Máquina
2.
Curr Res Struct Biol ; 3: 216-228, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34557680

RESUMO

Intrinsically disordered proteins and protein regions (IDRs) make up around 30% of the human proteome where they play essential roles in dictating and regulating many core biological processes. While IDRs are often studied as isolated domains, in naturally occurring proteins most IDRs are found adjacent to folded domains, where they exist as either N- or C-terminal tails or as linkers connecting two folded domains. Prior work has shown that charge properties of IDRs can influence their conformational behavior, both in isolation and in the context of folded domains. In contrast, the converse scenario is less well-explored: how do the charge properties of folded domains influence IDR conformational behavior? To answer this question, we combined a large-scale structural bioinformatics analysis with all-atom implicit solvent simulations of both rationally designed and naturally occurring proteins. Our results reveal three key takeaways. Firstly, the relative position and accessibility of charged residues across the surface of a folded domain can dictate IDR conformational behavior, overriding expectations based on net surface charge properties. Secondly, naturally occurring proteins possess multiple charge patches that are physically accessible to local IDRs. Finally, even modest changes in the local electrostatic environment of a folded domain can substantially modulate IDR-folded domain interactions. Taken together, our results suggest that folded domain surfaces can act as local determinants of IDR conformational behavior.

3.
Clin Transl Sci ; 14(4): 1578-1589, 2021 07.
Artigo em Inglês | MEDLINE | ID: mdl-33786999

RESUMO

Sepsis is a major cause of mortality among hospitalized patients worldwide. Shorter time to administration of broad-spectrum antibiotics is associated with improved outcomes, but early recognition of sepsis remains a major challenge. In a two-center cohort study with prospective sample collection from 1400 adult patients in emergency departments suspected of sepsis, we sought to determine the diagnostic and prognostic capabilities of a machine-learning algorithm based on clinical data and a set of uncommonly measured biomarkers. Specifically, we demonstrate that a machine-learning model developed using this dataset outputs a score with not only diagnostic capability but also prognostic power with respect to hospital length of stay (LOS), 30-day mortality, and 3-day inpatient re-admission both in our entire testing cohort and various subpopulations. The area under the receiver operating curve (AUROC) for diagnosis of sepsis was 0.83. Predicted risk scores for patients with septic shock were higher compared with patients with sepsis but without shock (p < 0.0001). Scores for patients with infection and organ dysfunction were higher compared with those without either condition (p < 0.0001). Stratification based on predicted scores of the patients into low, medium, and high-risk groups showed significant differences in LOS (p < 0.0001), 30-day mortality (p < 0.0001), and 30-day inpatient readmission (p < 0.0001). In conclusion, a machine-learning algorithm based on electronic medical record (EMR) data and three nonroutinely measured biomarkers demonstrated good diagnostic and prognostic capability at the time of initial blood culture.


Assuntos
Diagnóstico Precoce , Registros Eletrônicos de Saúde/estatística & dados numéricos , Aprendizado de Máquina , Sepse/diagnóstico , Idoso , Área Sob a Curva , Biomarcadores/sangue , Serviço Hospitalar de Emergência/estatística & dados numéricos , Feminino , Mortalidade Hospitalar , Humanos , Tempo de Internação/estatística & dados numéricos , Masculino , Pessoa de Meia-Idade , Readmissão do Paciente/estatística & dados numéricos , Prognóstico , Estudos Prospectivos , Curva ROC , Sepse/sangue , Sepse/microbiologia , Sepse/mortalidade
5.
Sci Rep ; 7(1): 10800, 2017 09 07.
Artigo em Inglês | MEDLINE | ID: mdl-28883645

RESUMO

Sepsis is a leading cause of death and is the most expensive condition to treat in U.S. hospitals. Despite targeted efforts to automate earlier detection of sepsis, current techniques rely exclusively on using either standard clinical data or novel biomarker measurements. In this study, we apply machine learning techniques to assess the predictive power of combining multiple biomarker measurements from a single blood sample with electronic medical record data (EMR) for the identification of patients in the early to peak phase of sepsis in a large community hospital setting. Combining biomarkers and EMR data achieved an area under the receiver operating characteristic (ROC) curve (AUC) of 0.81, while EMR data alone achieved an AUC of 0.75. Furthermore, a single measurement of six biomarkers (IL-6, nCD64, IL-1ra, PCT, MCP1, and G-CSF) yielded the same predictive power as collecting an additional 16 hours of EMR data(AUC of 0.80), suggesting that the biomarkers may be useful for identifying these patients earlier. Ultimately, supervised learning using a subset of biomarker and EMR data as features may be capable of identifying patients in the early to peak phase of sepsis in a diverse population and may provide a tool for more timely identification and intervention.


Assuntos
Biomarcadores/análise , Técnicas de Apoio para a Decisão , Registros Eletrônicos de Saúde/estatística & dados numéricos , Aprendizado de Máquina , Sepse/diagnóstico , Sepse/patologia , Processamento Eletrônico de Dados/métodos , Humanos , Valor Preditivo dos Testes , Curva ROC , Estados Unidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...