Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 18 de 18
Filter
1.
Am J Clin Nutr ; 2024 Jun 07.
Article in English | MEDLINE | ID: mdl-38852855

ABSTRACT

BACKGROUND: The quality of carbohydrate intake, as measured by the glycemic index (GI), has not been evaluated nationally over the past 2 decades in the United States. OBJECTIVES: We aimed to develop a comprehensive and nationally representative dietary GI and glycemic load (GL) database from 1999 to 2018 National Health and Nutrition Examination Survey (NHANES) and to examine GI and GL time trends and subpopulation differences. METHODS: We used an artificial intelligence (AI)-enabled model to match GI values from 2 GI databases to food codes from United States Department of Agriculture, which were manually validated. We examined nationally representative distributions of dietary GI and GL from 1999 to 2018 using the multistage, clustered sampling design of NHANES. RESULTS: Assigned GI values covered 99.9% of total carbohydrate intake. The initial AI accuracy was 75.0%, with 31.3% retained after manual curation guided by substantive domain expertise. A total of 7976 unique food codes were matched to GI values, of which soft drinks and white bread were top contributors to dietary GI and GL. Of the 49,205 NHANES adult participants, the mean dietary GI was 55.7 (95% confidence interval [CI]: 55.5, 55.8) and energy-adjusted dietary GL was 133.0 (95% CI: 132.3, 133.8). From 1999 to 2018, dietary GI and GL decreased by 4.6% and 13.8%, respectively. Dietary GL was higher among females (134.6; 95% CI: 133.8, 135.5) than among males (131.3; 95% CI: 130.3, 132.3), those with ≤high school degree (137.7; 95% CI: 136.8, 138.7) than among those with ≥college degree (126.5; 95% CI: 125.3, 127.7), and those living under the poverty level (140.9; 95% CI: 139.6, 142.1) than among those above the poverty level. Differences in race were observed (Black adults, 139.4; 95% CI: 138.2, 140.7; White adults, 131.6; 95% CI: 130.5, 132.6). CONCLUSIONS: The national GI and GL database facilitates large-scale and high-quality surveillance or cohort studies of diet and health outcomes in the United States.

2.
Drug Discov Today ; 29(4): 103944, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38460570

ABSTRACT

The Allotrope Foundation (AF) started as a group of pharmaceutical companies, instrument, and software vendors that set out to simplify the exchange of data in the laboratory. After a decade of work, they released products that have found adoption in various companies. Most recently, the Allotrope Simple Model (ASM) was developed to speed up and widen the adoption. As a result, the Foundation has recently added chemical companies and, importantly, is reworking its business model to lower the entry barrier for smaller companies. Here, we present the proceedings from the Allotrope Connect Fall 2023 conference and summarize the technical and organizational developments at the Foundation since 2020.


Subject(s)
Commerce , Small Business
3.
J Chem Theory Comput ; 20(1): 199-211, 2024 Jan 09.
Article in English | MEDLINE | ID: mdl-38150692

ABSTRACT

Accurate interatomic energies and forces enable high-quality molecular dynamics simulations, torsion scans, potential energy surface mappings, and geometry optimizations. Machine learning algorithms have enabled rapid estimates of the energies and forces with high accuracy. Further development of machine learning algorithms holds promise for producing universal potentials that support many different atomic species. We present the Transformer Interatomic Potential (TrIP): a chemically sound potential based on the SE(3)-Transformer. TrIP's species-agnostic architecture, which uses continuous atomic representation and homogeneous graph convolutions, encourages parameter sharing between atomic species for more general representations of chemical environments, maintains a reasonable number of parameters, serves as a form of regularization, and is a step toward accurate universal interatomic potentials. TrIP achieves state-of-the-art accuracies on the COMP6 benchmark with an energy prediction of just 1.02 kcal/mol MAE. We introduce physical bias in the form of Ziegler-Biersack-Littmark-screened nuclear repulsion and constrained atomization energies. An energy scan of a water molecule demonstrates that these changes improve long- and near-range interactions compared to other neural network potentials. TrIP also demonstrates stability in molecular dynamics simulations, demonstrating reasonable exploration of Ramachandran space.

4.
Sci Rep ; 13(1): 15493, 2023 09 19.
Article in English | MEDLINE | ID: mdl-37726313

ABSTRACT

Various approaches have used neural networks as probabilistic models for the design of protein sequences. These "inverse folding" models employ different objective functions, which come with trade-offs that have not been assessed in detail before. This study introduces probabilistic definitions of protein stability and conformational specificity and demonstrates the relationship between these chemical properties and the [Formula: see text] Boltzmann probability objective. This links the Boltzmann probability objective function to experimentally verifiable outcomes. We propose a novel sequence decoding algorithm, referred to as "BayesDesign", that leverages Bayes' Rule to maximize the [Formula: see text] objective instead of the [Formula: see text] objective common in inverse folding models. The efficacy of BayesDesign is evaluated in the context of two protein model systems, the NanoLuc enzyme and the WW structural motif. Both BayesDesign and the baseline ProteinMPNN algorithm increase the thermostability of NanoLuc and increase the conformational specificity of WW. The possible sources of error in the model are analyzed.


Subject(s)
Algorithms , Bayes Theorem , Protein Stability , Amino Acid Sequence , Likelihood Functions
5.
J Chem Inf Model ; 62(22): 5342-5350, 2022 11 28.
Article in English | MEDLINE | ID: mdl-36342217

ABSTRACT

Molecular docking tools are regularly used to computationally identify new molecules in virtual screening for drug discovery. However, docking tools suffer from inaccurate scoring functions with widely varying performance on different proteins. To enable more accurate ranking of active over inactive ligands in virtual screening, we created a machine learning consensus docking tool, MILCDock, that uses predictions from five traditional molecular docking tools to predict the probability a ligand binds to a protein. MILCDock was trained and tested on data from both the DUD-E and LIT-PCBA docking datasets and shows improved performance over traditional molecular docking tools and other consensus docking methods on the DUD-E dataset. LIT-PCBA targets proved to be difficult for all methods tested. We also find that DUD-E data, although biased, can be effective in training machine learning tools if care is taken to avoid DUD-E's biases during training.


Subject(s)
Drug Discovery , Machine Learning , Molecular Docking Simulation , Consensus , Ligands , Protein Binding
6.
Acta Crystallogr D Struct Biol ; 78(Pt 8): 936-944, 2022 Aug 01.
Article in English | MEDLINE | ID: mdl-35916219

ABSTRACT

Effective mentoring of undergraduate students is a growing requirement for the promotion of faculty at many universities. It is often challenging for young investigators to define a successful mentoring strategy, partially due to the absence of a broadly accepted definition of what mentoring should entail. To overcome this, an outcome-oriented mentoring framework was developed and used with more than 25 students over three years. It was found that a systematic mentoring approach can help students quickly realize their scientific potential and result in meaningful contributions to science. This report especially shows how the Critical Assessment of Protein Structure Prediction (CASP14) challenge was used to amplify student research efforts. As a result of this challenge, multiple publications, presentations and scholarships were awarded to the participating students. The mentoring framework continues to see much success in allowing undergraduate students, including students from underrepresented groups, to foster scientific talent and make meaningful contributions to the scientific community.


Subject(s)
Mentoring , Humans , Mentors , Students , Universities
7.
Drug Discov Today ; 27(1): 207-214, 2022 01.
Article in English | MEDLINE | ID: mdl-34332096

ABSTRACT

Standardizing data is crucial for preserving and exchanging scientific information. In particular, recording the context in which data were created ensures that information remains findable, accessible, interoperable, and reusable. Here, we introduce the concept of self-reporting data assets (SRDAs), which preserve data and contextual information. SRDAs are an abstract concept, which requires a suitable data format for implementation. Four promising data formats or languages are popularly used to represent data in pharma: JCAMP-DX, JSON, AnIML, and, more recently, the Allotrope Data Format (ADF). Here, we evaluate these four options in common use cases within the pharmaceutical industry using multiple criteria. The evaluation shows that ADF is the most suitable format for the implementation of SRDAs.


Subject(s)
Data Accuracy , Data Curation , Drug Industry , Information Dissemination/methods , Research Design/standards , Data Curation/methods , Data Curation/standards , Diffusion of Innovation , Drug Industry/methods , Drug Industry/organization & administration , Humans , Proof of Concept Study , Reference Standards , Technology, Pharmaceutical/methods
8.
Commun Chem ; 5(1): 69, 2022 Jun 02.
Article in English | MEDLINE | ID: mdl-36697757

ABSTRACT

Molten salts are important thermal conductors used in molten salt reactors and solar applications. To use molten salts safely, accurate knowledge of their thermophysical properties is necessary. However, it is experimentally challenging to measure these properties and a comprehensive evaluation of the full chemical space is unfeasible. Computational methods provide an alternative route to access these properties. Here, we summarize the developments in methods over the last 70 years and cluster them into three relevant eras. We review the main advances and limitations of each era and conclude with an optimistic perspective for the next decade, which will likely be dominated by emerging machine learning techniques. This article is aimed to help researchers in peripheral scientific domains understand the current challenges of molten salt simulation and identify opportunities to contribute.

9.
Int J Mol Sci ; 22(23)2021 Nov 27.
Article in English | MEDLINE | ID: mdl-34884640

ABSTRACT

The field of protein structure prediction has recently been revolutionized through the introduction of deep learning. The current state-of-the-art tool AlphaFold2 can predict highly accurate structures; however, it has a prohibitively long inference time for applications that require the folding of hundreds of sequences. The prediction of protein structure annotations, such as amino acid distances, can be achieved at a higher speed with existing tools, such as the ProSPr network. Here, we report on important updates to the ProSPr network, its performance in the recent Critical Assessment of Techniques for Protein Structure Prediction (CASP14) competition, and an evaluation of its accuracy dependency on sequence length and multiple sequence alignment depth. We also provide a detailed description of the architecture and the training process, accompanied by reusable code. This work is anticipated to provide a solid foundation for the further development of protein distance prediction tools.


Subject(s)
Neural Networks, Computer , Proteins/chemistry , Amino Acid Sequence , Computational Biology/methods , Humans , Protein Conformation , Protein Folding , Protein Structural Elements , Sequence Alignment/methods , Software Design
10.
Proteins ; 89(12): 1987-1996, 2021 12.
Article in English | MEDLINE | ID: mdl-34462960

ABSTRACT

Critical Assessment of Structure Prediction (CASP) is an organization aimed at advancing the state of the art in computing protein structure from sequence. In the spring of 2020, CASP launched a community project to compute the structures of the most structurally challenging proteins coded for in the SARS-CoV-2 genome. Forty-seven research groups submitted over 3000 three-dimensional models and 700 sets of accuracy estimates on 10 proteins. The resulting models were released to the public. CASP community members also worked together to provide estimates of local and global accuracy and identify structure-based domain boundaries for some proteins. Subsequently, two of these structures (ORF3a and ORF8) have been solved experimentally, allowing assessment of both model quality and the accuracy estimates. Models from the AlphaFold2 group were found to have good agreement with the experimental structures, with main chain GDT_TS accuracy scores ranging from 63 (a correct topology) to 87 (competitive with experiment).


Subject(s)
SARS-CoV-2/chemistry , Viral Proteins/chemistry , COVID-19/virology , Genome, Viral , Humans , Models, Molecular , Protein Conformation , Protein Domains , SARS-CoV-2/genetics , Viral Proteins/genetics , Viroporin Proteins/chemistry , Viroporin Proteins/genetics
11.
Sci Rep ; 11(1): 8039, 2021 04 13.
Article in English | MEDLINE | ID: mdl-33850214

ABSTRACT

The prediction of amino acid contacts from protein sequence is an important problem, as protein contacts are a vital step towards the prediction of folded protein structures. We propose that a powerful concept from deep learning, called ensembling, can increase the accuracy of protein contact predictions by combining the outputs of different neural network models. We show that ensembling the predictions made by different groups at the recent Critical Assessment of Protein Structure Prediction (CASP13) outperforms all individual groups. Further, we show that contacts derived from the distance predictions of three additional deep neural networks-AlphaFold, trRosetta, and ProSPr-can be substantially improved by ensembling all three networks. We also show that ensembling these recent deep neural networks with the best CASP13 group creates a superior contact prediction tool. Finally, we demonstrate that two ensembled networks can successfully differentiate between the folds of two highly homologous sequences. In order to build further on these findings, we propose the creation of a better protein contact benchmark set and additional open-source contact prediction methods.


Subject(s)
Computational Biology , Proteins , Neural Networks, Computer , Protein Conformation , Protein Folding
12.
Drug Discov Today ; 26(8): 1922-1928, 2021 08.
Article in English | MEDLINE | ID: mdl-33831582

ABSTRACT

The Allotrope Foundation (AF) is a group of pharmaceutical, device vendor, and software companies that develops and releases technologies [the Allotrope Data Format (ADF), the Allotrope Foundation Ontology (AFO), and the Allotrope Data Models (ADM)] to simplify the exchange of electronic data. We present here the first comprehensive history of the AF, its structure, a list of members and partners, and an introduction to the technologies. Finally, we provide current insights into the adoption and development of the technologies by summarizing the Fall 2020 Allotrope Connect virtual conference. This overview provides an easy access to the AF and highlights opportunities for collaboration.


Subject(s)
Clinical Laboratory Information Systems , Software , Cooperative Behavior , Humans
13.
Nutr Rev ; 79(3): 274-288, 2021 02 11.
Article in English | MEDLINE | ID: mdl-32984896

ABSTRACT

OBJECTIVE: To provide a systematic overview of world dietary sugar and sugar-sweetened beverage (SSB) intake trends in children and adolescents. DATA SOURCES: Medline, Embase, and the Cochrane Central Register of Controlled Trials in the Cochrane Library were searched through January 2019 to identify longitudinal follow-up studies with time-trend data and repeated cross-sectional studies. DATA EXTRACTION: Data from studies reporting ≥ 2 measurements (sugars, SSB, or sweets/candy) over ≥ 2 years and included ≥ 20 healthy, normal- or overweight children or adolescents aged 1-19 years. DATA ANALYSIS: Data from 43 articles (n = 4 prospective cohort studies; n = 39 repeated cross-sectional studies) from 15 countries (n = 8 European countries plus Australia, Canada, China, South Korea, Mexico, Russia, and the United States) are presented narratively. According to the risk of bias in nonrandomized studies of interventions tool, 34 studies were judged to have a moderate risk of bias, and 5 to have a serious risk of bias. CONCLUSIONS: Consumption among US children and adolescents increased substantially in the decades preceding 2000, followed by a faster and continued decline. As a whole, other international intake trends did not reveal drastic increases and decreases in SSB and dietary sugars; they tended to change only slightly across 3 decades.


Subject(s)
Dietary Sugars , Eating , Global Health/trends , Sugar-Sweetened Beverages , Adolescent , Child , Humans
14.
Nat Commun ; 11(1): 4851, 2020 09 25.
Article in English | MEDLINE | ID: mdl-32978386

ABSTRACT

Cell factories converting bio-based precursors to chemicals present an attractive avenue to a sustainable economy, yet screening of genetically diverse strain libraries to identify the best-performing whole-cell biocatalysts is a low-throughput endeavor. For this reason, transcriptional biosensors attract attention as they allow the screening of vast libraries when used in combination with fluorescence-activated cell sorting (FACS). However, broad ligand specificity of transcriptional regulators (TRs) often prohibits the development of such ultra-high-throughput screens. Here, we solve the structure of the TR LysG of Corynebacterium glutamicum, which detects all three basic amino acids. Based on this information, we follow a semi-rational engineering approach using a FACS-based screening/counterscreening strategy to generate an L-lysine insensitive LysG-based biosensor. This biosensor can be used to isolate L-histidine-producing strains by FACS, showing that TR engineering towards a more focused ligand spectrum can expand the scope of application of such metabolite sensors.


Subject(s)
Amino Acid Transport Systems, Basic/chemistry , Bacterial Proteins/chemistry , Biosensing Techniques/methods , Ligands , Metabolic Engineering/methods , Amino Acid Transport Systems, Basic/metabolism , Bacterial Proteins/metabolism , Corynebacterium glutamicum/metabolism , Crystallography , Flow Cytometry/methods , High-Throughput Screening Assays/methods , Lysine/metabolism , Microfluidic Analytical Techniques , Models, Molecular , Protein Conformation , Protein Domains , Thermodynamics
15.
Biochemistry ; 59(17): 1672-1679, 2020 05 05.
Article in English | MEDLINE | ID: mdl-32270676

ABSTRACT

Here we show that a solvent-exposed f-position (i.e., residue 14) within a well-characterized trimeric helix bundle can facilitate a stabilizing long-range synergistic interaction involving b-position Glu10 (i.e., i - 4 relative to residue 14) and c-position Lys18 (i.e., i + 4), depending the identity of residue 14. The extent of stabilization associated with the Glu10-Lys18 pair depends primarily on the presence of a side-chain hydrogen-bond donor at residue 14; the nonpolar or hydrophobic character of residue 14 plays a smaller but still significant role. Crystal structures and molecular dynamics simulations indicate that Glu10 and Lys18 do not interact directly with each other but suggest the possibility that the proximity of residue 14 with Lys18 allows Glu10 to interact favorably with nearby Lys7. Subsequent thermodynamic experiments confirm the important role of Lys7 in the large synergistic stabilization associated with the Glu10-Lys18 pair. Our results highlight the exquisite complexity and surprising long-range synergistic interactions among b-, c-, and f-position residues within helix bundles, suggesting new possibilities for engineering hyperstable helix bundles and emphasizing the need to consider carefully the impact of substitutions at these positions for application-specific purposes.


Subject(s)
Peptides/chemistry , Protein Multimerization , Solvents/chemistry , Amino Acid Sequence , Models, Molecular , Protein Conformation, alpha-Helical , Protein Folding , Thermodynamics , Transition Temperature
16.
J Phys Chem B ; 123(7): 1453-1480, 2019 02 21.
Article in English | MEDLINE | ID: mdl-30525615

ABSTRACT

Understanding the function of a protein requires not only knowledge of its tertiary structure but also an understanding of its conformational dynamics. Nuclear magnetic resonance (NMR) spectroscopy, polarization-resolved fluorescence spectroscopy and molecular dynamics (MD) simulations are powerful methods to provide detailed insight into protein dynamics on multiple time scales by monitoring global rotational diffusion and local flexibility (order parameters) that are sensitive to inter- and intramolecular interactions, respectively. We present an integrated approach where data from these techniques are analyzed and interpreted within a joint theoretical description of depolarization and diffusion, demonstrating their conceptual similarities. This integrated approach is then applied to the autophagy-related protein GABARAP in its cytosolic form, elucidating its dynamics on the pico- to nanosecond time scale and its rotational and translational diffusion for protein concentrations spanning 9 orders of magnitude. We compare the dynamics of GABARAP as monitored by 15N spin relaxation of the backbone amide groups, fluorescence anisotropy decays and fluorescence correlation spectroscopy of side chains labeled with BODIPY FL, and molecular movies of the protein from MD simulations. The recovered parameters agree very well between the distinct techniques if the different measurement conditions (probe localization, sample concentration) are taken into account. Moreover, we propose a method that compares the order parameters of the backbone and side chains to identify potential hinges for large-scale, functionally relevant intradomain motions, such as residues 27/28 at the interface between the two subdomains of GABARAP. In conclusion, the integrated concept of cross-fertilizing techniques presented here is fundamental to obtaining a comprehensive quantitative picture of multiscale protein dynamics and solvation. The possibility to employ these validated techniques under cellular conditions and combine them with fluorescence imaging opens up the perspective of studying the functional dynamics of GABARAP or other proteins in live cells.


Subject(s)
Apoptosis Regulatory Proteins/chemistry , Fluorescence Polarization , Microtubule-Associated Proteins/chemistry , Molecular Dynamics Simulation , Nuclear Magnetic Resonance, Biomolecular , Apoptosis Regulatory Proteins/metabolism , Boron Compounds/chemistry , Humans , Hydrodynamics , Microtubule-Associated Proteins/metabolism , Protein Structure, Tertiary
17.
Proteins ; 84 Suppl 1: 302-13, 2016 09.
Article in English | MEDLINE | ID: mdl-26441154

ABSTRACT

A novel protein refinement protocol is presented which utilizes molecular dynamics (MD) simulations of an ensemble of adaptively restrained homologous replicas. This approach adds evolutionary information to the force field and reduces random conformational fluctuations by coupling of several replicas. It is shown that this protocol refines the majority of models from the CASP11 refinement category and that larger conformational changes of the starting structure are possible than with current state of the art methods. The performance of this protocol in the CASP11 experiment is discussed. We found that the quality of the refined model is correlated with the structural variance of the coupled replicas, which therefore provides a good estimator of model quality. Furthermore, some remarkable refinement results are discussed in detail. Proteins 2016; 84(Suppl 1):302-313. © 2015 Wiley Periodicals, Inc.


Subject(s)
Computational Biology/statistics & numerical data , Models, Statistical , Molecular Dynamics Simulation , Proteins/chemistry , Software , Algorithms , Amino Acid Motifs , Benchmarking , Computational Biology/methods , Humans , Internet , Protein Conformation, alpha-Helical , Protein Conformation, beta-Strand , Protein Folding , Protein Interaction Domains and Motifs , Protein Structure, Tertiary , Sequence Homology, Amino Acid , Thermodynamics
18.
J Chem Theory Comput ; 11(12): 5578-82, 2015 Dec 08.
Article in English | MEDLINE | ID: mdl-26642980

ABSTRACT

Atomic models of proteins built by homology modeling or from low-resolution experimental data may contain considerable local errors. The refinement success of molecular dynamics simulations is usually limited by both force field accuracy and by the substantial width of the conformational distribution at physiological temperatures. We propose a method to overcome both these problems by coupling homologous replicas during a molecular dynamics simulation, which narrows the conformational distribution, and smoothens and even improves the energy landscape by adding evolutionary information.


Subject(s)
Proteins/chemistry , Algorithms , Molecular Dynamics Simulation , Protein Structure, Tertiary , Proteins/metabolism , Temperature
SELECTION OF CITATIONS
SEARCH DETAIL
...