Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
Add more filters










Publication year range
1.
Epigenetics Chromatin ; 16(1): 42, 2023 10 26.
Article in English | MEDLINE | ID: mdl-37880732

ABSTRACT

Cell-cell communication is mediated by membrane receptors and their ligands, such as the Eph/ephrin system, orchestrating cell migration during development and in diverse cancer types. Epigenetic mechanisms are key for integrating external "signals", e.g., from neighboring cells, into the transcriptome in health and disease. Previously, we reported ephrinA5 to trigger transcriptional changes of lncRNAs and protein-coding genes in cerebellar granule cells, a cell model for medulloblastoma. LncRNAs represent important adaptors for epigenetic writers through which they regulate gene expression. Here, we investigate a lncRNA-mediated targeting of DNMT1 to specific gene loci by the combined power of in silico modeling of RNA/DNA interactions and wet lab approaches, in the context of the clinically relevant use case of ephrinA5-dependent regulation of cellular motility of cerebellar granule cells. We provide evidence that Snhg15, a cancer-related lncRNA, recruits DNMT1 to the Ncam1 promoter through RNA/DNA triplex structure formation and the interaction with DNMT1. This mediates DNA methylation-dependent silencing of Ncam1, being abolished by ephrinA5 stimulation-triggered reduction of Snhg15 expression. Hence, we here propose a triple helix recognition mechanism, underlying cell motility regulation via lncRNA-targeted DNA methylation in a clinically relevant context.


Subject(s)
RNA, Long Noncoding , RNA, Long Noncoding/genetics , Gene Expression Regulation, Neoplastic , Cell Line, Tumor , DNA , Cell Movement
2.
Biology (Basel) ; 12(4)2023 Apr 10.
Article in English | MEDLINE | ID: mdl-37106781

ABSTRACT

The disordered nature of Intrinsically Disordered Proteins (IDPs) makes their structural ensembles particularly susceptible to changes in chemical environmental conditions, often leading to an alteration of their normal functions. A Radial Distribution Function (RDF) is considered a standard method for characterizing the chemical environment surrounding particles during atomistic simulations, commonly averaged over an entire or part of a trajectory. Given their high structural variability, such averaged information might not be reliable for IDPs. We introduce the Time-Resolved Radial Distribution Function (TRRDF), implemented in our open-source Python package SPEADI, which is able to characterize dynamic environments around IDPs. We use SPEADI to characterize the dynamic distribution of ions around the IDPs Alpha-Synuclein (AS) and Humanin (HN) from Molecular Dynamics (MD) simulations, and some of their selected mutants, showing that local ion-residue interactions play an important role in the structures and behaviors of IDPs.

3.
J Chem Inf Model ; 60(3): 1568-1584, 2020 03 23.
Article in English | MEDLINE | ID: mdl-31905288

ABSTRACT

Improving an enzyme's (thermo-)stability or tolerance against solvents and detergents is highly relevant in protein engineering and biotechnology. Recent developments have tended toward data-driven approaches, where available knowledge about the protein is used to identify substitution sites with high potential to yield protein variants with improved stability, and subsequently, substitutions are engineered by site-directed or site-saturation (SSM) mutagenesis. However, the development and validation of algorithms for data-driven approaches have been hampered by the lack of availability of large-scale data measured in a uniform way and being unbiased with respect to substitution types and locations. Here, we extend our knowledge on guidelines for protein engineering following a data-driven approach by scrutinizing the impact of substitution sites on thermostability or/and detergent tolerance for Bacillus subtilis lipase A (BsLipA) at very large scale. We systematically analyze a complete experimental SSM library of BsLipA containing all 3439 possible single variants, which was evaluated as to thermostability and tolerances against four detergents under respectively uniform conditions. Our results provide systematic and unbiased reference data at unprecedented scale for a biotechnologically important protein, identify consistently defined hot spot types for evaluating the performance of data-driven protein-engineering approaches, and show that the rigidity theory and ensemble-based approach Constraint Network Analysis yields hot spot predictions with an up to ninefold gain in precision over random classification.


Subject(s)
Bacillus subtilis , Lipase , Amino Acid Substitution , Bacillus subtilis/genetics , Bacillus subtilis/metabolism , Detergents , Enzyme Stability , Lipase/genetics , Lipase/metabolism , Mutagenesis
4.
Methods Mol Biol ; 1484: 65-82, 2017.
Article in English | MEDLINE | ID: mdl-27787821

ABSTRACT

More than two decades of research have enabled dihedral angle predictions at an accuracy that makes them an interesting alternative or supplement to secondary structure prediction that provides detailed local structure information for every residue of a protein. The evolution of dihedral angle prediction methods is closely linked to advancements in machine learning and other relevant technologies. Consequently recent improvements in large-scale training of deep neural networks have led to the best method currently available, which achieves a mean absolute error of 19° for phi, and 30° for psi. This performance opens interesting perspectives for the application of dihedral angle prediction in the comparison, prediction, and design of protein structures.


Subject(s)
Protein Structure, Secondary , Proteins/chemistry , Amino Acid Sequence/genetics , Databases, Protein , Models, Molecular , Neural Networks, Computer , Protein Conformation , Proteins/genetics
5.
Eur Phys J E Soft Matter ; 39(1): 11, 2016 Jan.
Article in English | MEDLINE | ID: mdl-26830760

ABSTRACT

In many applications in soft and biological physics, there are multiple time and length scales involved but often with a distinct separation between them. For instance, in enzyme kinetics, enzymes are relatively large, move slowly and their copy numbers are typically small, while the metabolites (being transformed by these enzymes) are often present in abundance, are small in size and diffuse fast. It seems thus natural to apply different techniques to different time and length levels and couple them. Here we explore this possibility by constructing a stochastic-deterministic discrete-continuous reaction-diffusion model with mobile sources and sinks. Such an approach allows in particular to separate different sources of stochasticity. We demonstrate its application by modelling enzyme-catalysed reactions with freely diffusing enzymes and a heterogeneous source of metabolites. Our calculations suggest that using a higher amount of less active enzymes, as compared to fewer more active enzymes, reduces the metabolite pool size and correspondingly the lag time, giving rise to a faster response to external stimuli. The methodology presented can be extended to more complex systems and offers exciting possibilities for studying problems where spatial heterogeneities, stochasticity or discreteness play a role.


Subject(s)
Biocatalysis , Models, Theoretical , Diffusion , Enzymes/chemistry , Kinetics , Stochastic Processes
6.
BMC Bioinformatics ; 16: 284, 2015 Sep 03.
Article in English | MEDLINE | ID: mdl-26335531

ABSTRACT

BACKGROUND: Peptide-spectrum matching is a common step in most data processing workflows for mass spectrometry-based proteomics. Many algorithms and software packages, both free and commercial, have been developed to address this task. However, these algorithms typically require the user to select instrument- and sample-dependent parameters, such as mass measurement error tolerances and number of missed enzymatic cleavages. In order to select the best algorithm and parameter set for a particular dataset, in-depth knowledge about the data as well as the algorithms themselves is needed. Most researchers therefore tend to use default parameters, which are not necessarily optimal. RESULTS: We have applied a new optimization framework for the Taverna scientific workflow management system (http://ms-utils.org/Taverna_Optimization.pdf) to find the best combination of parameters for a given scientific workflow to perform peptide-spectrum matching. The optimizations themselves are non-trivial, as demonstrated by several phenomena that can be observed when allowing for larger mass measurement errors in sequence database searches. On-the-fly parameter optimization embedded in scientific workflow management systems enables experts and non-experts alike to extract the maximum amount of information from the data. The same workflows could be used for exploring the parameter space and compare algorithms, not only for peptide-spectrum matching, but also for other tasks, such as retention time prediction. CONCLUSION: Using the optimization framework, we were able to learn about how the data was acquired as well as the explored algorithms. We observed a phenomenon identifying many ammonia-loss b-ion spectra as peptides with N-terminal pyroglutamate and a large precursor mass measurement error. These insights could only be gained with the extension of the common range for the mass measurement error tolerance parameters explored by the optimization framework.


Subject(s)
Algorithms , Computational Biology/standards , Peptide Fragments/analysis , Proteins/analysis , Proteomics/methods , Software , Workflow , Computational Biology/methods , Databases, Protein , Humans , Mass Spectrometry , Models, Statistical , Peptide Fragments/chemistry , Programming Languages , Proteins/chemistry , User-Computer Interface
7.
Phys Biol ; 12(4): 046003, 2015 May 28.
Article in English | MEDLINE | ID: mdl-26020120

ABSTRACT

We study diffusion of macromolecules in a crowded cytoplasm-like environment, focusing on its dependence on composition and its crossover to the anomalous subdiffusion. The crossover and the diffusion itself depend on both the volume fraction and the relative concentration of macromolecules. In accordance with previous theoretical and experimental studies, diffusion slows down when the volume fraction increases. Contrary to expectations, however, the diffusion is also strongly dependent on the molecular composition. The crossover time decreases and diffusion slows down when the smaller macromolecules start to dominate. Interestingly, diffusion is faster in a cytoplasm-like (more polydisperse) system than it is in a two-component system, at comparable packing fractions, or even when the cytoplasm packing fraction is larger.


Subject(s)
Cytoplasm/chemistry , Macromolecular Substances/chemistry , Diffusion , Models, Biological
8.
Bioinformatics ; 31(2): 201-8, 2015 Jan 15.
Article in English | MEDLINE | ID: mdl-25246432

ABSTRACT

MOTIVATION: Intrinsically disordered regions are key for the function of numerous proteins. Due to the difficulties in experimental disorder characterization, many computational predictors have been developed with various disorder flavors. Their performance is generally measured on small sets mainly from experimentally solved structures, e.g. Protein Data Bank (PDB) chains. MobiDB has only recently started to collect disorder annotations from multiple experimental structures. RESULTS: MobiDB annotates disorder for UniProt sequences, allowing us to conduct the first large-scale assessment of fast disorder predictors on 25 833 different sequences with X-ray crystallographic structures. In addition to a comprehensive ranking of predictors, this analysis produced the following interesting observations. (i) The predictors cluster according to their disorder definition, with a consensus giving more confidence. (ii) Previous assessments appear over-reliant on data annotated at the PDB chain level and performance is lower on entire UniProt sequences. (iii) Long disordered regions are harder to predict. (iv) Depending on the structural and functional types of the proteins, differences in prediction performance of up to 10% are observed. AVAILABILITY: The datasets are available from Web site at URL: http://mobidb.bio.unipd.it/lsd. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Proteins/chemistry , Sequence Analysis, Protein/methods , Tumor Suppressor Protein p53/chemistry , Crystallography, X-Ray , Databases, Protein , Humans , Molecular Sequence Annotation , Protein Structure, Tertiary
9.
Proteins ; 81(8): 1446-56, 2013 Aug.
Article in English | MEDLINE | ID: mdl-23553942

ABSTRACT

For computational studies of protein folding, proteins with both helical and ß-sheet secondary structure elements are very challenging, as they expose subtle biases of the physical models. Here, we present reproducible folding of a 92 residue α/ß protein (residues 3-94 of Top7, PDB ID: 1QYS) in computer simulations starting from random initial conformations using a transferable physical model which has been previously shown to describe the folding and thermodynamic properties of about 20 other smaller proteins of different folds. Top7 is a de novo designed protein with two α-helices and a five stranded ß-sheet. Experimentally, it is known to be unusually stable for its size, and its folding transition distinctly deviates from the two-state behavior commonly seen in natural single domain proteins. In our all-atom implicit solvent parallel tempering Monte Carlo simulations, Top7 shows a rapid transition to a group of states with high native-like secondary structure, and a much slower subsequent transition to the native state with a root mean square deviation of about 3.5 Å from the experimentally determined structure. Consistent with experiments, we find Top7 to be thermally extremely stable, although the simulations also find a large number of very stable non-native states with high native-like secondary structure.


Subject(s)
Protein Folding , Proteins/chemistry , Monte Carlo Method , Protein Structure, Secondary , Thermodynamics
10.
J Chem Inf Model ; 48(9): 1903-8, 2008 Sep.
Article in English | MEDLINE | ID: mdl-18763837

ABSTRACT

Constraint generation for 3d structure prediction and structure-based database searches benefit from fine-grained prediction of local structure. In this work, we present LOCUSTRA, a novel scheme for the multiclass prediction of local structure that uses two layers of support vector machines (SVM). Using a 16-letter structural alphabet from de Brevern et al. (Proteins: Struct., Funct., Bioinf. 2000, 41, 271-287), we assess its prediction ability for an independent test set of 222 proteins and compare our method to three-class secondary structure prediction and direct prediction of dihedral angles. The prediction accuracy is Q16=61.0% for the 16 classes of the structural alphabet and Q3=79.2% for a simple mapping to the three secondary classes helix, sheet, and coil. We achieve a mean phi(psi) error of 24.74 degrees (38.35 degrees) and a median RMSDA (root-mean-square deviation of the (dihedral) angles) per protein chain of 52.1 degrees. These results compare favorably with related approaches. The LOCUSTRA web server is freely available to researchers at http://www.fz-juelich.de/nic/cbb/service/service.php.


Subject(s)
Computer Simulation , Models, Biological , Proteins/chemistry , Quantitative Structure-Activity Relationship , Algorithms , Databases, Factual , Models, Molecular , Peptidyl Transferases/chemistry , Predictive Value of Tests , Protein Structure, Tertiary , Streptomyces/enzymology
11.
Proc Natl Acad Sci U S A ; 105(23): 8004-7, 2008 Jun 10.
Article in English | MEDLINE | ID: mdl-18408166

ABSTRACT

Protein structures often feature beta-sheets in which adjacent beta-strands have large sequence separation. How the folding process orchestrates the formation and correct arrangement of these strands is not comprehensively understood. Particularly challenging are proteins in which beta-strands at the N and C termini are neighbors in a beta-sheet. The N-terminal beta-strand is synthesized early on, but it can not bind to the C terminus before the chain is fully synthesized. During this time, there is a danger that the beta-strand at the N terminus interacts with nearby molecules, leading to potentially harmful aggregates of incompletely folded proteins. Simulations of the C-terminal fragment of Top7 show that this risk of misfolding and aggregation can be avoided by a "caching" mechanism that relies on the "chameleon" behavior of certain segments.


Subject(s)
Computer Simulation , Peptide Fragments/chemistry , Protein Folding , Proteins/chemistry , Proteins/metabolism , Protein Structure, Secondary , Temperature , Thermodynamics
12.
Biochim Biophys Acta ; 1784(1): 252-8, 2008 Jan.
Article in English | MEDLINE | ID: mdl-18036571

ABSTRACT

Recent improvements in methodology and increased computer power now allow atomistic computer simulations of protein folding. We briefly review several advanced Monte Carlo algorithms that have contributed to this development. Details of folding simulations of three designed mini proteins are shown. Adding global translations and rotations has allowed us to handle multiple chains and to simulate the aggregation of six beta-amyloid fragments. In a different line of research we have developed several algorithms to predict local features from sequence. In an outlook we sketch how such biasing could extend the application spectrum of Monte Carlo simulations to structure prediction of larger proteins.


Subject(s)
Computational Biology/methods , Protein Folding , Proteins/chemistry , Algorithms , Computer Simulation , Models, Molecular , Monte Carlo Method , Protein Conformation
13.
In Silico Biol ; 7(4-5): 535-42, 2007.
Article in English | MEDLINE | ID: mdl-18391242

ABSTRACT

Most secondary structure prediction programs do not distinguish between parallel and antiparallel beta-sheets. However, such knowledge would constrain the available topologies of a protein significantly, and therefore aid existing fold recognition algorithms. For this reason, we propose a technique which, in combination with existing secondary structure programs such as PSIPRED, allows one to distinguish between parallel and antiparallel beta-sheets. We propose the use of a support vector machine (SVM) procedure, BETTY, to predict parallel and antiparallel sheets from sequence. We found that there is a strong signal difference in the sequence profiles which SVMs can efficiently extract. With strand type assignment accuracies of 90.7% and 83.3% for antiparallel and parallel strands, respectively, our method adds considerably to existing information on current 3-class secondary structure predictions. BETTY has been implemented as an online service which academic researchers can access from our website http://www.fz-juelich.de/nic/cbb/service/service.php.


Subject(s)
Amino Acid Motifs , Artificial Intelligence , Protein Structure, Secondary , Algorithms , Amino Acid Sequence , Models, Molecular , Protein Folding
14.
Bioinformatics ; 22(24): 3009-15, 2006 Dec 15.
Article in English | MEDLINE | ID: mdl-17005536

ABSTRACT

MOTIVATION: Most secondary structure prediction programs target only alpha helix and beta sheet structures and summarize all other structures in the random coil pseudo class. However, such an assignment often ignores existing local ordering in so-called random coil regions. Signatures for such ordering are distinct dihedral angle pattern. For this reason, we propose as an alternative approach to predict directly dihedral regions for each residue as this leads to a higher amount of structural information. RESULTS: We propose a multi-step support vector machine (SVM) procedure, dihedral prediction (DHPRED), to predict the dihedral angle state of residues from sequence. Trained on 20,000 residues our approach leads to dihedral region predictions, that in regions without alpha helices or beta sheets is higher than those from secondary structure prediction programs. AVAILABILITY: DHPRED has been implemented as a web service, which academic researchers can access from our webpage http://www.fz-juelich.de/nic/cbb


Subject(s)
Artificial Intelligence , Models, Chemical , Models, Molecular , Pattern Recognition, Automated/methods , Proteins/chemistry , Proteins/ultrastructure , Sequence Analysis, Protein/methods , Algorithms , Amino Acid Sequence , Computer Simulation , Molecular Sequence Data , Protein Structure, Secondary , Sequence Alignment/methods
SELECTION OF CITATIONS
SEARCH DETAIL
...