Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 27
Filter
Add more filters










Publication year range
1.
Phys Biol ; 21(2)2024 Jan 31.
Article in English | MEDLINE | ID: mdl-38237200

ABSTRACT

Proteins populate a manifold in the high-dimensional sequence space whose geometrical structure guides their natural evolution. Leveraging recently-developed structure prediction tools based on transformer models, we first examine the protein sequence landscape as defined by an effective energy that is a proxy of sequence foldability. This landscape shares characteristics with optimization challenges encountered in machine learning and constraint satisfaction problems. Our analysis reveals that natural proteins predominantly reside in wide, flat minima within this energy landscape. To investigate further, we employ statistical mechanics algorithms specifically designed to explore regions with high local entropy in relatively flat landscapes. Our findings indicate that these specialized algorithms can identify valleys with higher entropy compared to those found using traditional methods such as Monte Carlo Markov Chains. In a proof-of-concept case, we find that these highly entropic minima exhibit significant similarities to natural sequences, especially in critical key sites and local entropy. Additionally, evaluations through Molecular Dynamics suggests that the stability of these sequences closely resembles that of natural proteins. Our tool combines advancements in machine learning and statistical physics, providing new insights into the exploration of sequence landscapes where wide, flat minima coexist alongside a majority of narrower minima.


Subject(s)
Protein Folding , Proteins , Amino Acid Sequence , Proteins/chemistry , Entropy , Molecular Dynamics Simulation , Algorithms , Protein Conformation , Thermodynamics
2.
Phys Rev E ; 104(6-1): 064117, 2021 Dec.
Article in English | MEDLINE | ID: mdl-35030941

ABSTRACT

The differing ability of polypeptide conformations to act as the native state of proteins has long been rationalized in terms of differing kinetic accessibility or thermodynamic stability. Building on the successful applications of physical concepts and sampling algorithms recently introduced in the study of disordered systems, in particular artificial neural networks, we quantitatively explore how well a quantity known as the local entropy describes the native state of model proteins. In lattice models and all-atom representations of proteins, we are able to efficiently sample high local entropy states and to provide a proof of concept of enhanced stability and folding rate. Our methods are based on simple and general statistical-mechanics arguments, and thus we expect that they are of very general use.


Subject(s)
Protein Folding , Proteins , Entropy , Kinetics , Protein Conformation , Thermodynamics
3.
Article in English | MEDLINE | ID: mdl-23848635

ABSTRACT

Simple models of irreversible dynamical processes such as bootstrap percolation have been successfully applied to describe cascade processes in a large variety of different contexts. However, the problem of analyzing nontypical trajectories, which can be crucial for the understanding of out-of-equilibrium phenomena, is still considered to be intractable in most cases. Here we introduce an efficient method to find and analyze optimized trajectories of cascade processes. We show that for a wide class of irreversible dynamical rules, this problem can be solved efficiently on large-scale systems.

4.
Phys Rev E Stat Nonlin Soft Matter Phys ; 85(2 Pt 1): 021106, 2012 Feb.
Article in English | MEDLINE | ID: mdl-22463152

ABSTRACT

In this paper we study the hard sphere packing problem in the Hamming space by the cavity method. We show that both the replica symmetric and the replica symmetry breaking approximations give maximum rates of packing that are asymptotically the same as the lower bound of Gilbert and Varshamov. Consistently with known numerical results, the replica symmetric equations also suggest a crystalline solution, where for even diameters the spheres are more likely to be found in one of the subspaces (even or odd) of the Hamming space. These crystalline packings can be generated by a recursive algorithm which finds maximum packings in an ultrametric space. Finally, we design a message passing algorithm based on the cavity equations to find dense packings of hard spheres. Known maximum packings are reproduced efficiently in nontrivial ranges of dimensions and number of spheres.


Subject(s)
Colloids/chemistry , Models, Chemical , Models, Molecular , Nanospheres/chemistry , Computer Simulation
5.
Phys Rev E Stat Nonlin Soft Matter Phys ; 84(5 Pt 1): 051111, 2011 Nov.
Article in English | MEDLINE | ID: mdl-22181373

ABSTRACT

In this paper we discuss a novel data compression technique for binary symmetric sources based on the cavity method over GF(q), the Galois Field of order q. We present a scheme of low complexity and near-optimal empirical performance. The compression step is based on a reduction of a sparse low-density parity-check code over GF(q) and is done through the so-called reinforced belief-propagation equations. These reduced codes appear to have a nontrivial geometrical modification of the space of codewords, which makes such compression computationally feasible. The computational complexity is O(dnqlog(2)q) per iteration, where d is the average degree of the check nodes and n is the number of bits. For our code ensemble, decompression can be done in a time linear in the code's length by a simple leaf-removal algorithm.

6.
Phys Rev E Stat Nonlin Soft Matter Phys ; 83(5 Pt 2): 056114, 2011 May.
Article in English | MEDLINE | ID: mdl-21728612

ABSTRACT

We discuss how inference can be performed when data are sampled from the nonergodic phase of systems with multiple attractors. We take as a model system the finite connectivity Hopfield model in the memory phase and suggest a cavity method approach to reconstruct the couplings when the data are separately sampled from few attractor states. We also show how the inference results can be converted into a learning protocol for neural networks in which patterns are presented through weak external fields. The protocol is simple and fully local, and is able to store patterns with a finite overlap with the input patterns without ever reaching a spin-glass phase where all memories are lost.


Subject(s)
Data Interpretation, Statistical , Pattern Recognition, Automated , Algorithms
7.
Phys Rev Lett ; 106(19): 190601, 2011 May 13.
Article in English | MEDLINE | ID: mdl-21668134

ABSTRACT

The matching problem plays a basic role in combinatorial optimization and in statistical mechanics. In its stochastic variants, optimization decisions have to be taken given only some probabilistic information about the instance. While the deterministic case can be solved in polynomial time, stochastic variants are worst-case intractable. We propose an efficient method to solve stochastic matching problems which combines some features of the survey propagation equations and of the cavity method. We test it on random bipartite graphs, for which we analyze the phase diagram and compare the results with exact bounds. Our approach is shown numerically to be effective on the full range of parameters, and to outperform state-of-the-art methods. Finally we discuss how the method can be generalized to other problems of optimization under uncertainty.


Subject(s)
Algorithms , Artificial Intelligence , Nonlinear Dynamics , Numerical Analysis, Computer-Assisted , Computer Simulation , Stochastic Processes , Uncertainty
8.
Proc Natl Acad Sci U S A ; 108(2): 882-7, 2011 Jan 11.
Article in English | MEDLINE | ID: mdl-21187432

ABSTRACT

External information propagates in the cell mainly through signaling cascades and transcriptional activation, allowing it to react to a wide spectrum of environmental changes. High-throughput experiments identify numerous molecular components of such cascades that may, however, interact through unknown partners. Some of them may be detected using data coming from the integration of a protein-protein interaction network and mRNA expression profiles. This inference problem can be mapped onto the problem of finding appropriate optimal connected subgraphs of a network defined by these datasets. The optimization procedure turns out to be computationally intractable in general. Here we present a new distributed algorithm for this task, inspired from statistical physics, and apply this scheme to alpha factor and drug perturbations data in yeast. We identify the role of the COS8 protein, a member of a gene family of previously unknown function, and validate the results by genetic experiments. The algorithm we present is specially suited for very large datasets, can run in parallel, and can be adapted to other problems in systems biology. On renowned benchmarks it outperforms other algorithms in the field.


Subject(s)
Computational Biology/methods , Signal Transduction/physiology , Adenosine Triphosphatases/chemistry , Algorithms , Alleles , Biophysics/methods , Endosomal Sorting Complexes Required for Transport/chemistry , Models, Biological , Models, Statistical , Pheromones , Plasmids/metabolism , Protein Interaction Mapping , RNA, Messenger/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae Proteins/chemistry , Software , Transcription, Genetic
9.
Phys Rev Lett ; 101(3): 037208, 2008 Jul 18.
Article in English | MEDLINE | ID: mdl-18764290

ABSTRACT

The minimum weight Steiner tree (MST) is an important combinatorial optimization problem over networks that has applications in a wide range of fields. Here we discuss a general technique to translate the imposed global connectivity constrain into many local ones that can be analyzed with cavity equation techniques. This approach leads to a new optimization algorithm for MST and allows us to analyze the statistical mechanics properties of MST on random graphs of various types.


Subject(s)
Algorithms , Models, Statistical , Population Dynamics
10.
Phys Rev E Stat Nonlin Soft Matter Phys ; 77(3 Pt 1): 031118, 2008 Mar.
Article in English | MEDLINE | ID: mdl-18517340

ABSTRACT

We study the entropy landscape of solutions for the bicoloring problem in random graphs, a representative difficult constraint satisfaction problem. Our goal is to classify which types of clusters of solutions are addressed by different algorithms. In the first part of the study we use the cavity method to obtain the number of clusters with a given internal entropy and determine the phase diagram of the problem--e.g., dynamical, rigidity, and satisfiability-unsatisfiability (SAT-UNSAT) transitions. In the second part of the paper we analyze different algorithms and locate their behavior in the entropy landscape of the problem. For instance, we show that a smoothed version of a decimation strategy based on belief propagation is able to find solutions belonging to subdominant clusters even beyond the so-called rigidity transition where the thermodynamically relevant clusters become frozen. These nonequilibrium solutions belong to the most probable unfrozen clusters.

11.
Chaos ; 17(2): 026109, 2007 Jun.
Article in English | MEDLINE | ID: mdl-17614696

ABSTRACT

Boolean networks and their dynamics are of great interest as abstract modeling schemes in various disciplines, ranging from biology to computer science. Whereas parallel update schemes have been studied extensively in past years, the level of understanding of asynchronous updates schemes is still very poor. In this paper we study the propagation of external information given by regulatory input variables into a random Boolean network. We compute both analytically and numerically the time evolution and the asymptotic behavior of this propagation of external regulation (PER). In particular, this allows us to identify variables that are completely determined by this external information. All those variables in the network that are not directly fixed by PER form a core which contains, in particular, all nontrivial feedback loops. We design a message-passing approach allowing to characterize the statistical properties of these cores in dependence of the Boolean network and the external condition. At the end we establish a link between PER dynamics and the full random asynchronous dynamics of a Boolean network.

12.
Phys Rev Lett ; 96(1): 018101, 2006 Jan 13.
Article in English | MEDLINE | ID: mdl-16486521

ABSTRACT

The determination and classification of fixed points of large Boolean networks is addressed in terms of a constraint-satisfaction problem. We develop a general simplification scheme that, removing all those variables and functions belonging to trivial logical cascades, returns the computational core of the network. The transition line from an easy to a complex regulatory phase is described as a function of the parameters of the model, identifying thereby both theoretically and algorithmically the relevant regulatory variables.


Subject(s)
Algorithms , Gene Expression Regulation , Models, Genetic
13.
Phys Rev Lett ; 94(19): 197205, 2005 May 20.
Article in English | MEDLINE | ID: mdl-16090207

ABSTRACT

Using elementary rigorous methods we prove the existence of a clustered phase in the random K-SAT problem, for K > or = 8. In this phase the solutions are grouped into clusters which are far away from each other. The results are in agreement with previous predictions of the cavity method and give a rigorous confirmation to one of its main building blocks. It can be generalized to other systems of both physical and computational interest.

14.
Phys Rev E Stat Nonlin Soft Matter Phys ; 68(3 Pt 2): 036702, 2003 Sep.
Article in English | MEDLINE | ID: mdl-14524921

ABSTRACT

We study the graph coloring problem over random graphs of finite average connectivity c. Given a number q of available colors, we find that graphs with low connectivity admit almost always a proper coloring whereas graphs with high connectivity are uncolorable. Depending on q, we find with a one-step replica-symmetry breaking approximation the precise value of the critical average connectivity c(q). Moreover, we show that below c(q) there exists a clustering phase c in [c(d),c(q)] in which ground states spontaneously divide into an exponential number of clusters. Furthermore, we extended our considerations to the case of single instances showing consistent results. This leads us to propose a different algorithm that is able to color in polynomial time random graphs in the hard but colorable region, i.e., when c in [c(d),c(q)].

15.
Phys Rev Lett ; 89(26): 268701, 2002 Dec 23.
Article in English | MEDLINE | ID: mdl-12484862

ABSTRACT

We study the graph coloring problem over random graphs of finite average connectivity c. Given a number q of available colors, we find that graphs with low connectivity admit almost always a proper coloring, whereas graphs with high connectivity are uncolorable. Depending on q, we find the precise value of the critical average connectivity c(q). Moreover, we show that below c(q) there exists a clustering phase c in [c(d),c(q)] in which ground states spontaneously divide into an exponential number of clusters and where the proliferation of metastable states is responsible for the onset of complexity in local search algorithms.

16.
Science ; 297(5582): 812-5, 2002 Aug 02.
Article in English | MEDLINE | ID: mdl-12089451

ABSTRACT

We study the satisfiability of random Boolean expressions built from many clauses with K variables per clause (K-satisfiability). Expressions with a ratio alpha of clauses to variables less than a threshold alphac are almost always satisfiable, whereas those with a ratio above this threshold are almost always unsatisfiable. We show the existence of an intermediate phase below alphac, where the proliferation of metastable states is responsible for the onset of complexity in search algorithms. We introduce a class of optimization algorithms that can deal with these metastable states; one such algorithm has been tested successfully on the largest existing benchmark of K-satisfiability.

17.
Phys Rev Lett ; 88(18): 188701, 2002 May 06.
Article in English | MEDLINE | ID: mdl-12005728

ABSTRACT

A major problem in evaluating stochastic local search algorithms for NP-complete problems is the need for a systematic generation of hard test instances having previously known properties of the optimal solutions. On the basis of statistical mechanics results, we propose random generators of hard and satisfiable instances for the 3-satisfiability problem. The design of the hardest problem instances is based on the existence of a first order ferromagnetic phase transition and the glassy nature of excited states. The analytical predictions are corroborated by numerical results obtained from complete as well as stochastic local algorithms.


Subject(s)
Models, Theoretical , Algorithms , Stochastic Processes
18.
Phys Rev Lett ; 87(20): 208701, 2001 Nov 12.
Article in English | MEDLINE | ID: mdl-11690517

ABSTRACT

We study analytically and by computer simulations a complex system of adaptive agents with finite memory. Borrowing the framework of the minority game and using the replica formalism we show the existence of an equilibrium phase transition as a function of the ratio between the memory lambda and the learning rates Gamma of the agents. We show that, starting from a random configuration, a dynamic phase transition also exists, which prevents agents from reaching optimal coordination. Furthermore, in a nonstationary environment, we show by numerical simulations that the phase transition becomes discontinuous.


Subject(s)
Learning , Models, Theoretical , Social Adjustment , Game Theory , Humans , Memory
19.
Phys Rev Lett ; 87(12): 127209, 2001 Sep 17.
Article in English | MEDLINE | ID: mdl-11580554

ABSTRACT

We study the low temperature properties of p-spin glass models with finite connectivity and of some optimization problems. Using a one-step functional replica symmetry breaking ansatz we can solve exactly the saddle-point equations for graphs with uniform connectivity. The resulting ground state energy is in perfect agreement with numerical simulations. For fluctuating connectivity graphs, the same ansatz can be used in a variational way: For p-spin models (known as p-XOR-SAT in computer science) it provides the exact configurational entropy together with the dynamical and static critical connectivities (for p = 3, gamma(d) = 0.818, and gamma(s) = 0.918), whereas for hard optimization problems like 3-SAT or Bicoloring it provides new upper bounds for their critical thresholds ( gamma(var)(c) = 4.396 and gamma(var)(c) = 2.149).

20.
Phys Rev E Stat Nonlin Soft Matter Phys ; 63(2 Pt 2): 026702, 2001 Feb.
Article in English | MEDLINE | ID: mdl-11308607

ABSTRACT

We study a simple and exactly solvable model for the generation of random satisfiability problems. These consist of gammaN random boolean constraints which are to be satisfied simultaneously by N logical variables. In statistical-mechanics language, the considered model can be seen as a diluted p-spin model at zero temperature. While such problems become extraordinarily hard to solve by local search methods in a large region of the parameter space, still at least one solution may be superimposed by construction. The statistical properties of the model can be studied exactly by the replica method and each single instance can be analyzed in polynomial time by a simple global solution method. The geometrical and topological structures responsible for dynamic and static phase transitions as well as for the onset of computational complexity in the local search method are thoroughly analyzed. Numerical analysis on very large samples allows for a precise characterization of the critical scaling behavior.

SELECTION OF CITATIONS
SEARCH DETAIL
...