Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
1.
Proc Natl Acad Sci U S A ; 121(27): e2311810121, 2024 Jul 02.
Artigo em Inglês | MEDLINE | ID: mdl-38913892

RESUMO

Recent years witnessed the development of powerful generative models based on flows, diffusion, or autoregressive neural networks, achieving remarkable success in generating data from examples with applications in a broad range of areas. A theoretical analysis of the performance and understanding of the limitations of these methods remain, however, challenging. In this paper, we undertake a step in this direction by analyzing the efficiency of sampling by these methods on a class of problems with a known probability distribution and comparing it with the sampling performance of more traditional methods such as the Monte Carlo Markov chain and Langevin dynamics. We focus on a class of probability distribution widely studied in the statistical physics of disordered systems that relate to spin glasses, statistical inference, and constraint satisfaction problems. We leverage the fact that sampling via flow-based, diffusion-based, or autoregressive networks methods can be equivalently mapped to the analysis of a Bayes optimal denoising of a modified probability measure. Our findings demonstrate that these methods encounter difficulties in sampling stemming from the presence of a first-order phase transition along the algorithm's denoising path. Our conclusions go both ways: We identify regions of parameters where these methods are unable to sample efficiently, while that is possible using standard Monte Carlo or Langevin approaches. We also identify regions where the opposite happens: standard approaches are inefficient while the discussed generative methods work well.

2.
Phys Rev E ; 109(3-1): 034305, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38632742

RESUMO

While classical in many theoretical settings-and in particular in statistical physics-inspired works-the assumption of Gaussian i.i.d. input data is often perceived as a strong limitation in the context of statistics and machine learning. In this study, we redeem this line of work in the case of generalized linear classification, also known as the perceptron model, with random labels. We argue that there is a large universality class of high-dimensional input data for which we obtain the same minimum training loss as for Gaussian data with corresponding data covariance. In the limit of vanishing regularization, we further demonstrate that the training loss is independent of the data covariance. On the theoretical side, we prove this universality for an arbitrary mixture of homogeneous Gaussian clouds. Empirically, we show that the universality holds also for a broad range of real data sets.

3.
PLoS Comput Biol ; 19(1): e1010813, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36716332

RESUMO

The advent of comprehensive synaptic wiring diagrams of large neural circuits has created the field of connectomics and given rise to a number of open research questions. One such question is whether it is possible to reconstruct the information stored in a recurrent network of neurons, given its synaptic connectivity matrix. Here, we address this question by determining when solving such an inference problem is theoretically possible in specific attractor network models and by providing a practical algorithm to do so. The algorithm builds on ideas from statistical physics to perform approximate Bayesian inference and is amenable to exact analysis. We study its performance on three different models, compare the algorithm to standard algorithms such as PCA, and explore the limitations of reconstructing stored patterns from synaptic connectivity.


Assuntos
Redes Neurais de Computação , Neurônios , Teorema de Bayes , Neurônios/fisiologia , Algoritmos , Modelos Neurológicos
4.
Proc Natl Acad Sci U S A ; 118(32)2021 08 10.
Artigo em Inglês | MEDLINE | ID: mdl-34312253

RESUMO

Contact tracing is an essential tool to mitigate the impact of a pandemic, such as the COVID-19 pandemic. In order to achieve efficient and scalable contact tracing in real time, digital devices can play an important role. While a lot of attention has been paid to analyzing the privacy and ethical risks of the associated mobile applications, so far much less research has been devoted to optimizing their performance and assessing their impact on the mitigation of the epidemic. We develop Bayesian inference methods to estimate the risk that an individual is infected. This inference is based on the list of his recent contacts and their own risk levels, as well as personal information such as results of tests or presence of syndromes. We propose to use probabilistic risk estimation to optimize testing and quarantining strategies for the control of an epidemic. Our results show that in some range of epidemic spreading (typically when the manual tracing of all contacts of infected people becomes practically impossible but before the fraction of infected people reaches the scale where a lockdown becomes unavoidable), this inference of individuals at risk could be an efficient way to mitigate the epidemic. Our approaches translate into fully distributed algorithms that only require communication between individuals who have recently been in contact. Such communication may be encrypted and anonymized, and thus, it is compatible with privacy-preserving standards. We conclude that probabilistic risk estimation is capable of enhancing the performance of digital contact tracing and should be considered in the mobile applications.


Assuntos
Busca de Comunicante/métodos , Epidemias/prevenção & controle , Algoritmos , Teorema de Bayes , COVID-19/epidemiologia , COVID-19/prevenção & controle , Busca de Comunicante/estatística & dados numéricos , Humanos , Aplicativos Móveis , Privacidade , Medição de Risco , SARS-CoV-2
5.
J Stat Mech ; 2020(12): 124010, 2020 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-34262607

RESUMO

Deep neural networks achieve stellar generalisation even when they have enough parameters to easily fit all their training data. We study this phenomenon by analysing the dynamics and the performance of over-parameterised two-layer neural networks in the teacher-student setup, where one network, the student, is trained on data generated by another network, called the teacher. We show how the dynamics of stochastic gradient descent (SGD) is captured by a set of differential equations and prove that this description is asymptotically exact in the limit of large inputs. Using this framework, we calculate the final generalisation error of student networks that have more parameters than their teachers. We find that the final generalisation error of the student increases with network size when training only the first layer, but stays constant or even decreases with size when training both layers. We show that these different behaviours have their root in the different solutions SGD finds for different activation functions. Our results indicate that achieving good generalisation in neural networks goes beyond the properties of SGD alone and depends on the interplay of at least the algorithm, the model architecture, and the data set.

6.
Proc Natl Acad Sci U S A ; 116(12): 5451-5460, 2019 03 19.
Artigo em Inglês | MEDLINE | ID: mdl-30824595

RESUMO

Generalized linear models (GLMs) are used in high-dimensional machine learning, statistics, communications, and signal processing. In this paper we analyze GLMs when the data matrix is random, as relevant in problems such as compressed sensing, error-correcting codes, or benchmark models in neural networks. We evaluate the mutual information (or "free entropy") from which we deduce the Bayes-optimal estimation and generalization errors. Our analysis applies to the high-dimensional limit where both the number of samples and the dimension are large and their ratio is fixed. Nonrigorous predictions for the optimal errors existed for special cases of GLMs, e.g., for the perceptron, in the field of statistical physics based on the so-called replica method. Our present paper rigorously establishes those decades-old conjectures and brings forward their algorithmic interpretation in terms of performance of the generalized approximate message-passing algorithm. Furthermore, we tightly characterize, for many learning problems, regions of parameters for which this algorithm achieves the optimal performance and locate the associated sharp phase transitions separating learnable and nonlearnable regions. We believe that this random version of GLMs can serve as a challenging benchmark for multipurpose algorithms.

7.
Opt Express ; 23(9): 11898-911, 2015 May 04.
Artigo em Inglês | MEDLINE | ID: mdl-25969280

RESUMO

This paper investigates experimental means of measuring the transmission matrix (TM) of a highly scattering medium, with the simplest optical setup. Spatial light modulation is performed by a digital micromirror device (DMD), allowing high rates and high pixel counts but only binary amplitude modulation. On the sensor side, without a reference beam, the CCD camera provides only intensity measurements. Within this framework, this paper shows that the TM can still be retrieved, through signal processing techniques of phase retrieval. This is experimentally validated on three criteria : quality of prediction, distribution of singular values, and quality of focusing.

8.
J Stat Mech ; 2014(5)2014 May.
Artigo em Inglês | MEDLINE | ID: mdl-26167197

RESUMO

The proliferation of models for networks raises challenging problems of model selection: the data are sparse and globally dependent, and models are typically high-dimensional and have large numbers of latent variables. Together, these issues mean that the usual model-selection criteria do not work properly for networks. We illustrate these challenges, and show one way to resolve them, by considering the key network-analysis problem of dividing a graph into communities or blocks of nodes with homogeneous patterns of links to the rest of the network. The standard tool for undertaking this is the stochastic block model, under which the probability of a link between two nodes is a function solely of the blocks to which they belong. This imposes a homogeneous degree distribution within each block; this can be unrealistic, so degree-corrected block models add a parameter for each node, modulating its overall degree. The choice between ordinary and degree-corrected block models matters because they make very different inferences about communities. We present the first principled and tractable approach to model selection between standard and degree-corrected block models, based on new large-graph asymptotics for the distribution of log-likelihood ratios under the stochastic block model, finding substantial departures from classical results for sparse graphs. We also develop linear-time approximations for log-likelihoods under both the stochastic block model and the degree-corrected model, using belief propagation. Applications to simulated and real networks show excellent agreement with our approximations. Our results thus both solve the practical problem of deciding on degree correction and point to a general approach to model selection in network analysis.

9.
Proc Natl Acad Sci U S A ; 110(52): 20935-40, 2013 Dec 24.
Artigo em Inglês | MEDLINE | ID: mdl-24277835

RESUMO

Spectral algorithms are classic approaches to clustering and community detection in networks. However, for sparse networks the standard versions of these algorithms are suboptimal, in some cases completely failing to detect communities even when other algorithms such as belief propagation can do so. Here, we present a class of spectral algorithms based on a nonbacktracking walk on the directed edges of the graph. The spectrum of this operator is much better-behaved than that of the adjacency matrix or other commonly used matrices, maintaining a strong separation between the bulk eigenvalues and the eigenvalues relevant to community structure even in the sparse case. We show that our algorithm is optimal for graphs generated by the stochastic block model, detecting communities all of the way down to the theoretical limit. We also show the spectrum of the nonbacktracking operator for some real-world networks, illustrating its advantages over traditional spectral clustering.


Assuntos
Algoritmos , Análise por Conglomerados , Interpretação Estatística de Dados , Modelos Estatísticos
10.
Phys Rev Lett ; 107(6): 065701, 2011 Aug 05.
Artigo em Inglês | MEDLINE | ID: mdl-21902340

RESUMO

We present an asymptotically exact analysis of the problem of detecting communities in sparse random networks generated by stochastic block models. Using the cavity method of statistical physics and its relationship to belief propagation, we unveil a phase transition from a regime where we can infer the correct group assignments of the nodes to one where these groups are undetectable. Our approach yields an optimal inference algorithm for detecting modules, including both assortative and disassortative functional modules, assessing their significance, and learning the parameters of the underlying block model. Our algorithm is scalable and applicable to real-world networks, as long as they are well described by the block model.

11.
J Chem Phys ; 134(3): 034512, 2011 Jan 21.
Artigo em Inglês | MEDLINE | ID: mdl-21261373

RESUMO

The following properties are in the present literature associated with the behavior of supercooled glass-forming liquids: faster than exponential growth of the relaxation time, dynamical heterogeneities, growing point-to-set correlation length, crossover from mean-field behavior to activated dynamics. In this paper we argue that these properties are also present in a much simpler situation, namely the melting of the bulk of an ordered phase beyond a first order phase transition point. This is a promising path toward a better theoretical, numerical and experimental understanding of the above phenomena and of the physics of supercooled liquids. We discuss in detail the analogies and the differences between the glass and the bulk melting transitions.


Assuntos
Temperatura Alta , Vidro/química , Teoria Quântica
12.
J Chem Phys ; 134(3): 034513, 2011 Jan 21.
Artigo em Inglês | MEDLINE | ID: mdl-21261374

RESUMO

There are deep analogies between the melting dynamics in systems with a first-order phase transition and the dynamics from equilibrium in super-cooled liquids. For a class of Ising spin models undergoing a first-order transition--namely p-spin models on the so-called Nishimori line--it can be shown that the melting dynamics can be exactly mapped to the equilibrium dynamics. In this mapping the dynamical--or mode-coupling--glass transition corresponds to the spinodal point, while the Kauzmann transition corresponds to the first-order phase transition itself. Both in mean field and finite dimensional models this mapping provides an exact realization of the random first-order theory scenario for the glass transition. The corresponding glassy phenomenology can then be understood in the framework of a standard first-order phase transition.


Assuntos
Temperatura Alta , Vidro/química , Teoria Quântica
13.
Phys Rev E Stat Nonlin Soft Matter Phys ; 84(6 Pt 2): 066106, 2011 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-22304154

RESUMO

In this paper we extend our previous work on the stochastic block model, a commonly used generative model for social and biological networks, and the problem of inferring functional groups or communities from the topology of the network. We use the cavity method of statistical physics to obtain an asymptotically exact analysis of the phase diagram. We describe in detail properties of the detectability-undetectability phase transition and the easy-hard phase transition for the community detection problem. Our analysis translates naturally into a belief propagation algorithm for inferring the group memberships of the nodes in an optimal way, i.e., that maximizes the overlap with the underlying group memberships, and learning the underlying parameters of the block model. Finally, we apply the algorithm to two examples of real-world networks and discuss its performance.

14.
Phys Rev Lett ; 104(20): 207206, 2010 May 21.
Artigo em Inglês | MEDLINE | ID: mdl-20867059

RESUMO

We present a study of the phase diagram of a random optimization problem in the presence of quantum fluctuations. Our main result is the characterization of the nature of the phase transition, which we find to be a first-order quantum phase transition. We provide evidence that the gap vanishes exponentially with the system size at the transition. This indicates that the quantum adiabatic algorithm requires a time growing exponentially with system size to find the ground state of this problem.

15.
Phys Rev Lett ; 104(20): 207208, 2010 May 21.
Artigo em Inglês | MEDLINE | ID: mdl-20867061

RESUMO

We show rigorously that the spin-glass susceptibility in the random field Ising model is always bounded by the ferromagnetic susceptibility, and therefore that no spin-glass phase can be present at equilibrium out of the ferromagnetic critical line. When the magnetization is, however, fixed to values smaller than the equilibrium value, a spin-glass phase can exist, as we show explicitly on the Bethe lattice.

16.
Phys Rev Lett ; 102(23): 238701, 2009 Jun 12.
Artigo em Inglês | MEDLINE | ID: mdl-19658978

RESUMO

We study constraint satisfaction problems on the so-called planted random ensemble. We show that for a certain class of problems, e.g., graph coloring, many of the properties of the usual random ensemble are quantitatively identical in the planted random ensemble. We study the structural phase transitions and the easy-hard-easy pattern in the average computational complexity. We also discuss the finite temperature phase diagram, finding a close connection with the liquid-glass-solid phenomenology.

17.
Phys Rev Lett ; 103(2): 025701, 2009 Jul 10.
Artigo em Inglês | MEDLINE | ID: mdl-19659220

RESUMO

Recent ideas based on the properties of assemblies of frictionless particles in mechanical equilibrium provide a perspective of amorphous systems different from that offered by the traditional approach originating in liquid theory. The relation, if any, between these two points of view, and the relevance of the former to the glass phase, has been difficult to ascertain. In this Letter, we introduce a model for which both theories apply strictly: it exhibits on the one hand an ideal glass transition and on the other "jamming" features (fragility, soft modes) virtually identical to that of real systems. This allows us to disentangle the two physical phenomena.

18.
Phys Rev Lett ; 101(16): 165702, 2008 Oct 17.
Artigo em Inglês | MEDLINE | ID: mdl-18999686

RESUMO

We study a lattice model of attractive colloids. It is exactly solvable on sparse random graphs. As the pressure and temperature are varied, it reproduces many characteristic phenomena of liquids, glasses, and colloidal systems such as ideal gel formation, liquid-glass phase coexistence, jamming, or the re-entrance of the glass transition.

19.
Phys Rev Lett ; 101(14): 147204, 2008 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-18851567

RESUMO

We study first-order quantum phase transitions in mean-field spin glasses. We solve the quantum random energy model using elementary methods and show that at the transition the eigenstate suddenly projects onto the unperturbed ground state and that the gap between the lowest states is exponentially small in the system size. We argue that this is a generic feature of all "random first-order" models, which includes benchmarks such as random satisfiability. We introduce a two-time instanton to calculate this gap in general, and discuss the consequences for quantum annealing.

20.
Phys Rev Lett ; 100(15): 159701; discussion 159702, 2008 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-18518165
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...