Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 10 de 10
Filter
Add more filters










Publication year range
1.
Phys Rev E ; 109(5-1): 054131, 2024 May.
Article in English | MEDLINE | ID: mdl-38907500

ABSTRACT

We consider random hyperbolic graphs in hyperbolic spaces of any dimension d+1≥2. We present a rescaling of model parameters that casts the random hyperbolic graph model of any dimension to a unified mathematical framework, leaving the degree distribution invariant with respect to the dimension. Unlike the degree distribution, clustering does depend on the dimension, decreasing to 0 at d→∞. We analyze all of the other limiting regimes of the model, and we release a software package that generates random hyperbolic graphs and their limits in hyperbolic spaces of any dimension.

2.
Article in English | MEDLINE | ID: mdl-26382454

ABSTRACT

We introduce and explore a method for inferring hidden geometric coordinates of nodes in complex networks based on the number of common neighbors between the nodes. We compare this approach to the HyperMap method, which is based only on the connections (and disconnections) between the nodes, i.e., on the links that the nodes have (or do not have). We find that for high degree nodes, the common-neighbors approach yields a more accurate inference than the link-based method, unless heuristic periodic adjustments (or "correction steps") are used in the latter. The common-neighbors approach is computationally intensive, requiring O(t4) running time to map a network of t nodes, versus O(t3) in the link-based method. But we also develop a hybrid method with O(t3) running time, which combines the common-neighbors and link-based approaches, and we explore a heuristic that reduces its running time further to O(t2), without significant reduction in the mapping accuracy. We apply this method to the autonomous systems (ASs) Internet, and we reveal how soft communities of ASs evolve over time in the similarity space. We further demonstrate the method's predictive power by forecasting future links between ASs. Taken altogether, our results advance our understanding of how to efficiently and accurately map real networks to their latent geometric spaces, which is an important necessary step toward understanding the laws that govern the dynamics of nodes in these spaces, and the fine-grained dynamics of network connections.

3.
Bioinformatics ; 30(7): 1041-2, 2014 Apr 01.
Article in English | MEDLINE | ID: mdl-24363381

ABSTRACT

SUMMARY: Detecting communities and densely connected groups may contribute to unravel the underlying relationships among the units present in diverse biological networks (e.g. interactomes, coexpression networks, ecological networks). We recently showed that communities can be precisely characterized by maximizing Surprise, a global network parameter. Here, we present SurpriseMe, a tool that integrates the outputs of seven of the best algorithms available to estimate the maximum Surprise value. SurpriseMe also generates distance matrices that allow visualizing the relationships among the solutions generated by the algorithms. We show that the communities present in small- and medium-sized networks, with up to 10 000 nodes, can be easily characterized: on standard PC computers, these analyses take less than an hour. Also, four of the algorithms may rapidly analyze networks with up to 100 000 nodes, given enough memory resources. Because of its performance and simplicity, SurpriseMe is a reference tool for community structure characterization. AVAILABILITY AND IMPLEMENTATION: SurpriseMe is implemented in Perl and C/C++. It compiles and runs on any UNIX-based operating system, including Linux and Mac OS/X, using standard libraries. The source code is freely and publicly available under the GPL 3.0 license at http://github.com/raldecoa/SurpriseMe/releases.


Subject(s)
Software , Algorithms , Neural Networks, Computer , Programming Languages , Time Factors
4.
Sci Rep ; 3: 2216, 2013.
Article in English | MEDLINE | ID: mdl-23860510

ABSTRACT

The characterization of network community structure has profound implications in several scientific areas. Therefore, testing the algorithms developed to establish the optimal division of a network into communities is a fundamental problem in the field. We performed here a highly detailed evaluation of community detection algorithms, which has two main novelties: 1) using complex closed benchmarks, which provide precise ways to assess whether the solutions generated by the algorithms are optimal; and, 2) A novel type of analysis, based on hierarchically clustering the solutions suggested by multiple community detection algorithms, which allows to easily visualize how different are those solutions. Surprise, a global parameter that evaluates the quality of a partition, confirms the power of these analyses. We show that none of the community detection algorithms tested provide consistently optimal results in all networks and that Surprise maximization, obtained by combining multiple algorithms, obtains quasi-optimal performances in these difficult benchmarks.

5.
Sci Rep ; 3: 1060, 2013.
Article in English | MEDLINE | ID: mdl-23320141

ABSTRACT

How to determine the community structure of complex networks is an open question. It is critical to establish the best strategies for community detection in networks of unknown structure. Here, using standard synthetic benchmarks, we show that none of the algorithms hitherto developed for community structure characterization perform optimally. Significantly, evaluating the results according to their modularity, the most popular measure of the quality of a partition, systematically provides mistaken solutions. However, a novel quality function, called Surprise, can be used to elucidate which is the optimal division into communities. Consequently, we show that the best strategy to find the community structure of all the networks examined involves choosing among the solutions provided by multiple algorithms the one with the highest Surprise value. We conclude that Surprise maximization precisely reveals the community structure of complex networks.

6.
Phys Rev E Stat Nonlin Soft Matter Phys ; 85(2 Pt 2): 026109, 2012 Feb.
Article in English | MEDLINE | ID: mdl-22463281

ABSTRACT

Characterizing the community structure of complex networks is a key challenge in many scientific fields. Very diverse algorithms and methods have been proposed to this end, many working reasonably well in specific situations. However, no consensus has emerged on which of these methods is the best to use in practice. In part, this is due to the fact that testing their performance requires the generation of a comprehensive, standard set of synthetic benchmarks, a goal not yet fully achieved. Here, we present a type of benchmark that we call "closed," in which an initial network of known community structure is progressively converted into a second network whose communities are also known. This approach differs from all previously published ones, in which networks evolve toward randomness. The use of this type of benchmark allows us to monitor the transformation of the community structure of a network. Moreover, we can predict the optimal behavior of the variation of information, a measure of the quality of the partitions obtained, at any moment of the process. This enables us in many cases to determine the best partition among those suggested by different algorithms. Also, since any network can be used as a starting point, extensive studies and comparisons can be performed using a heterogeneous set of structures, including random ones. These properties make our benchmarks a general standard for comparing community detection algorithms.


Subject(s)
Models, Theoretical , Algorithms , Benchmarking
7.
PLoS One ; 6(9): e24195, 2011.
Article in English | MEDLINE | ID: mdl-21909420

ABSTRACT

The analysis of complex networks permeates all sciences, from biology to sociology. A fundamental, unsolved problem is how to characterize the community structure of a network. Here, using both standard and novel benchmarks, we show that maximization of a simple global parameter, which we call Surprise (S), leads to a very efficient characterization of the community structure of complex synthetic networks. Particularly, S qualitatively outperforms the most commonly used criterion to define communities, Newman and Girvan's modularity (Q). Applying S maximization to real networks often provides natural, well-supported partitions, but also sometimes counterintuitive solutions that expose the limitations of our previous knowledge. These results indicate that it is possible to define an effective global criterion for community structure and open new routes for the understanding of complex networks.


Subject(s)
Community Networks , Models, Theoretical , Algorithms , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/metabolism
8.
Cell ; 143(6): 991-1004, 2010 Dec 10.
Article in English | MEDLINE | ID: mdl-21145464

ABSTRACT

To understand relationships between phosphorylation-based signaling pathways, we analyzed 150 deletion mutants of protein kinases and phosphatases in S. cerevisiae using DNA microarrays. Downstream changes in gene expression were treated as a phenotypic readout. Double mutants with synthetic genetic interactions were included to investigate genetic buffering relationships such as redundancy. Three types of genetic buffering relationships are identified: mixed epistasis, complete redundancy, and quantitative redundancy. In mixed epistasis, the most common buffering relationship, different gene sets respond in different epistatic ways. Mixed epistasis arises from pairs of regulators that have only partial overlap in function and that are coupled by additional regulatory links such as repression of one by the other. Such regulatory modules confer the ability to control different combinations of processes depending on condition or context. These properties likely contribute to the evolutionary maintenance of paralogs and indicate a way in which signaling pathways connect for multiprocess control.


Subject(s)
Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Signal Transduction , Epistasis, Genetic , Gene Expression Profiling , Phosphoric Monoester Hydrolases/genetics , Phosphoric Monoester Hydrolases/metabolism , Phosphorylation , Phosphotransferases/genetics , Phosphotransferases/metabolism
9.
PLoS One ; 5(7): e11585, 2010 Jul 14.
Article in English | MEDLINE | ID: mdl-20644733

ABSTRACT

BACKGROUND: How to extract useful information from complex biological networks is a major goal in many fields, especially in genomics and proteomics. We have shown in several works that iterative hierarchical clustering, as implemented in the UVCluster program, is a powerful tool to analyze many of those networks. However, the amount of computation time required to perform UVCluster analyses imposed significant limitations to its use. METHODOLOGY/PRINCIPAL FINDINGS: We describe the suite Jerarca, designed to efficiently convert networks of interacting units into dendrograms by means of iterative hierarchical clustering. Jerarca is divided into three main sections. First, weighted distances among units are computed using up to three different approaches: a more efficient version of UVCluster and two new, related algorithms called RCluster and SCluster. Second, Jerarca builds dendrograms based on those distances, using well-known phylogenetic algorithms, such as UPGMA or Neighbor-Joining. Finally, Jerarca provides optimal partitions of the trees using statistical criteria based on the distribution of intra- and intercluster connections. Outputs compatible with the phylogenetic software MEGA and the Cytoscape package are generated, allowing the results to be easily visualized. CONCLUSIONS/SIGNIFICANCE: THE FOUR MAIN ADVANTAGES OF JERARCA IN RESPECT TO UVCLUSTER ARE: 1) Improved speed of a novel UVCluster algorithm; 2) Additional, alternative strategies to perform iterative hierarchical clustering; 3) Automatic evaluation of the hierarchical trees to obtain optimal partitions; and, 4) Outputs compatible with popular software such as MEGA and Cytoscape.


Subject(s)
Cluster Analysis , Computational Biology/methods , Signal Transduction/physiology , Algorithms , Software
10.
BMC Genomics ; 11: 169, 2010 Mar 12.
Article in English | MEDLINE | ID: mdl-20226017

ABSTRACT

BACKGROUND: In Drosophila melanogaster, dosage compensation is mediated by the action of the dosage compensation complex (DCC). How the DCC recognizes the fly X chromosome is still poorly understood. Characteristic sequence signatures at all DCC binding sites have not hitherto been found. RESULTS: In this study, we compare the known binding sites of the DCC with oligonucleotide profiles that measure the specificity of the sequences of the D. melanogaster X chromosome. We show that the X chromosome regions bound by the DCC are enriched for a particular type of short, repetitive sequences. Their distribution suggests that these sequences contribute to chromosome recognition, the generation of DCC binding sites and/or the local spreading of the complex. Comparative data indicate that the same sequences may be involved in dosage compensation in other Drosophila species. CONCLUSIONS: These results offer an explanation for the wild-type binding of the DCC along the Drosophila X chromosome, contribute to delineate the forces leading to the establishment of dosage compensation and suggest new experimental approaches to understand the precise biochemical features of the dosage compensation system.


Subject(s)
Dosage Compensation, Genetic , Drosophila melanogaster/genetics , Repetitive Sequences, Nucleic Acid , X Chromosome/genetics , Animals , Binding Sites/genetics , Conserved Sequence , Evolution, Molecular , Gene Expression Profiling , Sequence Analysis, DNA
SELECTION OF CITATIONS
SEARCH DETAIL
...