Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 11 de 11
Filter
Add more filters










Publication year range
1.
Phys Rev E Stat Nonlin Soft Matter Phys ; 86(1 Pt 2): 016107, 2012 Jul.
Article in English | MEDLINE | ID: mdl-23005491

ABSTRACT

Modularity maximization is extensively used to detect communities in complex networks. It has been shown, however, that this method suffers from a resolution limit: Small communities may be undetectable in the presence of larger ones even if they are very dense. To alleviate this defect, various modifications of the modularity function have been proposed as well as multiresolution methods. In this paper we systematically study a simple model (proposed by Pons and Latapy [Theor. Comput. Sci. 412, 892 (2011)] and similar to the parametric model of Reichardt and Bornholdt [Phys. Rev. E 74, 016110 (2006)]) with a single parameter α that balances the fraction of within community edges and the expected fraction of edges according to the configuration model. An exact algorithm is proposed to find optimal solutions for all values of α as well as the corresponding successive intervals of α values for which they are optimal. This algorithm relies upon a routine for exact modularity maximization and is limited to moderate size instances. An agglomerative hierarchical heuristic is therefore proposed to address parametric modularity detection in large networks. At each iteration the smallest value of α for which it is worthwhile to merge two communities of the current partition is found. Then merging is performed and the data are updated accordingly. An implementation is proposed with the same time and space complexity as the well-known Clauset-Newman-Moore (CNM) heuristic [Phys. Rev. E 70, 066111 (2004)]. Experimental results on artificial and real world problems show that (i) communities are detected by both exact and heuristic methods for all values of the parameter α; (ii) the dendrogram summarizing the results of the heuristic method provides a useful tool for substantive analysis, as illustrated particularly on a Les Misérables data set; (iii) the difference between the parametric modularity values given by the exact method and those given by the heuristic is moderate; (iv) the heuristic version of the proposed parametric method, viewed as a modularity maximization tool, gives better results than the CNM heuristic for large instances.


Subject(s)
Algorithms , Models, Statistical , Social Support , Computer Simulation
2.
Phys Rev E Stat Nonlin Soft Matter Phys ; 85(4 Pt 2): 046113, 2012 Apr.
Article in English | MEDLINE | ID: mdl-22680544

ABSTRACT

Finding communities, or clusters or modules, in networks can be done by optimizing an objective function defined globally and/or by specifying conditions which must be satisfied by all communities. Radicchi et al. [Proc. Natl. Acad. Sci. USA 101, 2658 (2004)] define a susbset of vertices of a network to be a community in the strong sense if each vertex of that subset has a larger inner degree than its outer degree. A partition in the strong sense has only strong communities. In this paper we first define an enumerative algorithm to list all partitions in the strong sense of a network of moderate size. The results of this algorithm are given for the Zachary karate club data set, which is solved by hand, as well as for several well-known real-world problems of the literature. Moreover, this algorithm is slightly modified in order to apply it to larger networks, keeping only partitions with the largest number of communities. It is shown that some of the partitions obtained are informative, although they often have only a few communities, while they fail to give any information in other cases having only one community. It appears that degree 2 vertices play a big role in forcing large inhomogeneous communities. Therefore, a weakening of the strong condition is proposed and explored: we define a partition in the almost-strong sense by substituting a nonstrict inequality to a strict one in the definition of strong community for all vertices of degree 2. Results, for the same set of problems as before, then give partitions with a larger number of communities and are more informative.


Subject(s)
Biophysics/methods , Interpersonal Relations , Population Dynamics , Residence Characteristics , Algorithms , Humans , Likelihood Functions , Models, Biological , Models, Statistical , Models, Theoretical , Reproducibility of Results , Social Behavior , Social Support
3.
Phys Rev E Stat Nonlin Soft Matter Phys ; 84(5 Pt 2): 058101, 2011 Nov.
Article in English | MEDLINE | ID: mdl-22181549

ABSTRACT

In a recent paper, Zhan, Zhang, Guan, and Zhou [Phys. Rev. E 83, 066120 (2011)] presented a modified adaptive genetic algorithm (MAGA) tailored to the discovery of maximum modularity partitions of the node set into communities in unipartite, bipartite, and directed networks. The authors claim that "detection of communities in unipartite networks or in directed networks can be transformed into the same task in bipartite networks." Actually, some tests show that it is not the case for the proposed transformations, and why. Experimental results of MAGA for modularity maximization of untransformed unipartite or bipartite networks are also discussed.


Subject(s)
Algorithms , Biological Evolution
4.
Phys Rev E Stat Nonlin Soft Matter Phys ; 83(5 Pt 2): 056105, 2011 May.
Article in English | MEDLINE | ID: mdl-21728603

ABSTRACT

Community detection in networks based on modularity maximization is currently done with hierarchical divisive or agglomerative as well as partitioning heuristics, hybrids, and, in a few papers, exact algorithms. We consider here the case of hierarchical networks in which communities should be detected and propose a divisive heuristic which is locally optimal in the sense that each of the successive bipartitions is done in a provably optimal way. This heuristic is compared with the spectral-based hierarchical divisive heuristic of Newman [Proc. Natl. Acad. Sci. USA 103, 8577 (2006).] and with the hierarchical agglomerative heuristic of Clauset, Newman, and Moore [Phys. Rev. E 70, 066111 (2004).]. Computational results are given for a series of problems of the literature with up to 4941 vertices and 6594 edges. They show that the proposed divisive heuristic gives better results than the divisive heuristic of Newman and than the agglomerative heuristic of Clauset et al.


Subject(s)
Data Interpretation, Statistical , Models, Theoretical
5.
Phys Rev E Stat Nonlin Soft Matter Phys ; 81(4 Pt 2): 046102, 2010 Apr.
Article in English | MEDLINE | ID: mdl-20481781

ABSTRACT

The modularity maximization model proposed by Newman and Girvan for the identification of communities in networks works for general graphs possibly with loops and multiple edges. However, the applications usually correspond to simple graphs. These graphs are compared to a null model where the degree distribution is maintained but edges are placed at random. Therefore, in this null model there will be loops and possibly multiple edges. Sharp bounds on the expected number of loops, and their impact on the modularity, are derived. Then, building upon the work of Massen and Doye, but using algebra rather than simulation, we propose modified null models associated with graphs without loops but with multiple edges, graphs with loops but without multiple edges and graphs without loops nor multiple edges. We validate our models by using the exact algorithm for clique partitioning of Grötschel and Wakabayashi.


Subject(s)
Models, Theoretical , Algorithms , Animals , Mathematics
6.
Phys Rev E Stat Nonlin Soft Matter Phys ; 81(2 Pt 2): 026105, 2010 Feb.
Article in English | MEDLINE | ID: mdl-20365629

ABSTRACT

A hierarchical divisive algorithm is proposed for identifying communities in complex networks. To that effect, the definition of community in the weak sense of Radicchi [Proc. Natl. Acad. Sci. U.S.A. 101, 2658 (2004)] is extended into a criterion for a bipartition to be optimal: one seeks to maximize the minimum for both classes of the bipartition of the ratio of inner edges to cut edges. A mathematical program is used within a dichotomous search to do this in an optimal way for each bipartition. This includes an exact solution of the problem of detecting indivisible communities. The resulting hierarchical divisive algorithm is compared with exact modularity maximization on both artificial and real world data sets. For two problems of the former kind optimal solutions are found; for five problems of the latter kind the edge ratio algorithm always appears to be competitive. Moreover, it provides additional information in several cases, notably through the use of the dendrogram summarizing the resolution. Finally, both algorithms are compared on reduced versions of the data sets of Girvan and Newman [Proc. Natl. Acad. Sci. U.S.A. 99, 7821 (2002)] and of Lancichinetti [Phys. Rev. E 78, 046110 (2008)]. Results for these instances appear to be comparable.


Subject(s)
Algorithms , Animals , Behavior, Animal , Cluster Analysis , Dolphins/physiology , Models, Biological
7.
Phys Rev E Stat Nonlin Soft Matter Phys ; 82(4 Pt 2): 046112, 2010 Oct.
Article in English | MEDLINE | ID: mdl-21230350

ABSTRACT

Finding modules, or clusters, in networks currently attracts much attention in several domains. The most studied criterion for doing so, due to Newman and Girvan [Phys. Rev. E 69, 026113 (2004)], is modularity maximization. Many heuristics have been proposed for maximizing modularity and yield rapidly near optimal solution or sometimes optimal ones but without a guarantee of optimality. There are few exact algorithms, prominent among which is a paper by Xu [Eur. Phys. J. B 60, 231 (2007)]. Modularity maximization can also be expressed as a clique partitioning problem and the row generation algorithm of Grötschel and Wakabayashi [Math. Program. 45, 59 (1989)] applied. We propose to extend both of these algorithms using the powerful column generation methods for linear and non linear integer programming. Performance of the four resulting algorithms is compared on problems from the literature. Instances with up to 512 entities are solved exactly. Moreover, the computing time of previously solved problems are reduced substantially.

8.
J Chem Inf Model ; 45(2): 222-30, 2005.
Article in English | MEDLINE | ID: mdl-15807482

ABSTRACT

Chemical graphs, as other ones, are regular if all their vertices have the same degree. Otherwise, they are irregular, and it is of interest to measure their irregularity both for descriptive purposes and for QSAR/QSPR studies. Three indices have been proposed in the literature for that purpose: those of Collatz-Sinogowitz, of Albertson, and of Bell's variance of degrees. We study their properties for the case of chemical trees. Structural conjectures are generated with the system AutoGraphiX, and most of them proved later by mathematical means. Analytical expressions for extremal values are obtained, and extremal graphs are characterized for the two last indices.

9.
Bull Math Biol ; 65(6): 1025-51, 2003 Nov.
Article in English | MEDLINE | ID: mdl-14607287

ABSTRACT

The goal of generalized logical analysis is to model complex biological systems, especially so-called regulatory systems, such as genetic networks. This theory is mainly characterized by its capacity to find all the steady states of a given system and the functional positive and negative circuits, which generate multistationarity and a cycle in the state sequence graph, respectively. So far, this has been achieved by exhaustive enumeration, which severely limits the size of the systems that can be analysed. In this paper, we introduce a mathematical function, called image function, which allows the calculation of the value of the logical parameter associated with a logical variable depending on the state of the system. Thus the state table of the system is represented analytically. We then show how all steady states can be derived as solutions to a system of steady-state equations. Constraint programming, a recent method for solving constraint satisfaction problems, is applied for that purpose. To illustrate the potential of our approach, we present results from computer experiments carried out on very large randomly-generated systems (graphs) with hundreds, or even thousands, of interacting components, and show that these systems can be solved using moderate computing time. Moreover, we illustrate the approach through two published applications, one of which concerns the computation times of all steady states for a large genetic network.


Subject(s)
Models, Genetic , Arabidopsis/anatomy & histology , Arabidopsis/genetics , Computational Biology , Escherichia coli/genetics , Flowers/anatomy & histology , Flowers/genetics , Operon/genetics
10.
J Chem Inf Comput Sci ; 43(3): 842-51, 2003.
Article in English | MEDLINE | ID: mdl-12767142

ABSTRACT

After a short historic review, we briefly describe a new algorithm for constructive enumeration of polyhex and fusene hydrocarbons. In this process our algorithm also enumerates isomers and symmetry groups of molecules (which implies enumeration of enantiomers). Contrary to previous methods often based on the boundary code or its variants (which records orientation of edges along the boundary) or on the DAST code, which uses a rigid dualist graph (whose vertices are associated with faces and edges with adjacency between them), the proposed algorithm proceeds in two phases. First inner dual graphs are enumerated; then molecules obtained from each of them by specifying angles between adjacent edges are obtained. Favorable computational results are reported. The new algorithm is so fast that output of the structures is by far the most time-consuming part of the process. It thus contributes to enumeration in chemistry, a topic studied for over a century, and is useful in library making, QSAR/QSPR, and synthesis studies.

11.
J Chem Inf Comput Sci ; 43(1): 1-14, 2003.
Article in English | MEDLINE | ID: mdl-12546532

ABSTRACT

Recently, Araujo and De la Peña gave bounds for the connectivity index of chemical trees as a function of this index for general trees and the ramification index of trees. They also gave bounds for the connectivity index of chemical graphs as a function of this index for maximal subgraphs which are trees and the cyclomatic number of the graphs. The ramification index of a tree is first shown to be equal to the number of pending vertices minus 2. Then, in view of extremal graphs obtained with the system AutoGraphiX, all bounds of Araujo and De la Peña are improved, yielding tight bounds, and in one case corrected. Moreover, chemical trees of a given order and a number of pending vertices with minimum and with maximum connectivity index are characterized.

SELECTION OF CITATIONS
SEARCH DETAIL
...