Entropy Rate of Diffusion Processes on Complex Networks
📝 Original Info
- Title: Entropy Rate of Diffusion Processes on Complex Networks
- ArXiv ID: 0712.0278
- Date: 2009-11-13
- Authors: Researchers mentioned in the ArXiv original paper
📝 Abstract
The concept of entropy rate for a dynamical process on a graph is introduced. We study diffusion processes where the node degrees are used as a local information by the random walkers. We describe analitically and numerically how the degree heterogeneity and correlations affect the diffusion entropy rate. In addition, the entropy rate is used to characterize complex networks from the real world. Our results point out how to design optimal diffusion processes that maximize the entropy for a given network structure, providing a new theoretical tool with applications to social, technological and communication networks.💡 Deep Analysis
This research explores the key findings and methodology presented in the paper: Entropy Rate of Diffusion Processes on Complex Networks.The concept of entropy rate for a dynamical process on a graph is introduced. We study diffusion processes where the node degrees are used as a local information by the random walkers. We describe analitically and numerically how the degree heterogeneity and correlations affect the diffusion entropy rate. In addition, the entropy rate is used to characterize complex networks from the real world. Our results point out how to design optimal diffusion processes that maximize the entropy for a given network structure, providing a new theoretical tool with applications to social, technological and communication networks.
📄 Full Content
The main theoretical and empirical interest in the study of complex networks is in understanding the relations between structure and function. Besides, many of the interaction dynamics that takes place in social, biological and technological systems can be analyzed in terms of diffusion processes on top of complex networks, e.g. data search and routing, information and disease spreading [4,5]. It is therefore of outmost importance to relate the properties of a diffusion process with the structure of the underlying network.
In this Letter, we show how to associate an entropy rate to a diffusion process on a graph. In particular, we consider processes such as biased random walks on the graph that can be represented as ergodic Markov chains. In this context, the entropy rate is a quantity more similar to the Kolmogorov-ÂSinai entropy rate of a dynamical system [12,13], than to the entropy of a statistical ensemble [1,4]. Differently from the network entropies previously defined, the entropy rate of diffusion processes depends both on the dynamical process (the kind of bias in the random walker) and on the graph topology. We provide the analytical expression that describes the entropy rate in scale-free networks as a function of the bias in the walk, and of the degree distribution and correlations. We show how the values of the entropy rate can provide useful information to characterize diffusion processes in real-world networks. In particular, a maximum value of entropy is found for different types of the bias in the diffusion processes, depending on the network structure.
Let us consider a connected undirected graph with N nodes (labelled as 1, 2, …, N ) and K links, described by the adjacency matrix A = {a ij } We limit our discussion to diffusion processes on the graph that can be represented as Markov chains [3]. In particular, we consider the case of biased random walks in which, at each time step, the walker at node i chooses one of the first neighbors of i, let say j, with a probability proportional to the power α (α ∈ R) of the degree k j . Such biased random walk corresponds to a time-invariant (the rule does not change in time) Markov chain with a transition probability matrix Π, with elements:
Notice that Π depends on either the graph topology and the kind of stochastic process we are considering. The exponent α allows to tune the dependence of the diffusion process on the nodes’ degree. When α = 0 we are introducing in the random movement of the particle a bias towards high-(α > 0) or low-degree (when α < 0) neighbors. On the other hand, when α = 0 the standard (unbiased) random walk is recovered. Since the walker must move from a node to somewhere, we have j π ji = 1, thus Π is a stochastic matrix. If w i (t) is the probability that the random walker is at node i at time t (with N i=1 w i (t) = 1 ∀t), then the probability w j (t + 1) of its being at j one step later is: w j (t+1) = i π ji w i (t). Writing the probabilities w i (t) as a N -dimensional column vector w(t) = ( w 1 (t), w 2 (t) . . . w N (t) ) ⊤ , the rule of the walk can be expressed in matricial form as: w(t + 1) = Πw(t). In the case of an undirected and connected network, the Perron-Frobenius theorem [14] assures that the dynamics described by Eq. ( 1) is an ergodic Markov chain [3]. This means that the Markov chain has a unique stationary distribution w * , such that lim t→∞ Π t w(0) = w * for any initial distribution w(0). In other words, any initial distribution of the random walker over the nodes of the graph will converge, under the dynamics of Eq. ( 1), to the same distribution w * .
The dynamical properties of the above diffusion processes over the graph can be accounted by evaluating the entropy rate of the associated Markov chain that, in the case of an ergodic Markov chain, is given by [3]:
The value of h measures how the entropy of the biased random walk grows with the number of hops. This means that we can practically represent the typical sequences of length n generated by the diffusion process by using approximately n • h information units. In different words, h measures the spreading of a set of independent random walkers, in terms of number of visited nodes.
To evaluate h for a given graph we need to calculate the stationary probability distribution w * . For this purpose, we consider the probability W i→j (t) of going from node i to node j in t time steps,
Since the network is undirected we have a ij = a ji ∀i, j.
Hence, the relation between the two probabilities W i→j (t) and W j→i (t) can be written as:
where c i = j a ij k α j . The above relation implies that for the stationary distribution w * the equation c i k α i w * j = c j k α j w * i holds, and hence w * reads:
By plugging expressions (1) and ( 5) into the definition of entropy (2), we finally get a closed form for the entropy rate of degree-biased random walks on the graph:
We notice that h depends on the the kind of bias in the random walker and also on the graph topology. In the following, we first evaluate analytically the entropy rate of unbiased and biased random walks on scale-free (SF) graphs with a powerlaw degree distribution P k ∼ k -γ , and γ > 2 [4,5]. Then, we study the entropy in networks from the real world. Unbiased Random Walks.-In the particular case α = 0, the transition probability reads π ji = a ij /k i , and the stationary distribution is easily obtained as: w * i = ki 2K . Substituting this expression in Eq. ( 2) and changing the sum over node indexes into a sum over degree classes, we can write the entropy rate of a unbiased random walk on a network with degree distribution P k as: In the case of SF networks of size N , the value of h can be easily expressed as a function of γ and N taking into account that the maximum degree of the network is k max ∼ k 0 N 1/(γ-1) , with k 0 being the minimum degree of a node. From Eq. ( 7), and approximating k as a continuum variable, we get:
.
The above expression diverges for SF networks when γ → 2. Conversely, when γ > 2 SF networks have a finite entropy in the thermodynamic limit: h(γ) = ln(k 0 ) + 1 γ-2 . In order to check the analytical results we have constructed ensembles of 10 2 SF networks with N = 10 5 nodes and different values of γ. We have obtained numerically the stationary distribution w * , and computed the entropy directly from Eq. ( 2). The results, averaged over the ensemble of networks, are reported in Fig. 1 as a function of γ. We notice a good agreement between numerics and Eq. (8).
Biased Random Walks.-Let us now concentrate on degreebiased diffusion (α = 0). In this case, the entropy rate of Eq. ( 6) can be re-written by changing again the sums over node indexes into sums over degree classes, as:
, and P k ′ ,k is the conditional probability that a link from a node of degree k ends in a node with degree k ′ . We notice that the entropy rate of biased random walks depends on the degree distribution of the network, P k , and on the conditional probabilities P k ′ ,k . In the particular case of a network with no degree-degree correlations we can write P k ′ ,k = kP k / k , and the expression for the entropy reduces to:
This expression only depends on the degree distribution of the network. For SF networks, we get in the the continuumdegree approximation:
.
When N → ∞, the entropy rate in SF networks with γ < 2 + α diverges as h ∼ ln(N ). On the other hand, when γ > 2+α, the entropy rate in the limit N → ∞ is finite and equal to:
Such an expression, valid in infinite size limit, shows a monotone growth of the entropy h(γ, α) with the degree-bias α, with h tending to infinity as α → (γ -2) -. More interestingly, the entropy rate in finite networks, Eq. ( 11), shows a single maximum at a value of α that depends on γ. This result is a consequence of the interplay between diffusion process and network topology. It indicates that, for a given network, is possible to maximize the entropy of the process by opportunely tuning the bias α of the walker.
To check the above analytical expressions we have computed numerically the entropy rate of degree-biased random walkers on computer-generated uncorrelated SF networks, as we did for the unbiased case. In Fig. 2.a we report the entropy rate as a function of the degree bias α for SF networks of size N = 10 5 . In Fig. 2.b we show the scaling of h with the system size N , in SF network with γ = 3. In both cases Eq. ( 11) is in good agreement with the numerical results reproducing the qualitative behavior of h as a function of α (being the global maximum of h well reproduced) and N (being both the divergence of h, for α > γ -2, and the asymptotic finite value of h, when α < γ -2, correctly reproduced) [15].
Real Networks.-Up to now, we focused on the entropy rate of biased random walks on SF networks. However, real networks are not perfect scale-free and, more importantly, show additional important structural properties such as degree-degree correlations, motifs and community structures [4,5]. Now we propose to characterize a real network by studying different diffusion processes on top of it, and finding the optimal value of the bias that maximizes the entropy. As reference system, we compare the entropy rate h of α-biased random walks on the network, with the entropy rate h Rand obtained, from Eq. ( 10), for a randomized version of the network, with the same degree sequence of the real one [16]. For this purpose, we have analyzed 10 different networks reported in Table I, corresponding to thre different functional classes where diffusion of data, rumors, viruses and diseases, takes place, namely (i) transportation, (ii) technological/communication and (iii) social networks.
In Fig. 3 we report, for six of the networks, the results obtained as a function of the bias parameter α. Two different behaviors emerge clearly for α > 0 [26], namely the entropy of the real network h is either larger or smaller than h rand for all the range of positive values of α. In table I we summarize this result by reporting the ratio h/h Rand for α = 1 (linear bias). We found that social networks have always h > h Rand , while the other networks have h < h Rand , with the exception of Internet routers. This difference in the entropy rate has its roots mainly on the different types of degree-degree correlations of the network, and points out that assortativity facilitates the spread of the diffusion. The optimal degree-bias, α opt that produces the maximal entropy rate is also reported in Table I. The results indicate that for assortative networks (e.g. social networks) the maximal entropy rate is obtained with a super-linear diffusion, while for dissasortative networks α opt is located in the sub-linear bias region.
FIG. 3: Entropy rate h for α-biased random walks on six of the networks in Table I (filled circles). Such entropy rate is compared with that obtained for the randomized version of the network (full circles). Both entropy rates are shown as a function of the degree-bias parameter α.
Summing up, in this Letter, we have introduced the entropy rate of degree-biased random walks on networks, a measure that is particularly suited to capture the interplay between network structure and diffusion dynamics. We have studied the dependence of the entropy rate with the topology of synthetic and real networks, in particular with the heterogeneity of degree distributions and the nature of the degree-degree correlations. The results indicate how it is possible to tune the bias in the random walk in order to maximize the entropy rate on a given topology. The method introduced can find useful applications to cases where diffusion in complex networks is the mechanism at work, such as in the search of efficient algo-rithms for data search in the WWW, in the improvement of information dissemination in social networks, or in the design of large impact virus/antivirus spreading in computer networks. The approach adopted here can be easily generalized to other types of diffusion processes and to more general network topologies, such as weighted graphs and also, with some appropriate modifications to directed and unconnected graphs.
📸 Image Gallery
