
Network entropy

From Wikipedia, the free encyclopedia

In network science, the network entropy is a disorder measure derived from information theory to describe the level of randomness and the amount of information encoded in a graph.[1] It is a relevant metric to quantitatively characterize real complex networks, and it can also be used to quantify network complexity.[1][2] There are several formulations in which to measure the network entropy; as a rule, they all require a particular property of the graph to be focused on, such as the adjacency matrix, degree sequence, degree distribution or number of bifurcations, which may lead to values of entropy that are not invariant to the chosen network description.[3]

Formulations

Degree Distribution Shannon Entropy

The Shannon entropy can be measured for the network degree probability distribution as an average measurement of the heterogeneity of the network,

\mathcal{H} = -\sum_{k} P(k) \ln P(k),

where P(k) is the fraction of nodes in the network with degree k.
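As an illustrative sketch (not from the cited sources), this measure can be computed with the numpy and networkx libraries; the helper name degree_distribution_entropy and the example graphs are assumptions for demonstration:

    import numpy as np
    import networkx as nx

    def degree_distribution_entropy(G):
        """Shannon entropy of the degree distribution P(k) of graph G."""
        degrees = [d for _, d in G.degree()]
        counts = np.bincount(degrees)  # occurrences of each degree k
        p = counts / counts.sum()      # P(k) as relative frequencies
        p = p[p > 0]                   # 0 ln 0 is taken as 0
        return -np.sum(p * np.log(p))

    # A heterogeneous (scale-free) graph yields a higher degree entropy
    # than a homogeneous regular graph of the same size.
    print(degree_distribution_entropy(nx.barabasi_albert_graph(1000, 3, seed=42)))
    print(degree_distribution_entropy(nx.random_regular_graph(6, 1000, seed=42)))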

This formulation has limited use with regard to complexity, information content, causation and temporal information. Be that as it may, algorithmic complexity has the ability to characterize any general or universal property of a graph or network, and it is proven that graphs with low entropy have low algorithmic complexity, because the statistical regularities found in a graph are useful for computer programs to recreate it. The same cannot be said for high-entropy networks, though, as these might take any value of algorithmic complexity.[3]

Random Walker Shannon Entropy

Due to the limits of the previous formulation, it is possible to take a different approach while keeping the use of the original Shannon entropy equation.

Consider a random walker that travels around the graph, going from node i to any node adjacent to i with equal probability. The probability p_{ij} that describes the behavior of this random walker would thus be

p_{ij} = \frac{A_{ij}}{k_i},

where A is the graph adjacency matrix and k_i is the degree of node i.
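A minimal sketch of this transition matrix, assuming the networkx and numpy libraries and a graph without isolated nodes (the example graph is an assumption, not from the source):

    import numpy as np
    import networkx as nx

    G = nx.karate_club_graph()
    A = nx.to_numpy_array(G)   # adjacency matrix A_ij
    k = A.sum(axis=1)          # node degrees k_i
    P = A / k[:, None]         # p_ij = A_ij / k_i; each row sums to 1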

From that, the Shannon entropy \mathcal{S}_i of each node can be defined as

\mathcal{S}_i = -\sum_{j=1}^{N} p_{ij} \ln p_{ij} = -\sum_{j=1}^{N} \frac{A_{ij}}{k_i} \ln \frac{A_{ij}}{k_i} = \ln k_i

and, since \max(k_i) = N - 1, the normalized node entropy is calculated as

\mathcal{H}_i = \frac{\mathcal{S}_i}{\max(\mathcal{S}_i)} = \frac{\ln k_i}{\ln(\max(k_i))} = \frac{\ln k_i}{\ln(N - 1)}

This leads to a normalized network entropy \mathcal{H}, calculated by averaging the normalized node entropy over the whole network:[4]

\mathcal{H} = \frac{1}{N} \sum_{i=1}^{N} \mathcal{H}_i = \frac{1}{N \ln(N - 1)} \sum_{i=1}^{N} \ln k_i

The normalized network entropy is maximal, \mathcal{H} = 1, when the network is fully connected, and it decreases the sparser the network becomes. Notice that isolated nodes, k_i = 0, do not have their probability p_{ij} defined and, therefore, are not considered when measuring the network entropy. This formulation of network entropy has low sensitivity to hubs due to the logarithmic factor and is more meaningful for weighted networks,[4] which ultimately makes it hard to differentiate scale-free networks using this measure alone.[2]
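A sketch of this normalized entropy, assuming the networkx and numpy libraries (the helper name and example graphs are illustrative assumptions):

    import numpy as np
    import networkx as nx

    def normalized_network_entropy(G):
        """H = mean of ln(k_i) / ln(N - 1), skipping isolated nodes."""
        N = G.number_of_nodes()
        k = np.array([d for _, d in G.degree()], dtype=float)
        k = k[k > 0]  # isolated nodes are not considered
        return np.mean(np.log(k)) / np.log(N - 1)

    print(normalized_network_entropy(nx.complete_graph(100)))                 # 1.0
    print(normalized_network_entropy(nx.gnp_random_graph(100, 0.05, seed=1))) # < 1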

Random Walker Kolmogorov–Sinai Entropy

The limitations of the random walker Shannon entropy can be overcome by adapting it to use a Kolmogorov–Sinai entropy. In this context, network entropy is the entropy of a stochastic matrix associated with the graph adjacency matrix A, and the random walker Shannon entropy is called the dynamic entropy of the network. From that, let \lambda be the dominant eigenvalue of A. It is proven that \ln \lambda satisfies a variational principle[5] that is equivalent to the dynamic entropy for unweighted networks, i.e., when the adjacency matrix consists exclusively of boolean values. Therefore, the topological entropy is defined as

\mathcal{H} = \ln \lambda

This formulation is important to the study of network robustness, i.e., the capacity of the network to withstand random structural changes. Robustness is actually difficult to measure numerically, whereas the entropy can be easily calculated for any network, which is especially important in the context of non-stationary networks. The entropic fluctuation theorem shows that this entropy is positively correlated to robustness and hence to a greater insensitivity of an observable to dynamic or structural perturbations of the network. Moreover, the eigenvalues are inherently related to the multiplicity of internal pathways, leading to a negative correlation between the topological entropy and the average shortest path length.[6]
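A sketch of the topological entropy under these definitions, assuming the networkx and numpy libraries (a dense eigendecomposition is used for brevity; for large graphs a sparse eigensolver would be preferable):

    import numpy as np
    import networkx as nx

    def topological_entropy(G):
        """H = ln(lambda), with lambda the dominant eigenvalue of the adjacency matrix."""
        A = nx.to_numpy_array(G)              # unweighted (boolean) adjacency matrix
        lam = max(np.linalg.eigvals(A).real)  # Perron-Frobenius eigenvalue
        return np.log(lam)

    # Denser networks offer more internal pathways, hence a larger lambda
    # and a higher topological entropy.
    print(topological_entropy(nx.gnp_random_graph(200, 0.05, seed=1)))
    print(topological_entropy(nx.gnp_random_graph(200, 0.20, seed=1)))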

Other than that, the Kolmogorov–Sinai entropy is related to the Ricci curvature of the network,[7] a metric that has been used to differentiate stages of cancer from gene co-expression networks,[8] as well as to give hallmarks of financial crashes from stock correlation networks.[9]

Maximum Entropy Principle

The maximum entropy principle is a variational principle stating that the probability distribution best representing the current state of a system is the one which maximizes the Shannon entropy.[10] This concept can be used to generate an ensemble of random graphs with given structural properties derived from the maximum entropy approach which, in its turn, describes the most probable network configuration: the maximum entropy principle allows for maximally unbiased information when complete knowledge is lacking (the microscopic configuration is not accessible, e.g., we don't know the adjacency matrix). On the other hand, this ensemble serves as a null model when the actual microscopic configuration of the network is known, allowing one to assess the significance of empirical patterns found in the network.[11]

Complex Network Ensembles

It is possible to extend the network entropy formulations to instead measure the ensemble entropy. Let \mathcal{G} be an ensemble of all graphs G with the same number N of nodes and type of links, and let P(G) be the occurrence probability of a graph G \in \mathcal{G}. From the maximum entropy principle, it is desired to achieve P(G) such that it maximizes the Shannon entropy

\mathcal{S} = -\sum_{G \in \mathcal{G}} P(G) \ln P(G)

Moreover, constraints shall be imposed in order to extract conclusions. Soft constraints (constraints set over the whole ensemble \mathcal{G}) lead to the canonical ensemble, whereas hard constraints (constraints set over each graph G) define the microcanonical ensemble. In the end, these ensembles lead to models that can validate a range of network patterns and even reconstruct graphs from limited knowledge, showing that maximum entropy models based on local constraints can quantify the relevance of a set of observed features and extract meaningful information from huge real-time data streams and high-dimensional noisy data.[11]
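As a hedged sketch (not from the cited sources): with a single soft constraint on the expected number of links, the maximum entropy canonical ensemble reduces to the Erdős–Rényi model G(N, p), where every possible link is an independent Bernoulli variable. The helper name and parameters below are illustrative assumptions, using the networkx library:

    import networkx as nx

    def canonical_ensemble_sample(N, L_target, n_samples=100, seed=0):
        """Sample graphs with N nodes whose *expected* link count is L_target.

        Only the expectation is constrained (a soft constraint), so the
        realized number of links fluctuates from sample to sample.
        """
        p = L_target / (N * (N - 1) / 2)  # link probability
        return [nx.gnp_random_graph(N, p, seed=seed + i) for i in range(n_samples)]

    # Null-model use: compare an observed statistic (here, clustering)
    # against its distribution over the ensemble.
    samples = canonical_ensemble_sample(N=100, L_target=250)
    clusterings = [nx.average_clustering(G) for G in samples]
    print(sum(clusterings) / len(clusterings))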

References

  1. ^ a b Anand, Kartik; Krioukov, Dmitri; Bianconi, Ginestra (2014). "Entropy distribution and condensation in random networks with a given degree distribution". Physical Review E. 89 (6): 062807. arXiv:1403.5884. Bibcode:2014PhRvE..89f2807A. doi:10.1103/PhysRevE.89.062807. PMID 25019833. S2CID 761765.
  2. ^ a b Freitas, Cristopher GS; Aquino, Andre LL; Ramos, Heitor S; Frery, Alejandro C; Rosso, Osvaldo A (2019). "A detailed characterization of complex networks using Information Theory". Scientific Reports. 9 (1): 16689. Bibcode:2019NatSR...916689F. doi:10.1038/s41598-019-53167-5. PMC 6853913. PMID 31723172. S2CID 207987035.
  3. ^ a b Zenil, Hector; Kiani, Narsis A; Tegnér, Jesper (2018). "A review of graph and network complexity from an algorithmic information perspective". Entropy. 20 (8): 551. Bibcode:2018Entrp..20..551Z. doi:10.3390/e20080551. PMC 7513075. PMID 33265640.
  4. ^ a b Small, Michael (2013). "Complex networks from time series: Capturing dynamics". 2013 IEEE International Symposium on Circuits and Systems: 2509–2512. doi:10.1109/ISCAS.2013.6572389. ISBN 978-1-4673-5762-3. S2CID 9275909.
  5. ^ Arnold, Ludwig; Gundlach, Volker Matthias; Demetrius, Lloyd (1994). "Evolutionary formalism for products of positive random matrices". The Annals of Applied Probability. 4 (3): 859–901. doi:10.1214/aoap/1177004975. JSTOR 2245067.
  6. ^ Demetrius, Lloyd; Manke, Thomas (2005). "Robustness and network evolution—an entropic principle". Physica A: Statistical Mechanics and Its Applications. 346 (3): 682–696. Bibcode:2005PhyA..346..682D. doi:10.1016/j.physa.2004.07.011.
  7. ^ Lott, J.; Villani, C. (2009). "Ricci curvature for metric-measure spaces via optimal transport". Annals of Mathematics. 169 (3): 903–991. doi:10.4007/annals.2009.169.903. S2CID 15556613.
  8. ^ Sandhu, R.; Georgiou, T.; Reznik, E.; Zhu, L.; Kolesov, I.; Senbabaoglu, Y.; Tannenbaum, A. (2015). "Graph curvature for differentiating cancer networks". Scientific Reports. 5: 12323. Bibcode:2015NatSR...512323S. doi:10.1038/srep12323. PMC 4500997. PMID 26169480.
  9. ^ Sandhu, Romeil S; Georgiou, Tryphon T; Tannenbaum, Allen R (2016). "Ricci curvature: An economic indicator for market fragility and systemic risk". Science Advances. 2 (5): e1501495. Bibcode:2016SciA....2E1495S. doi:10.1126/sciadv.1501495. PMC 4928924. PMID 27386522.
  10. ^ Jaynes, E. T. (1957). "Information Theory and Statistical Mechanics" (PDF). Physical Review. Series II. 106 (4): 620–630. Bibcode:1957PhRv..106..620J. doi:10.1103/PhysRev.106.620. MR 0087305.
  11. ^ a b Cimini, Giulio; Squartini, Tiziano; Saracco, Fabio; Garlaschelli, Diego; Gabrielli, Andrea; Caldarelli, Guido (2019). "The statistical physics of real-world networks". Nature Reviews Physics. 1 (1): 58–71. arXiv:1810.05095. Bibcode:2019NatRP...1...58C. doi:10.1038/s42254-018-0002-6. S2CID 52963395.
