Branching Out: Quantifying Tree-like Structure in Complex Networks

Blair D. Sullivan Complex Systems Group Center for Engineering Science Advanced Research Computer Science and Mathematics Division Oak Ridge National Laboratory

MMDS, July 12, 2012

Joint work with Michael Mahoney & Aaron Adcock, Stanford University

Managed by UT-Battelle for the U.S. Department of Energy

Motivation

• Large networks are becoming ubiquitous in many domains, e.g. biology, physics, chemistry, infrastructure, communications, and sociology.

• Many methods exist to understand structure at very large scale (diameter) or small scale (clustering coefficient); very few probe the intermediate scale (e.g. clusters of size 5K in a 5M-node network). Can we get good tools to understand and exploit this?

A partial map of the Internet, January 15 2005

The US electric transmission system. Courtesy North American Reliability Corporation.

Drug-Target Network. Nature Biotechnology 25(10), October 2007

Intermediate-Scale Structure

• Determines network evolution & dynamics of diffusion, other processes

• Implicitly affects applicability of common data analysis tools

• This is where all the “interesting stuff” happens.

Ising model (ferromagnetism): Temperature parameter controls scale of local correlations between magnetic spins.

The “intermediate-scale structure” is the coupling of local & global properties.

Prior empirical evidence

Claim: Many large complex networks are “tree-like” when viewed at intermediate scales:

• The Unreasonable Effectiveness of Tree-Based Theory for Networks with Clustering, Melnik, Hackett, Porter, Mucha, Gleeson. Physical Review E, Vol. 83, No. 3 (2010).

• Finding Hierarchy in Directed Online Social Networks, Gupta, Shankar, Li, Muthukrishnan, Iftode. WWW2011.

• "It was noted in recent years that the Internet structure has a highly connected core and long stretched tendrils, and that most of the routing paths between nodes in the tendrils pass through the core. Therefore, we suggest in this work, to embed the Internet distance metric in a hyperbolic space where routes are bent toward the center." Shavitt, Tankel. Hyperbolic embedding of internet graph for distance estimation and overlay construction. IEEE/ACM Trans. Netw. 16, 1 (2008).

However, no consensus has been reached on defining and measuring this tree-like structure, making it difficult to exploit algorithmically.

Image credit: Munzer et al

Arxiv GR-QC collaboration

What do you mean, “tree-like”?

Image credit: Traub, Kelsic, Mucha, Porter

Image credit: Tim Davis

Facebook: Caltech Network

Autonomous Systems

Image credit: Graphics@Illinois

Hyperbolic Space

• Multiple lines parallel to a given line pass through a point not on it, and the angles of a triangle sum to less than 180°.

• At right, see a {7,3}-tessellation of the hyperbolic plane by equilateral triangles, and the dual {3,7}-tessellation by regular heptagons. All triangles and heptagons are of the same hyperbolic size but the size of their Euclidean representations exponentially decreases as a function of the distance from the center, while their number exponentially increases.

• In Euclidean space, a circle’s area grows polynomially with its diameter; in hyperbolic space, it grows exponentially. Think of growth as in a binary tree.

• Shortest paths in hyperbolic space are arcs through the disk, not paths around the exterior (much like travel in a rooted tree).
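The polynomial-versus-exponential growth contrast above can be written out explicitly. The formulas below are a worked illustration, assuming the hyperbolic plane of constant curvature -1 and measuring by radius r rather than diameter (this changes constants only); the tree row counts nodes in a complete binary tree of depth r.

```latex
% Assumption: constant curvature -1; r is the disk radius (or tree depth).
\begin{align*}
A_{\text{Euclid}}(r) &= \pi r^{2}, & L_{\text{Euclid}}(r) &= 2\pi r \\
A_{\text{hyp}}(r) &= 2\pi(\cosh r - 1) \approx \pi e^{r} \ (r \gg 1), & L_{\text{hyp}}(r) &= 2\pi \sinh r \\
N_{\text{tree}}(r) &= 2^{r+1} - 1, & n_{\text{tree}}(r) &= 2^{r}
\end{align*}
```

Here A is area, L is circumference, N is the number of tree nodes within depth r, and n is the number at depth exactly r. Both the hyperbolic disk and the binary tree grow exponentially in r, which is the sense in which hyperbolic space is "tree-like".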

Image credit Krioukov et al.

Hyperbolic Embedding and Greedy Routing

• Hyperbolic space gives us “extra room” to embed networks (as opposed to Euclidean space).

• A number of algorithms take advantage of this to devise greedy routing schemes.

• Kleinberg uses a minimum spanning tree, embedded as a subset of a d-regular tree, where d is the maximum degree of the MST (d = 4 is shown at right)
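As a rough illustration of what a greedy routing scheme does once nodes have coordinates, here is a minimal Python sketch. It does not reproduce Kleinberg's hyperbolic construction: the function names `greedy_route` and `make_dist`, and the toy one-dimensional coordinates, are assumptions made purely for the example.

```python
def make_dist(coords):
    """Return a distance function over node names (toy 1-D coordinates, an assumption)."""
    return lambda u, v: abs(coords[u] - coords[v])


def greedy_route(adj, dist, src, dst):
    """Forward to the neighbor closest to dst; return the path, or None if stuck."""
    path = [src]
    current = src
    while current != dst:
        # Pick the neighbor that minimizes the embedded distance to the target.
        nxt = min(adj[current], key=lambda v: dist(v, dst), default=None)
        # Greedy routing fails when no neighbor is strictly closer than we are.
        if nxt is None or dist(nxt, dst) >= dist(current, dst):
            return None
        path.append(nxt)
        current = nxt
    return path


# A path graph whose coordinates agree with its structure: routing succeeds.
good_adj = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c"]}
good_dist = make_dist({"a": 0.0, "b": 1.0, "c": 2.0, "d": 3.0})
good_path = greedy_route(good_adj, good_dist, "a", "d")

# A badly embedded graph: the only way to c goes through b, which looks farther away.
bad_adj = {"a": ["b"], "b": ["a", "c"], "c": ["b"]}
bad_dist = make_dist({"a": 0.0, "b": -1.0, "c": 2.0})
bad_path = greedy_route(bad_adj, bad_dist, "a", "c")
```

The second case shows why the choice of embedding matters: a greedy embedding is one in which a strictly closer neighbor always exists, so the stuck branch is never taken.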

Image credit Kleinberg

So is it good or bad?

Image credit M.C.Escher

A generative model

• Three-parameter model introduced by Krioukov et al. uses an underlying hyperbolic geometry and allows us to vary the curvature, degree heterogeneity, and density. (Physicists: this is basically fermions)

• Idea: place nodes in the hyperbolic plane (Poincare disk) and connect them with a probability which is dependent on their hyperbolic distance.

• Knob 1: Power law exponent: determines distribution of nodes in the disk – the higher the exponent, the more nodes go towards the center. This determines the curvature (and degree heterogeneity)

• Knob 2: Temperature: determines how much we ignore the underlying geometry when adding edges; at high temperatures, edge connections become essentially random (independent of distance).

• Knob 3: Average degree (target): approximately allows control over density
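To make the role of the temperature knob concrete, here is a minimal Python sketch of the two ingredients described above: hyperbolic distance between points given in polar coordinates, and a distance-dependent connection probability. The Fermi-Dirac form and the disk radius `R` are assumptions based on the commonly cited formulation of this model with curvature -1; this is not the code provided by the model's authors, and the function names are hypothetical.

```python
import math


def hyperbolic_distance(r1, theta1, r2, theta2):
    """Distance between polar points (r, theta) in the hyperbolic plane, curvature -1."""
    # Angular separation folded into [0, pi].
    dtheta = math.pi - abs(math.pi - abs(theta1 - theta2) % (2 * math.pi))
    arg = (math.cosh(r1) * math.cosh(r2)
           - math.sinh(r1) * math.sinh(r2) * math.cos(dtheta))
    # Clamp to guard against rounding slightly below 1.
    return math.acosh(max(1.0, arg))


def connection_probability(d, R, T):
    """Assumed Fermi-Dirac form: p = 1 / (1 + exp((d - R) / (2T)))."""
    x = (d - R) / (2.0 * T)
    if x > 700.0:  # exp would overflow; the probability is effectively zero
        return 0.0
    return 1.0 / (1.0 + math.exp(x))


R = 10.0  # hypothetical disk radius, chosen only for illustration
for T in (0.5, 1.5, 20.0):  # the temperatures listed in the test parameters
    near = connection_probability(0.0, R, T)
    far = connection_probability(2 * R, R, T)
    print(f"T={T}: p(near)={near:.3f}, p(far)={far:.3f}")
```

At low temperature the probability is close to a step function of distance, so the geometry dominates; at high temperature near and far pairs connect with similar probability, which matches the "essentially random" behavior described for Knob 2.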

Our test parameters

Power Law     2.1    2.25   2.5
Temperature   20     1.5    0.5
Avg. Degree   5      10     20

[Table: limiting regimes of the model. Axis labels: Temp. (Finite, Infinite) and a second Finite/Infinite axis. Cells: random hyperbolic graphs; classical random graphs (Erdos-Renyi); random geometric graphs; random graphs w/given expected]

Special Thanks

Special thanks to D. Krioukov for providing us code to generate networks according to the model described on the previous slide.

Image credit San Diego Reader

Hyperbolic Embedding for Inference

• Boguna, Krioukov, Papadopoulos have mapped “the internet” to hyperbolic space, and used the embedding to identify community structure (and offer suggested routing schemes).

Image credit Boguna, Krioukov, Papadopoulos

• Their methods rely on iterative maximum-likelihood estimation (MLE) and do not seem to scale to “big data”.

A geometric measure of tree-likeness

• Gromov’s δ-hyperbolicity arises from the geometry of metric spaces and δ measures the extent to which a (geodesic) metric space embeds in a tree metric.

[Figure: two example graphs on the four vertices u, v, w, x]

Example with δ = 0:
d(u,v) + d(w,x) = 1 + 1 = 2
d(u,x) + d(v,w) = 1 + 1 = 2
d(u,w) + d(v,x) = 1 + 1 = 2

Example with δ = 1:
d(u,v) + d(w,x) = 1 + 1 = 2
d(u,x) + d(v,w) = 2 + 2 = 4
d(u,w) + d(v,x) = 1 + 1 = 2

• Note: d(u,v) is the length of the shortest path between u and v in the graph.

• The minimum δ for which G is δ-hyperbolic can be computed (naively) in O(n^4) time.
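The naive computation can be sketched in a few lines of Python. This is a minimal illustration, assuming a connected, unweighted graph given as an adjacency dictionary and the four-point convention in which δ is half the gap between the two largest of the three pairwise distance sums (the convention consistent with the δ = 0 and δ = 1 examples above). The function names are hypothetical.

```python
from collections import deque
from itertools import combinations


def bfs_distances(adj, source):
    """Shortest-path lengths from source in an unweighted graph."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def four_point_delta(adj):
    """Naive delta: check every 4-tuple of vertices (assumes adj is connected)."""
    d = {u: bfs_distances(adj, u) for u in adj}
    delta = 0.0
    for u, v, w, x in combinations(adj, 4):
        sums = sorted((d[u][v] + d[w][x],
                       d[u][x] + d[v][w],
                       d[u][w] + d[v][x]))
        # Half the gap between the two largest sums.
        delta = max(delta, (sums[2] - sums[1]) / 2)
    return delta


# 4-cycle u-v-x-w-u: d(u,x) = d(v,w) = 2, all other distances are 1.
cycle4 = {"u": ["v", "w"], "v": ["u", "x"], "x": ["v", "w"], "w": ["x", "u"]}
# Complete graph on four vertices: every distance is 1.
k4 = {a: [b for b in "uvwx" if b != a] for a in "uvwx"}
# A star, which is a tree.
star = {"c": ["a", "b", "e"], "a": ["c"], "b": ["c"], "e": ["c"]}

delta_cycle = four_point_delta(cycle4)
delta_k4 = four_point_delta(k4)
delta_star = four_point_delta(star)
```

The loop over all 4-tuples is what gives the O(n^4) cost; the all-pairs BFS beforehand is cheaper and does not change the overall bound.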
