Anything But Random: 1,500,000 Blogs

7,100,000 Relationships Across 1,500,000 blogs

Before Google’s Social Graph API closed last month, I was able to gain access to a reasonable subset: 7,100,000 relationships across 1,500,000 sites (shown below). To be honest, it wasn’t what I was expecting.

The attach rate ($P(k) \sim k^{-\gamma}$) is pretty close to what Barabási, Albert, and Jeong found in "Scale-free characteristics of random networks": $\gamma \sim 2.11$ across all nodes, with $R^2 = 0.86$, and $\gamma \sim 2.73$ for nodes with 100 or fewer attachments, with $R^2 = 0.97$.

all nodes

nodes with 100 or fewer edges
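For readers who want to try a fit like this on their own data, here is a minimal sketch of one common approach: an ordinary least-squares line through the log-log degree histogram. This is an assumption about method, not necessarily how the numbers above were produced. The function name `fit_power_law` and the `max_degree` cutoff parameter are made up for illustration.

```python
import math
from collections import Counter

def fit_power_law(degrees, max_degree=None):
    """Fit log P(k) = c - gamma * log k by least squares.

    degrees: iterable of node degrees (attachments per node).
    max_degree: optional cutoff, e.g. 100 to keep only nodes
                with 100 or fewer attachments (hypothetical parameter).
    Returns (gamma, r_squared).
    """
    counts = Counter(
        d for d in degrees
        if d > 0 and (max_degree is None or d <= max_degree)
    )
    if len(counts) < 2:
        raise ValueError("need at least two distinct degrees to fit a line")

    total = sum(counts.values())
    ks = sorted(counts)
    xs = [math.log(k) for k in ks]
    ys = [math.log(counts[k] / total) for k in ks]

    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if syy == 0:
        raise ValueError("degree frequencies are all equal; nothing to fit")

    slope = sxy / sxx
    r_squared = (sxy * sxy) / (sxx * syy)
    return -slope, r_squared
```

Calling `fit_power_law(degrees)` and then `fit_power_law(degrees, max_degree=100)` would give the two kinds of fits quoted above. A straight-line fit on a log-log histogram is easy to reproduce but is known to be sensitive to the noisy tail, which may be part of why the fit restricted to low-degree nodes has the higher $R^2$.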

Even though the attach rate was not much different, what clearly stood out was the sizable number of neighborhoods.

Below are compressed neighbor matrices for three random networks, alongside the one observed in the blog network. While each has its own standout pattern, the blog network looks nothing like the others.

1.5M Blogs

Random (ER)

Scale-Free (BA)

Small World (WS)
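The post does not spell out how the matrices were compressed. One plausible reading, and it is only an assumption, is that the node-by-node adjacency matrix is binned into a coarse grid where each cell counts the edges falling inside it. The sketch below does that for a small Erdős–Rényi graph. The helper names `er_edges` and `compress_adjacency` are hypothetical.

```python
import random

def er_edges(n, p, seed=0):
    """Undirected Erdos-Renyi G(n, p): each pair is linked with probability p."""
    rng = random.Random(seed)
    return [
        (i, j)
        for i in range(n)
        for j in range(i + 1, n)
        if rng.random() < p
    ]

def compress_adjacency(edges, n, bins):
    """Bin an n x n symmetric adjacency matrix into a bins x bins grid.

    Each cell holds the number of adjacency-matrix entries (i, j)
    that are set inside that block, so the grid sums to 2 * len(edges).
    """
    grid = [[0] * bins for _ in range(bins)]
    for i, j in edges:
        bi = i * bins // n  # block row for node i
        bj = j * bins // n  # block column for node j
        grid[bi][bj] += 1   # entry (i, j)
        grid[bj][bi] += 1   # mirrored entry (j, i)
    return grid

if __name__ == "__main__":
    n = 200
    edges = er_edges(n, p=0.05, seed=1)
    for row in compress_adjacency(edges, n, bins=8):
        print(" ".join(f"{cell:4d}" for cell in row))
```

For an ER graph the cells should come out roughly uniform, which is the baseline the blog matrix is being compared against. Note that the picture depends heavily on how nodes are ordered before binning (by degree, by discovery order, by cluster), and the ordering used for the figures above is not stated.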
