Graph Layout by Random Vertex Sampling

Random Vertex Sampling

Force-directed graph layout algorithms work by modeling the graph’s vertices as charged particles that repel each other and the graph’s edges as springs that try to maintain an ideal distance between connected vertices. The algorithms run an iterative physics simulation to find a good set of vertex positions that minimizes these forces.

Comparison of the brute-force repulsive force algorithm and the Random Vertex Sampling algorithm. LEFT: The brute-force algorithm computes repulsive forces between each pair of vertices. RIGHT: Random Vertex Sampling uses a sliding window to select a subset of vertices to update. For each vertex in the update set, it randomly samples a smaller set of vertices to use for computing repulsive forces. Each vertex also has a small, constant-sized number of “neighbor” vertices that they use to compute repulsive forces. The algorithm is iterative, so after one iteration, the window slides forward and computes forces on a different subset of vertices.

In force-directed graph layouts, repulsive force calculations between the vertices are the main performance bottleneck. The brute-force algorithm computes repulsive forces between each pair of vertices, and therefore runs in O(n²) time at each iteration (n is the number of vertices in the graph).

Random Vertex Sampling reduces this runtime to O(n). It works by using a sliding window of length n^¾ to select a subset of vertices to update their velocity. For each vertex in the update set, it randomly selects n^¼ vertices to use for computing repulsive forces. Each vertex also has a constant-sized list of “neighbor” vertices used for computing repulsive forces on it. Because we choose the exponents carefully, and because the “neighbor” vertex lists are constant in size, the overall algorithm runs in linear time with O(n^¾) auxiliary space requirements. Before the next iteration, the window slides forward n^¾ positions in the vertex list to update a different set of vertices during the next iteration. Then the whole process repeats itself.

In contrast, consider tree-based approximation algorithms like Barnes-Hut or Fast Multipole. They build a spatial tree to approximate forces by aggregating distant vertices. This enables a O(n log n) runtime, which is pretty good, but still not as fast as Random Vertex Sampling. But, the spatial tree also comes with the expense of requiring O(n log n) auxiliary memory to store the tree, which is much more than Random Vertex Sampling.

Faster and Just as Good!

In my experiments, I found that force-directed algorithms using Random Vertex Sampling run about 3 times faster on average than algorithms using Barnes-Hut. That’s a big improvement! Even better, it converges to a good layout faster, too.

I also found that Random Vertex Sampling produces layouts that are about the same quality as Barnes-Hut. Sometimes the layouts look a little more chaotic, though. We can improve this by first computing a layout using Random Vertex Sampling, and then running a few iterations of Barnes-Hut to smooth out the layout. That runs almost as fast as using Random Vertex Sampling by itself.

Graph layouts using Random Vertex Sampling and Barnes-Hut

The detailed experiments use statistical analysis on five different graph layout quality metrics using a wide variety of graph types (social networks, transportation networks, geometric graphs, etc.). If you’re interested, see my research paper for all the details.

More Information

Try it out for yourself! It’s available in d3-force-sampled, a plugin for D3’s force-directed graph layout package v4 and above.

Publication:

Robert Gove. “A Random Sampling O(n) Force-calculation Algorithm for Graph Layouts.” Computer Graphics Forum 38, 3 (2019).

About the Author

Robert Gove

Robert Gove is a Distinguished Data Visualization Scientist at Two Six Technologies. He is interested in designing UIs and visualizations to answer deep analytical questions, and using machine learning and statistics to enhance analysts' capabilities. Recently he has worked on several algorithms to automatically produce more useful data visualizations. Robert has won several awards for his peer-reviewed papers in data visualization and cyber security. He holds a Master of Science in Computer Science from the University of Maryland, and two Bachelor of Science degrees in Computer Science and Applied Math from UNC Greensboro.

Graph Layout by Random Vertex Sampling

Random Vertex Sampling

Faster and Just as Good!

More Information

Tags

About the Author

Robert Gove

The Perplexing Diminishing Returns of Facebook Ads.

Which advertisement is the best? Additional considerations for A/B testing.

US Elections: Moscow favors former President Trump, but avoids risks that could lead to interference accusations

How China and Russia helped Venezuelan leader Nicolás Maduro pull off his reelection heist

PRODUCTS

Careers

Capabilities

About two six

Resources