LinkedIn: Using Set Cover to Optimize a Large-Scale Low Latency Distributed Graph

08/01/2025 12 min

Episode Synopsis

This research paper describes how LinkedIn optimized low-latency graph computations in its large-scale distributed graph system. The team implemented a modified greedy set cover algorithm to minimize the number of machines required to process second-degree connection queries. This optimization significantly reduced latency both in constructing network caches and in graph distance calculations, which improved the user experience. The paper also covers the distributed graph architecture, including its partitioning and caching mechanisms, and compares the approach with related work in distributed graph processing. The results show that the modified set cover algorithm is effective for large-scale graph queries in a real-world online environment.

https://www.usenix.org/system/files/conference/hotcloud13/hotcloud13-wang.pdf
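
To make the set cover idea in the synopsis concrete, here is a minimal sketch of the textbook greedy set cover heuristic in Python. It is not LinkedIn's modified algorithm, whose details the synopsis does not give. The function name `greedy_set_cover`, the machine names, and the partition numbers are all hypothetical, chosen only for illustration.

```python
from typing import Dict, Hashable, List, Set


def greedy_set_cover(universe: Set[Hashable],
                     subsets: Dict[str, Set[Hashable]]) -> List[str]:
    """Return subset names that together cover every element of `universe`.

    Textbook greedy heuristic: repeatedly pick the subset that covers the
    most elements still uncovered. The result is not guaranteed to be minimal.
    """
    uncovered = set(universe)
    chosen: List[str] = []
    while uncovered:
        # Iterate names in sorted order so ties are broken deterministically.
        best = max(sorted(subsets), key=lambda name: len(subsets[name] & uncovered))
        gained = subsets[best] & uncovered
        if not gained:
            raise ValueError("some elements are not covered by any subset")
        chosen.append(best)
        uncovered -= gained
    return chosen


# Hypothetical data: which graph partitions (1..6) each machine stores.
machines = {
    "m1": {1, 2, 3},
    "m2": {3, 4},
    "m3": {4, 5, 6},
    "m4": {1, 6},
}
needed = {1, 2, 3, 4, 5, 6}  # partitions a query has to read

plan = greedy_set_cover(needed, machines)
print(plan)  # ['m1', 'm3']
```

In this toy setup the query needs data from six partitions, and the heuristic selects two machines instead of contacting all four. Fewer machines per query is the effect the synopsis credits for the latency reduction.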

More episodes of the podcast The Binary Breakdown

NeonDB: A Serverless PostgreSQL Analysis 31/07/2025

Anna: A KVS For Any Scale 29/05/2025

Conflict-free Replicated Data Types 21/05/2025

CAP Twelve Years Later: How the "Rules" Have Changed 14/05/2025

Raft versus Paxos: An Understandable Consensus Algorithm 07/05/2025

Neo4j Architecture: Graph Database Internals, Performance, and Optimization 01/05/2025

Sentry: Error Monitoring at Scale - Design Principles Analysis 23/04/2025

Istio Service Mesh: Architecture, Security, and Traffic Management 16/04/2025

CockroachDB: SQL for Global Scale Design Principles 09/04/2025

Snowflake: Revolutionizing Cloud Data Warehousing and Analytics 02/04/2025
