Listen "The Resilience Problem"
Episode Synopsis
…we have educated generations of computer scientists on the paradigm that analysis of algorithm only means analyzing their computational efficiency. As Wikipedia states: "In computer science, the analysis of algorithms is the process of finding the computational complexity of algorithms—the amount of time, storage, or other resources needed to execute them." In other words, efficiency is the sole concern in the design of algorithms. … What about resilience? —Moshe Y. Vardi
This quote set me to thinking about how efficiency and resilience might interact, or trade off against one another, in networks. The most obvious extreme cases are two routers connected via a single long-haul link and the highly parallel data center fabrics we build today. Obviously adding a second long-haul link would improve resilience—but at what cost in terms of efficiency? Its also obvious highly meshed data center fabrics have plenty of resilience—and yet they still sometimes fail. Why?
More episodes of the podcast DESIGN – rule 11 reader
Hedge 265: Out of Band Networks
04/04/2025
Architecture and Process
12/04/2024
Simple or Complex?
19/09/2023
Hedge 144: IPv6 Lessons Learned
25/08/2022
Route Servers and Loops
16/08/2022
Hedge 134: Ten Things
15/06/2022
Revisiting BGP Convergence
06/06/2022
BGP Policies (Part 2)
14/03/2022
BGP Policies (part 1)
07/03/2022
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.