38
2

On the Resilience of Fast Failover Routing Against Dynamic Link Failures

Abstract

Modern communication networks feature local fast failover mechanisms in the data plane, to swiftly respond to link failures with pre-installed rerouting rules. This paper explores resilient routing meant to tolerate k\leq k simultaneous link failures, ensuring packet delivery, provided that the source and destination remain connected. While past theoretical works studied failover routing under static link failures, i.e., links which permanently and simultaneously fail, real-world networks often face link flapping--dynamic down states caused by, e.g., numerous short-lived software-related faults. Thus, in this initial work, we re-investigate the resilience of failover routing against link flapping, by categorizing link failures into static, semi-dynamic (removing the assumption that links fail simultaneously), and dynamic (removing the assumption that links fail permanently) types, shedding light on the capabilities and limitations of failover routing under these scenarios. We show that kk-edge-connected graphs exhibit (k1)(k-1)-resilient routing against dynamic failures for k5k \leq 5. We further show that this result extends to arbitrary kk if it is possible to rewrite logk\log k bits in the packet header. Rewriting 33 bits suffices to cope with kk semi-dynamic failures. However, on general graphs, tolerating 22 dynamic failures becomes impossible without bit-rewriting. Even by rewriting logk\log k bits, resilient routing cannot resolve kk dynamic failures, demonstrating the limitation of local fast rerouting.

View on arXiv
Comments on this paper