FastRoute: A scalable load-aware anycast routing architecture for modern CDNs

FastRoute: A scalable load-aware anycast routing architecture for modern CDNs - Flavel et al. 2015 This is the story of how a team at Microsoft redesigned their CDN that supports 'numerous popular online services.' It's also a great example of mature systems thinking: the team deliberately eschew designs that would give marginally better performance at … Continue reading FastRoute: A scalable load-aware anycast routing architecture for modern CDNs

Wormhole: Reliable pub-sub to support Geo-Replicated Internet Services

Wormhole: Reliable pub-sub to support Geo-Replicated Internet Services - Sharma et al. 2015 At Facebook, lots of applications are interested in data being written to Facebook's data stores. Having each of these applications poll the data stores of interest would be untenable, so Facebook built a pub-sub system to identify updates and transmit notifications to … Continue reading Wormhole: Reliable pub-sub to support Geo-Replicated Internet Services

The Design and Implementation of Open vSwitch

The Design and Implementation of Open vSwitch - Pfaff et al. 2015 Another selection from this month's NSDI 2015 programme, this time from the operational systems track. What inspired the creation of Open vSwitch? What has most influenced its design? And what's next? As virtualized (or containerized) workloads grew, physically provisioning networks to support them … Continue reading The Design and Implementation of Open vSwitch

Extensible Distributed Coordination

Extensible Distributed Coordination - Distler et al. 2015 Coordination services such as ZooKeeper offer a deliberately limited API. As a consequence, more complex coordination tasks have to be implemented as multiple RPCs. In Extensible Distributed Coordination, Distler et al. describe a sandboxed extension mechanism for coordination services that allows execution of client logic in the … Continue reading Extensible Distributed Coordination

Taming uncertainty in distributed systems with help from the network

Taming uncertainty in distributed systems with help from the network - Leners et al. 2015 Albatross is a membership service with a very interesting new twist: it exploits SDN functionality to actively enforce partitions! Perhaps it is not immediately obvious why that might be a good thing :). It turns out there are several benefits: … Continue reading Taming uncertainty in distributed systems with help from the network

Putting Consistency Back into Eventual Consistency

Putting Consistency Back into Eventual Consistency - Balegas et al. 2015 Today's choice is another pick from the recent crop of Eurosys 2015 papers. Balegas et al. show us that we don't have to put up with weak forms of eventual consistency, even in geo-replicated settings. In Building on Quicksand Helland argued that we need … Continue reading Putting Consistency Back into Eventual Consistency