Consistency Without Borders - Alvaro et al. 2013 We closed out last week by looking at the gap that has opened up between application developer needs and what the database community is providing, leading to the widespread adoption of Feral Concurrency Control. Today's paper, written two years earlier, anticipates this problem and discusses the possible … Continue reading Consistency Without Borders
Category: Distributed Systems
Core distributed systems topics, for example consistency, availability and so on.
Quantifying Isolation Anomalies
Quantifying Isolation Anomalies - Fekete et al. 2009 Before we get into today's content, for those of you that can be in London for the week of the 14th September the GOTO London conference will be taking place with curated themes from Adrian Cockcroft discussing how we can build systems that are agile, lean, and … Continue reading Quantifying Isolation Anomalies
The Potential Dangers of Causal Consistency and an Explicit Solution
The Potential Dangers of Causal Consistency and an Explicit Solution - Bailis et al. 2012 Yesterday we saw how we could get both better performance and stronger consistency by upgrading from eventual consistency to causal consistency. Are there any downsides? With useful semantics, low latency, partition tolerance, and, recently, a demonstrably efficient architecture, causal consistency … Continue reading The Potential Dangers of Causal Consistency and an Explicit Solution
Bolt-on Causal Consistency
Bolt-on Causal Consistency - Bailis et al. 2013 "It'll probably be OK" seems to reflect the prevailing application developer's attitude to working with eventually consistent stores. Thanks to the work of Bailis et al. on PBS, we can now quantify that 'probably.' And it looks pretty good at first glance, with 99+% probabilities achievable after … Continue reading Bolt-on Causal Consistency
Probabilistically Bounded Staleness for Practical Partial Quorums
Probabilistically Bounded Staleness for Practical Partial Quorums - Bailis et al. 2012, and Quantifying Eventual Consistency with PBS - Bailis et al. 2014 'Probabilistically Bounded Staleness... ' was the original VLDB '12 paper, and then the authors were invited to submit an extended version to the VLDB Journal ('Quantifying Eventual Consistency...') which was published in … Continue reading Probabilistically Bounded Staleness for Practical Partial Quorums
MillWheel: Fault-Tolerant Stream Processing at Internet Scale
MillWheel: Fault-Tolerant Stream Processing at Internet Scale - Akidau et al. (Google) 2013 Earlier this week we looked at the Google Cloud Dataflow model which is implemented on top of FlumeJava (for batch) and MillWheel (for streaming): We have implemented this model internally in FlumeJava, with MillWheel used as the underlying execution engine for streaming … Continue reading MillWheel: Fault-Tolerant Stream Processing at Internet Scale
Virtual Time
Virtual Time - Jefferson 1985 Justin LaPre sent me a tweet last week, "I think you would enjoy reading 'Virtual Time' by David Jefferson. Give it a read if you find yourself with some down time. " Thank you Justin, I did read it and I thought it was absolutely wonderful! This paper has time … Continue reading Virtual Time
Asynchronous Distributed Snapshots for Distributed Dataflows
Asynchronous Distributed Snapshots for Distributed Dataflows - Carbone et al. 2015 The team behind Apache Flink and data Artisans are a smart group of folks. Their recent blog post on High-throughput, low-latency, and exactly-once stream processing with Apache Flink is well worth reading and has a good description of the evolution of streaming architectures, the … Continue reading Asynchronous Distributed Snapshots for Distributed Dataflows
The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing
The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing - Akidau et al. (Google) - 2015 With thanks to William Vambenepe for suggesting this paper via twitter. Google Cloud Dataflow reached GA last week, and the team behind Cloud Dataflow have a paper accepted at VLDB'15 … Continue reading The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing
Lasp: A language for distributed, coordination-free programming
Lasp: A language for distributed, coordination-free programming - Meiklejohn & Van Roy 2015 * Update: fixed typo in Chris' surname above. * With thanks to Colin Barrett for suggesting today's choice, and to Chris Meiklejohn for providing a link to a paywall-free preprint of the paper. Christopher Meiklejohn recently announced he is leaving Basho to … Continue reading Lasp: A language for distributed, coordination-free programming