On the duality of resilience and privacy

On the duality of resilience and privacy - Crowcroft '15 Somewhat of a philosophical start to the week this week as Jon Crowcroft makes the argument for greater privacy through some of the same mechanisms that give systems greater resilience. Plus, it includes this quote: It is a truth universally acknowledged that centralized cloud services … Continue reading On the duality of resilience and privacy

Impala: a modern, open-source SQL engine for Hadoop

Impala: A modern, open-source SQL engine for Hadoop - Kornacker et al . 2015 (Cloudera*) This is post 4 of 5 in a series looking at the latest research from CIDR'15. Also in the series so far this week: 'The missing piece in complex analytics', 'WANalytics, analytics for a geo-distributed, data intensive world', and 'Liquid: … Continue reading Impala: a modern, open-source SQL engine for Hadoop

Liquid: Unifying nearline and offline big data integration

Liquid: Unifying Nearline and Offline Big Data Integration - Fernandez et al. 2015 This is post 3 of 5 in a series looking at the latest research from the CIDR '15 conference. Also in the series so far this week: 'The missing piece in complex analytics' and 'WANalytics: analytics for a geo-distributed, data intensive world'. … Continue reading Liquid: Unifying nearline and offline big data integration

WANalytics: Analytics for a geo-distributed, data intensive world

WANalytics: analytics for a geo-distributed data intensive world - Vulimiri et al. 2015 ...data is born distributed; we only control data replication and distributed execution strategies. This is true for so many sources of data. Combine this with Dave McCrory's observation that 'Data has Gravity' (i.e. it attracts applications and other data processing workloads to … Continue reading WANalytics: Analytics for a geo-distributed, data intensive world

Introducing CIDR’15 week on The Morning Paper

The data systems research community are a smart bunch, although it's not their research and papers I'm referring to here. Many conferences move around, but not the Conference on Innovative Data Systems Research (CIDR). CIDR has found a rather nice venue "on the Pacific Ocean, just south of Monterey", and decided to stick there. Schedule … Continue reading Introducing CIDR’15 week on The Morning Paper

Eraser: A dynamic data race detector for multi-threaded programs

Eraser: A dynamic data race detector for multi-threaded programs - Savage et al. 1997 Debugging a multithreaded program can be difficult. Simple errors in synchronization can produce timing-dependent data races that can take weeks or months to track down. Eraser dynamically detects data races in multi-threaded programs. There are two basic approaches to doing this, … Continue reading Eraser: A dynamic data race detector for multi-threaded programs