POPL 2016

Last month saw the 43rd edition of the ACM SIGPLAN-SIGACT Symposium on the Principles of Programming Languages (POPL). Gabriel Scherer did a wonderful job of gathering links to all of the accepted papers in a GitHub repo. For this week, I've chosen five papers from the conference that caught my eye. Links will go live … Continue reading POPL 2016

Panopticon: An Omniscient Lock Broker for Efficient Distributed Transactions in the Datacenter

Panopticon: An Omniscient Lock Broker for Efficient Distributed Transactions in the Datacenter - Tasci & Demirbas, 2015 Today we return to the theme of distributed transactions, and a paper that won a best paper award from IEEE Big Data in 2015. Panopticon is a centralized lock broker (like Chubby and ZooKeeper) that manages distributed (decentralized) … Continue reading Panopticon: An Omniscient Lock Broker for Efficient Distributed Transactions in the Datacenter

Chimera: Large-Scale Classification Using Machine Learning, Rules, and Crowdsourcing

Chimera: Large-Scale Classification Using Machine Learning, Rules, and Crowdsourcing - Sun et al. 2014 (WalmartLabs) Large-scale classification, where we need to classify hundreds of thousands or millions of items into thousands of classes, is becoming increasingly common in this age of Big Data... So far, however, very little has been published on how large-scale classification … Continue reading Chimera: Large-Scale Classification Using Machine Learning, Rules, and Crowdsourcing

Petuum: A New Platform for Distributed Machine Learning on Big Data

Petuum: A New Platform for Distributed Machine Learning on Big Data - Xing et al. 2015 How do you perform machine learning with big models (big here could be 100s of billions of parameters!) over big data sets (terabytes or petabytes)? Take for example state of the art image recognition systems that have embraced large-scale … Continue reading Petuum: A New Platform for Distributed Machine Learning on Big Data

Arabesque: A System for Distributed Graph Mining

Arabesque: A System For Distributed Graph Mining - Teixeira et al. 2015 We've studied graph computation systems before in The Morning Paper: systems such as Pregel, Giraph and GraphLab that provide vertex-centric programming models ('think like a vertex') on top of a Bulk Synchronous Parallel compute model. We've also seen some of the limitations of … Continue reading Arabesque: A System for Distributed Graph Mining

The Design and Implementation of the Wave Transactional Filesystem

The Design and Implementation of the Wave Transactional Filesystem - Escriva & Sirer 2015 Since we've been looking at various combinations of storage and transactions, it seemed appropriate to start this week with the Wave Transactional Filesystem. Throughout the paper you'll find this abbreviated as WTF, but my brain can't read that without supplying the … Continue reading The Design and Implementation of the Wave Transactional Filesystem

Blurred Persistence: Efficient Transactions in Persistent Memory

Blurred Persistence: Efficient Transactions in Persistent Memory - Lu, Shu, & Sun, 2015 We had software transactional memory (STM), then hardware support for transactional memory (HTM), and now with persistent memory in which the memory plays the role of stable storage, we can have persistent transactional memory. And with persistent transactional memory, there's an issue … Continue reading Blurred Persistence: Efficient Transactions in Persistent Memory

From ARIES to MARS: Transaction Support for Next-Generation Solid State Drives

From ARIES to MARS: Transaction Support for Next-Generation Solid State Drives - Coburn et al. 2013 For the last couple of weeks we've been bouncing around the topics of transaction support and the implications of a non-volatile memory and super-fast networking on system design. We've seen statements such as 'the bandwidth and latency characteristics of … Continue reading From ARIES to MARS: Transaction Support for Next-Generation Solid State Drives