Last month saw the 43rd edition of the ACM SIGPLAN-SIGACT Symposium on the Principles of Programming Languages (POPL). Gabriel Scherer did a wonderful job of gathering links to all of the accepted papers in a GitHub repo. For this week, I've chosen five papers from the conference that caught my eye. Links will go live … Continue reading POPL 2016
Month: January 2016
Panopticon: An Omniscient Lock Broker for Efficient Distributed Transactions in the Datacenter
Panopticon: An Omniscient Lock Broker for Efficient Distributed Transactions in the Datacenter - Tasci & Demirbas, 2015 Today we return to the theme of distributed transactions, and a paper that won a best paper award from IEEE Big Data in 2015. Panopticon is a centralized lock broker (like Chubby and ZooKeeper) that manages distributed (decentralized) … Continue reading Panopticon: An Omniscient Lock Broker for Efficient Distributed Transactions in the Datacenter
Chimera: Large-Scale Classification Using Machine Learning, Rules, and Crowdsourcing
Chimera: Large-Scale Classification Using Machine Learning, Rules, and Crowdsourcing - Sun et al. 2014 (WalmartLabs) Large-scale classification, where we need to classify hundreds of thousands or millions of items into thousands of classes, is becoming increasingly common in this age of Big Data... So far, however, very little has been published on how large-scale classification … Continue reading Chimera: Large-Scale Classification Using Machine Learning, Rules, and Crowdsourcing
Petuum: A New Platform for Distributed Machine Learning on Big Data
Petuum: A New Platform for Distributed Machine Learning on Big Data - Xing et al. 2015 How do you perform machine learning with big models (big here could be 100s of billions of parameters!) over big data sets (terabytes or petabytes)? Take for example state of the art image recognition systems that have embraced large-scale … Continue reading Petuum: A New Platform for Distributed Machine Learning on Big Data
Arabesque: A System for Distributed Graph Mining
Arabesque: A System For Distributed Graph Mining - Teixeira et al. 2015 We've studied graph computation systems before in The Morning Paper: systems such as Pregel, Giraph and GraphLab that provide vertex-centric programming models ('think like a vertex') on top of a Bulk Synchronous Parallel compute model. We've also seen some of the limitations of … Continue reading Arabesque: A System for Distributed Graph Mining
The Design and Implementation of the Wave Transactional Filesystem
The Design and Implementation of the Wave Transactional Filesystem - Escriva & Sirer 2015 Since we've been looking at various combinations of storage and transactions, it seemed appropriate to start this week with the Wave Transactional Filesystem. Throughout the paper you'll find this abbreviated as WTF, but my brain can't read that without supplying the … Continue reading The Design and Implementation of the Wave Transactional Filesystem
A Distributed Systems Seminar Reading List…
I stumbled upon Murat Demirbas' 'Distributed Systems Seminar's Reading List for Spring 2016.' If you're taking part in those seminars, you're in for some very interesting papers! I was pleased to discover I've read (and written up) most of them - but there are a few that I haven't. Given the high calibre of the … Continue reading A Distributed Systems Seminar Reading List…