Medea: scheduling of long running applications in shared production clusters

June 13, 2018June 9, 2018 ~ adriancolyer

Medea: scheduling of long running applications in shared production clusters Garefalakis et al., EuroSys'18 (If you don’t have ACM Digital Library access, the paper can be accessed either by following the link above directly from The Morning Paper blog site). We’re sticking with schedulers today, and a really interesting system called Medea which is designed … Continue reading Medea: scheduling of long running applications in shared production clusters

Optimus: an efficient dynamic resource scheduler for deep learning clusters

June 12, 2018June 9, 2018 ~ adriancolyer

Optimus: an efficient dynamic resource scheduler for deep learning clusters Peng et al., EuroSys'18 (If you don’t have ACM Digital Library access, the paper can be accessed either by following the link above directly from The Morning Paper blog site). It’s another paper promising to reduce your deep learning training times today. But instead of … Continue reading Optimus: an efficient dynamic resource scheduler for deep learning clusters

Improving the expressiveness of deep learning frameworks with recursion

June 11, 2018June 4, 2018 ~ adriancolyer

Improving the expressiveness of deep learning frameworks with recursion Jeong, Jeong et al., EuroSys'18 (If you don’t have ACM Digital Library access, the paper can be accessed either by following the link above directly from The Morning Paper blog site). Last week we looked at the embedded dynamic control flow operators in TensorFlow. In today’s … Continue reading Improving the expressiveness of deep learning frameworks with recursion

BDS: A centralized near-optimal overlay network for inter-datacenter data replication

June 8, 2018June 1, 2018 ~ adriancolyer ~ 1 Comment

BDS: A centralized near-optimal overlay network for inter-datacenter data replication Zhang et al., EuroSys'18 (If you don’t have ACM Digital Library access, the paper can be accessed either by following the link above directly from The Morning Paper blog site). This is the story of how inter-datacenter multicast transfers at Baidu were sped-up by a … Continue reading BDS: A centralized near-optimal overlay network for inter-datacenter data replication

Dynamic control flow in large-scale machine learning

June 7, 2018June 1, 2018 ~ adriancolyer ~ 2 Comments

Dynamic control flow in large-scale machine learning Yu et al., EuroSys'18 (If you don’t have ACM Digital Library access, the paper can be accessed either by following the link above directly from The Morning Paper blog site). In 2016 the Google Brain team published a paper giving an overview of TensorFlow, "TensorFlow: a system for … Continue reading Dynamic control flow in large-scale machine learning

Reducing DRAM footprint with NVM in Facebook

June 6, 2018May 30, 2018 ~ adriancolyer ~ 15 Comments

Reducing DRAM footprint with NVM in Facebook Eisenman et al., EuroSys'18 (If you don’t have ACM Digital Library access, the paper can be accessed either by following the link above directly from The Morning Paper blog site). ...to the best of our knowledge, this is the first study on the usage of NVM devices in … Continue reading Reducing DRAM footprint with NVM in Facebook

ServiceFabric: a distributed platform for building microservices in the cloud

June 5, 2018May 29, 2018 ~ adriancolyer ~ 19 Comments

ServiceFabric: a distributed platform for building microservices in the cloud Kakivaya et al., EuroSys'18 (If you don’t have ACM Digital Library access, the paper can be accessed either by following the link above directly from The Morning Paper blog site). Microsoft’s Service Fabric powers many of Azure’s critical services. It’s been in development for around … Continue reading ServiceFabric: a distributed platform for building microservices in the cloud

Hyperledger fabric: a distributed operating system for permissioned blockchains

June 4, 2018May 28, 2018 ~ adriancolyer ~ 1 Comment

Hyperledger fabric: a distributed operating system for permissioned blockchains Androulaki et al., EuroSys'18 (If you don’t have ACM Digital Library access, the paper can be accessed either by following the link above directly from The Morning Paper blog site). This very well written paper outlines the design of HyperLedger Fabric and the rationales for many … Continue reading Hyperledger fabric: a distributed operating system for permissioned blockchains

ForkBase: an efficient storage engine for blockchain and forkable applications

June 1, 2018May 28, 2018 ~ adriancolyer

ForkBase: an efficient storage engine for blockchain and forkable applications Wang et al., arXiv'18 ForkBase is a data storage system designed to support applications that need a combination of data versioning, forking, and tamper proofing. The prime example being blockchain systems, but this could also include collaborative applications such as GoogleDocs. Today for example Ethereum … Continue reading ForkBase: an efficient storage engine for blockchain and forkable applications

zkLedger: privacy-preserving auditing for distributed ledgers

May 31, 2018May 27, 2018 ~ adriancolyer ~ 1 Comment

zkLedger: privacy-preserving auditing for distributed ledgers Narula et al., NSDI'18 Somewhat similarly to Solidus that we looked at late last year, zkLedger (presumably this stands for zero-knowledge Ledger) provides transaction privacy for participants in a permissioned blockchain setting. zkLedger also has an extra trick up its sleeve: it provides rich and fully privacy-preserving auditing capabilities. … Continue reading zkLedger: privacy-preserving auditing for distributed ledgers