Medea: scheduling of long running applications in shared production clusters Garefalakis et al., EuroSys'18 (If you don’t have ACM Digital Library access, the paper can be accessed by following the link above directly from The Morning Paper blog site). We’re sticking with schedulers today, with a really interesting system called Medea, which is designed …
Optimus: an efficient dynamic resource scheduler for deep learning clusters
Optimus: an efficient dynamic resource scheduler for deep learning clusters Peng et al., EuroSys'18 It’s another paper promising to reduce your deep learning training times today. But instead of …
Improving the expressiveness of deep learning frameworks with recursion
Improving the expressiveness of deep learning frameworks with recursion Jeong, Jeong et al., EuroSys'18 Last week we looked at the embedded dynamic control flow operators in TensorFlow. In today’s …
BDS: A centralized near-optimal overlay network for inter-datacenter data replication
BDS: A centralized near-optimal overlay network for inter-datacenter data replication Zhang et al., EuroSys'18 This is the story of how inter-datacenter multicast transfers at Baidu were sped up by a …
Dynamic control flow in large-scale machine learning
Dynamic control flow in large-scale machine learning Yu et al., EuroSys'18 In 2016 the Google Brain team published a paper giving an overview of TensorFlow, "TensorFlow: a system for …
Reducing DRAM footprint with NVM in Facebook
Reducing DRAM footprint with NVM in Facebook Eisenman et al., EuroSys'18 ...to the best of our knowledge, this is the first study on the usage of NVM devices in …
ServiceFabric: a distributed platform for building microservices in the cloud
ServiceFabric: a distributed platform for building microservices in the cloud Kakivaya et al., EuroSys'18 Microsoft’s Service Fabric powers many of Azure’s critical services. It’s been in development for around …
Hyperledger fabric: a distributed operating system for permissioned blockchains
Hyperledger fabric: a distributed operating system for permissioned blockchains Androulaki et al., EuroSys'18 This very well-written paper outlines the design of Hyperledger Fabric and the rationales for many …
ForkBase: an efficient storage engine for blockchain and forkable applications
ForkBase: an efficient storage engine for blockchain and forkable applications Wang et al., arXiv'18 ForkBase is a data storage system designed to support applications that need a combination of data versioning, forking, and tamper proofing. The prime example is blockchain systems, but this could also include collaborative applications such as Google Docs. Today, for example, Ethereum …
zkLedger: privacy-preserving auditing for distributed ledgers
zkLedger: privacy-preserving auditing for distributed ledgers Narula et al., NSDI'18 Somewhat like Solidus, which we looked at late last year, zkLedger (presumably this stands for zero-knowledge Ledger) provides transaction privacy for participants in a permissioned blockchain setting. zkLedger also has an extra trick up its sleeve: it provides rich and fully privacy-preserving auditing capabilities. …