Understanding deep learning requires re-thinking generalization

May 11, 2017 ~ May 14, 2017 ~ adriancolyer ~ 34 Comments

Zhang et al., ICLR'17. This paper has a wonderful combination of properties: the results are easy to understand, somewhat surprising, and then leave you pondering over what it all might mean for a long while afterwards! The question the authors set out to answer was this: What is it …

Neural architecture search with reinforcement learning

May 10, 2017 ~ May 6, 2017 ~ adriancolyer ~ 7 Comments

Zoph & Le, ICLR'17. Earlier this year we looked at 'Large scale evolution of image classifiers', which used an evolutionary algorithm to guide a search for the best network architectures. In today's paper, Zoph & Le also demonstrate that learning network architectures (and also in their case recurrent cell …

Efficient memory disaggregation with Infiniswap

May 5, 2017 ~ April 30, 2017 ~ adriancolyer ~ 3 Comments

Gu et al., NSDI'17. If we move performance numbers onto a human scale (let 1ns of processor time = 1 second of human time) then it's easier to get an intuition - for me at least - of the relative cost of different operations. In this world, it takes …

CherryPick: Adaptively unearthing the best cloud configurations for big data analytics

May 4, 2017 ~ May 5, 2017 ~ adriancolyer ~ 14 Comments

Alipourfard et al., NSDI'17. For big data analytics jobs, especially recurring jobs, finding a good cloud configuration (number and type of machines, CPU, memory, disk and network options) can make a big difference to overall cost and runtimes. Likewise, a poor choice can seriously …

vCorfu: A cloud-scale object store on a shared log

May 3, 2017 ~ April 30, 2017 ~ adriancolyer ~ 2 Comments

Wei et al., NSDI'17. vCorfu builds on the idea of a distributed shared log that we looked at yesterday with CORFU, to construct a distributed object store. We show that vCorfu outperforms Cassandra, a popular state-of-the-art NoSQL store, while providing strong consistency (opacity, read-own-writes), efficient transactions, …

Corfu: A distributed shared log

May 2, 2017 ~ May 2, 2017 ~ adriancolyer ~ 12 Comments

Balakrishnan et al., ACM TOCS, 2013. (If you experience any difficulty in accessing the pdf in the above link please let me know, it should be open for you on the ACM DL. UPDATE: many readers are still seeing a paywall for the above paper link, here's an alternative open …

The design, implementation and deployment of a system to transparently compress hundreds of petabytes of image files for a file storage service

May 1, 2017 ~ April 30, 2017 ~ adriancolyer ~ 3 Comments

Horn et al., NSDI'17. When I first started reading, I thought this paper was going to be about a new compression format Dropbox had introduced for JPEG images. And it is about that, …

FM Backscatter: Enabling connected cities and smart fabrics

April 28, 2017 ~ April 20, 2017 ~ adriancolyer ~ 4 Comments

Wang et al., NSDI'17. If we want to connect all the things, then we need a means of sending and/or receiving information at each thing. These transmissions require power, and no-one wants to have to plug in chargers or keep swapping batteries for endless everyday objects. So …

ViewMap: Sharing private in-vehicle dashcam videos

April 27, 2017 ~ April 20, 2017 ~ adriancolyer

Kim et al., NSDI'17. In the world of sensor-laden connected cars that we're rushing towards, ViewMap addresses an interesting question: how can we use the information collected by those cars for common good, without significant invasion of privacy? It raises deeper questions too about the limits of state surveillance …

Improving user perceived page load time using gaze

April 26, 2017 ~ April 20, 2017 ~ adriancolyer ~ 2 Comments

Kelton, Ryoo, et al., NSDI 2017. I feel like I'm stretching things a little bit including this paper in an IoT flavoured week, but it does at least bridge from the physical world to the virtual, if only via a webcam. What's really interesting here to …