Data Shapley: equitable valuation of data for machine learning

Ghorbani & Zou, ICML'19. It’s incredibly difficult from afar to make sense of the almost 800 papers published at ICML this year! In practical terms I was reduced to looking at papers highlighted by others (e.g. via best paper awards), and scanning the list …
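The core idea of the paper — valuing each training point by its average marginal contribution to model performance across random permutations — can be sketched with a simple Monte Carlo estimator. This is a hypothetical illustration, not the authors’ code; `value` is an assumed user-supplied black box that scores a model trained on a subset of data indices:

```python
import numpy as np

def mc_shapley(value, n, permutations=100, seed=0):
    """Monte Carlo estimate of data Shapley values.

    value(S) -> float: performance of a model trained on the data points
    whose indices are in the list S (an assumed black-box function).
    Returns each point's average marginal contribution over permutations.
    """
    rng = np.random.default_rng(seed)
    phi = np.zeros(n)
    for t in range(1, permutations + 1):
        perm = rng.permutation(n)
        prev, S = value([]), []
        for i in perm:
            S.append(i)
            cur = value(S)
            # Running mean of point i's marginal contribution (cur - prev).
            phi[i] += ((cur - prev) - phi[i]) / t
            prev = cur
    return phi

# Sanity check with an additive 'game': when value() is a plain sum of
# per-point weights, each point's Shapley value is exactly its weight.
weights = np.array([1.0, 2.0, 3.0])
phi = mc_shapley(lambda S: sum(weights[i] for i in S), n=3, permutations=10)
```

In the paper the expensive part is that `value(S)` means retraining (or fine-tuning) a model per subset, which is why truncation and other approximations matter in practice.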

Software engineering for machine learning: a case study

Amershi et al., ICSE'19. Previously on The Morning Paper we’ve looked at the spread of machine learning through Facebook and Google, and some of the lessons learned together with processes and tools to address the challenges arising. Today it’s the turn of Microsoft. More specifically, we’ll be …

Machine learning systems are stuck in a rut

Barham & Isard, HotOS'19. In this paper we argue that systems for numerical computing are stuck in a local basin of performance and programmability. Systems researchers are doing an excellent job improving the performance of 5-year-old benchmarks, but gradually making it harder to explore innovative machine …

A case for managed and model-less inference serving

Yadwadkar et al., HotOS'19. HotOS’19 is presenting me with something of a problem, as there are so many interesting-looking papers in the proceedings this year that it’s going to be hard to cover them all! As a transition from the SysML papers we’ve been looking at recently, …

PyTorch-BigGraph: a large-scale graph embedding system

Lerer et al., SysML'19. We looked at graph neural networks earlier this year, which operate directly over a graph structure. Via graph autoencoders or other means, another approach is to learn embeddings for the nodes in the graph, and then use these embeddings as inputs into a (regular) neural …
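To make the node-embedding idea concrete, here is a toy sketch — emphatically not PyTorch-BigGraph itself, which adds partitioning, negative-sampling batching, and multi-relation scoring at billion-edge scale. It trains embeddings with a dot-product score, logistic loss, and random negative sampling in plain NumPy; all names and hyperparameters are illustrative assumptions:

```python
import numpy as np

def train_embeddings(edges, n_nodes, dim=8, epochs=200, lr=0.1, seed=0):
    """Learn node embeddings so that linked nodes score higher (dot product)
    than randomly corrupted pairs. SGD on a logistic loss, one negative
    sample per positive edge."""
    rng = np.random.default_rng(seed)
    E = rng.normal(scale=0.1, size=(n_nodes, dim))
    edge_set = set(edges)
    for _ in range(epochs):
        for (u, v) in edges:
            # Negative sample: corrupt the target node (assumes the graph
            # is sparse enough that a non-neighbour is easy to find).
            w = int(rng.integers(n_nodes))
            while w == u or (u, w) in edge_set:
                w = int(rng.integers(n_nodes))
            s_pos, s_neg = E[u] @ E[v], E[u] @ E[w]
            # Gradients of -log sigma(s_pos) - log(1 - sigma(s_neg)).
            g_pos = 1.0 / (1.0 + np.exp(-s_pos)) - 1.0
            g_neg = 1.0 / (1.0 + np.exp(-s_neg))
            Eu = E[u].copy()
            E[u] -= lr * (g_pos * E[v] + g_neg * E[w])
            E[v] -= lr * g_pos * Eu
            E[w] -= lr * g_neg * Eu
    return E

# Two disconnected triangles: embeddings should cluster by component.
clusters = ([0, 1, 2], [3, 4, 5])
edges = [(u, v) for c in clusters for u in c for v in c if u != v]
E = train_embeddings(edges, n_nodes=6)
```

Once trained, the embedding table `E` can be fed as features into a downstream (regular) neural network, which is the workflow the excerpt above alludes to.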

Towards federated learning at scale: system design

Bonawitz et al., SysML'19. This is a high-level paper describing Google’s production system for federated learning. One of the most interesting things to me here is simply to know that Google are working on this, have a first version in production working with tens of millions …

Continuous integration of machine learning models with ease.ml/ci

Continuous integration of machine learning models with ease.ml/ci: towards a rigorous yet practical treatment, Renggli et al., SysML'19. Developing machine learning models is no different from developing traditional software, in the sense that it is also a full life cycle involving design, implementation, tuning, testing, and deployment. As machine learning models are used in more …

The why and how of nonnegative matrix factorization

Gillis, arXiv 2014 (from ‘Regularization, Optimization, Kernels, and Support Vector Machines’). Last week we looked at the paper ‘Beyond news content,’ which made heavy use of nonnegative matrix factorisation. Today we’ll be looking at that technique in a little more detail. As the name suggests, ‘The Why …
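For a concrete feel of the technique, here is a minimal NMF sketch using the classic Lee–Seung multiplicative update rules in NumPy — an illustration only, not code from the survey; the rank `r`, iteration count, and damping constant `eps` are arbitrary choices:

```python
import numpy as np

def nmf(X, r, iters=1000, eps=1e-9, seed=0):
    """Approximate a nonnegative matrix X (m x n) as W @ H, with
    W (m x r) and H (r x n) nonnegative, by minimising ||X - WH||_F
    via multiplicative updates (which preserve nonnegativity)."""
    m, n = X.shape
    rng = np.random.default_rng(seed)
    W = rng.random((m, r)) + 0.1
    H = rng.random((r, n)) + 0.1
    for _ in range(iters):
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

# A rank-2 nonnegative matrix (row 3 = row 1 + row 2) should be
# recovered closely with r=2.
X = np.array([[1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0],
              [1.0, 1.0, 2.0]])
W, H = nmf(X, r=2)
```

The nonnegativity constraint is what gives NMF its interpretable "parts-based" factors — each row of `X` is an additive combination of the rows of `H` — which is exactly the property the ‘Beyond news content’ paper exploited.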