Why does deep and cheap learning work so well?

Lin & Tegmark, 2016. Deep learning works remarkably well, and has helped dramatically improve the state-of-the-art in areas ranging from speech recognition, translation, and visual object recognition to drug discovery, genomics, and automatic game playing. However, it is still not fully understood why deep learning works …

Texture networks: feed-forward synthesis of textures and stylized images

Ulyanov et al., arXiv, March 2016. During the summer break I mostly stayed away from news feeds and Twitter, which induces terrible FOMO (Fear Of Missing Out) to start with. What great research was published / discussed that I missed? Was there a major industry announcement …

Mastering the game of Go with deep neural networks and tree search

Silver, Huang et al., Nature vol. 529, 2016. Pretty much everyone has heard about AlphaGo’s tremendous Go-playing success, beating the European champion by 5 games to 0. In all the excitement at the time, less was written about how AlphaGo actually worked …

Deep neural networks for YouTube recommendations

Covington et al., RecSys '16. The lovely people at InfoQ have been very kind to The Morning Paper, producing beautiful-looking "Quarterly Editions." Today's paper choice was first highlighted to me by InfoQ's very own Charles Humble. In it, Google describe how they overhauled the YouTube recommendation system using …

End-to-end learning of semantic role labeling using recurrent neural networks

Zhou & Xu, International Joint Conference on Natural Language Processing, 2015. Collobert’s 2011 paper, which we looked at yesterday, represented a turning point in NLP: it achieved state-of-the-art performance on part-of-speech tagging (POS), chunking, and named entity recognition (NER) using …

Building end-to-end dialogue systems using generative hierarchical neural network models

Serban et al., AAAI 2016. After reading a few of these papers on generative, non-goal-driven dialogue systems, I’ve ended up both impressed by the early results and the direction they point in, and somewhat underwhelmed by the potential for this technology to …

Incorporating (a) copying mechanism in sequence to sequence learning

Gu et al., 2016, with a side-helping of Neural machine translation by jointly learning to align and translate, Bahdanau et al., ICLR 2015. Today’s paper shows how the sequence-to-sequence conversational model we looked at yesterday can be made to seem more natural by including a “copying mechanism” …

A survey of available corpora for building data-driven dialogue systems

Serban et al., 2015. Bear with me, it’s more interesting than it sounds :). Yes, this (46-page) paper does include a catalogue of data sets with dialogues from different domains, but it also includes a high-level survey of techniques that are used in building …

Semi-supervised sequence learning

Dai & Le, NIPS 2015. The sequence-to-sequence learning approach we looked at yesterday has been used for machine translation, text parsing, image captioning, video analysis, and conversational modeling. In Semi-supervised sequence learning, Dai & Le use a clever twist on the sequence-to-sequence approach to enable it to be used …