the morning paper

a random walk through Computer Science research, by Adrian Colyer

Month: December 2020

An overview of end-to-end entity resolution for big data

December 14, 2020 / December 10, 2020 ~ Adrian Colyer
An overview of end-to-end entity resolution for big data, Christophides et al., ACM Computing Surveys, Dec. 2020, Article No. 127

The ACM Computing Surveys are always a great way to get a quick orientation in a new subject area, and hot off the press is this survey on the entity resolution (aka record linking) problem. ...

Bias in word embeddings

December 8, 2020 / December 7, 2020 ~ Adrian Colyer
Bias in word embeddings, Papakyriakopoulos et al., FAT*’20

There are no (stochastic) parrots in this paper, but it does examine bias in word embeddings, and how that bias carries forward into models that are trained using them. There are definitely some dangers to be aware of here, but also some cause for hope as we ...