Achieving 100Gbps intrusion prevention on a single server

November 16, 2020October 20, 2025 ~ adriancolyer

Achieving 100 Gbps intrusion prevention on a single server, Zhao et al., OSDI’20 Papers-we-love is hosting a mini-event this Wednesday (18th) where I’ll be leading a panel discussion including one of the authors of today’s paper choice: Justine Sherry. Please do join us if you can. We always want more! This stems from a combination of Jevon’s paradox … Continue reading Achieving 100Gbps intrusion prevention on a single server

Taiji: managing global user traffic for large-scale Internet services at the edge

November 15, 2019November 15, 2019 ~ adriancolyer ~ 1 Comment

Taiji: managing global user traffic for large-scale internet services at the edge Xu et al., SOSP'19 It’s another networking paper to close out the week (and our coverage of SOSP’19), but whereas Snap looked at traffic routing within the datacenter, Taiji is concerned with routing traffic from the edge to a datacenter. It’s been in … Continue reading Taiji: managing global user traffic for large-scale Internet services at the edge

Snap: a microkernel approach to host networking

November 11, 2019November 10, 2019 ~ adriancolyer ~ 7 Comments

Snap: a microkernel approach to host networking Marty et al., SOSP'19 This paper describes the networking stack, Snap, that has been running in production at Google for the last three years+. It’s been clear for a while that software designed explicitly for the data center environment will increasingly want/need to make different design trade-offs to … Continue reading Snap: a microkernel approach to host networking

RPCValet: NI-driven tail-aware balancing of µs-scale RPCs

May 20, 2019May 16, 2019 ~ adriancolyer

RPCValet: NI-driven tail-aware balancing of µs-scale RPCs Daglis et al., ASPLOS'19 Last week we learned about the [increased tail-latency sensitivity of microservices based applications with high RPC fan-outs. Seer uses estimates of queue depths to mitigate latency spikes on the order of 10-100ms, in conjunction with a cluster manager. Today’s paper choice, RPCValet, operates at … Continue reading RPCValet: NI-driven tail-aware balancing of µs-scale RPCs

Slim: OS kernel support for a low-overhead container overlay network

March 22, 2019March 16, 2019 ~ adriancolyer ~ 1 Comment

Slim: OS kernel support for a low-overhead container overlay network Zhuo et al., NSDI'19 Container overlay networks rely on packet transformations, with each packet traversing the networking stack twice on its way from the sending container to the receiving container. There are CPU, throughput, and latency overheads associated with those traversals. In this paper, we … Continue reading Slim: OS kernel support for a low-overhead container overlay network

Understanding lifecycle management complexity of datacenter topologies

March 20, 2019March 14, 2019 ~ adriancolyer ~ 1 Comment

Understanding lifecycle management complexity of datacenter topologies Zhang et al., NSDI'19 There has been plenty of interesting research on network topologies for datacenters, with Clos-like tree topologies and Expander based graph topologies both shown to scale using widely deployed hardware. This research tends to focus on performance properties such as throughput and latency, together with … Continue reading Understanding lifecycle management complexity of datacenter topologies

Datacenter RPCs can be general and fast

March 18, 2019March 14, 2019 ~ adriancolyer ~ 9 Comments

Datacenter RPCs can be general and fast Kalia et al., NSDI'19 We’ve seen a lot of exciting work exploiting combinations of RDMA, FPGAs, and programmable network switches in the quest for high performance distributed systems. I’m as guilty as anyone for getting excited about all of that. The wonderful thing about today’s paper, for which … Continue reading Datacenter RPCs can be general and fast

The case for network-accelerated query processing

January 28, 2019January 24, 2019 ~ adriancolyer ~ 2 Comments

The case for network-accelerated query processing Lerner et al., CIDR'19 Datastores continue to advance on a number of fronts. Some of those that come to mind are adapting to faster networks (e.g. ‘FARM: Fast Remote Memory’) and persistent memory (see e.g. ‘Let’s talk about storage and recovery methods for non-volatile memory database systems’), deeply integrating … Continue reading The case for network-accelerated query processing

BDS: A centralized near-optimal overlay network for inter-datacenter data replication

June 8, 2018June 1, 2018 ~ adriancolyer ~ 1 Comment

BDS: A centralized near-optimal overlay network for inter-datacenter data replication Zhang et al., EuroSys'18 (If you don’t have ACM Digital Library access, the paper can be accessed either by following the link above directly from The Morning Paper blog site). This is the story of how inter-datacenter multicast transfers at Baidu were sped-up by a … Continue reading BDS: A centralized near-optimal overlay network for inter-datacenter data replication

Stateless datacenter load-balancing with Beamer

May 3, 2018April 27, 2018 ~ adriancolyer ~ 3 Comments

Stateless datacenter load-balancing with Beamer Olteanu et al., NSDI'18 We’ve spent the last couple of days looking at datacenter network infrastructure, but we didn’t touch on the topic of load balancing. For a single TCP connection, you want all of the packets to end up at the same destination. Logically, a load balancer (a.k.a. ‘mux’) … Continue reading Stateless datacenter load-balancing with Beamer