How Application-Level Priority Management Keeps Latency Low and Throughput High

March 22, 2022

Throughput and latency are in constant tension. Optimizing for throughput requires running machines at high utilization, which increases queuing delays and hurts latency. This Linux Foundation talk by ScyllaDB CTO Avi Kivity shows how high throughput and low latency can both be achieved in a single application by using application-level priority scheduling.
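The link between utilization and queuing delay can be made concrete with a textbook queueing model. The sketch below is an illustration only and is not taken from the talk: it assumes a single server with Poisson arrivals and exponentially distributed service times (an M/M/1 queue), for which the mean time a request spends in the system is the service time divided by (1 - utilization). The function name and the 1 ms service time are hypothetical.

```python
def mean_latency_ms(service_time_ms: float, utilization: float) -> float:
    """Mean time in system for an M/M/1 queue: W = S / (1 - rho).

    service_time_ms: mean time to serve one request on an idle server.
    utilization:     fraction of time the server is busy (rho), 0 <= rho < 1.
    """
    if not 0.0 <= utilization < 1.0:
        raise ValueError("utilization must be in [0, 1)")
    return service_time_ms / (1.0 - utilization)


if __name__ == "__main__":
    # Assumed 1 ms service time; only the utilization changes.
    for rho in (0.5, 0.9, 0.99):
        latency = mean_latency_ms(1.0, rho)
        print(f"utilization {rho:.0%}: mean latency {latency:.1f} ms")
```

Under these assumptions, raising utilization from 50% to 99% takes mean latency from about 2 ms to about 100 ms, even though the work per request is unchanged. That growth is the queuing delay the abstract refers to.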

Vertical scalability, or how a single node is a network
Shard-per-core, shared-nothing design
Async everywhere
Schedulers for CPU and IO
Prioritizing multiple workloads on the same cluster
Getting the most out of the Linux OS and C++20
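The scheduler and workload-prioritization topics above can be pictured with a small proportional-share scheduler. This is a minimal sketch under stated assumptions, not ScyllaDB's or Seastar's implementation: the class names, the share values, and the idea of representing each task by its CPU cost are all hypothetical. Each queue accumulates "virtual runtime" (cost divided by shares), and the scheduler always runs the queue that has accumulated the least.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class TaskQueue:
    """A class of work, e.g. foreground queries or background compaction."""
    name: str
    shares: int                 # relative priority: more shares, more CPU time
    vruntime: float = 0.0       # CPU time consumed so far, divided by shares
    tasks: deque = field(default_factory=deque)  # each task is its CPU cost


class ShareScheduler:
    """Hypothetical proportional-share scheduler over several task queues."""

    def __init__(self):
        self.queues = []

    def add_queue(self, name, shares):
        queue = TaskQueue(name, shares)
        self.queues.append(queue)
        return queue

    def run_next(self):
        """Run one task from the runnable queue with the least vruntime.

        Returns the name of the queue that ran, or None if all are empty.
        """
        runnable = [q for q in self.queues if q.tasks]
        if not runnable:
            return None
        queue = min(runnable, key=lambda q: q.vruntime)
        cost = queue.tasks.popleft()
        # Charging cost / shares means a queue with 4x the shares is
        # charged 4x less per task, so it is picked about 4x as often.
        queue.vruntime += cost / queue.shares
        return queue.name


if __name__ == "__main__":
    sched = ShareScheduler()
    queries = sched.add_queue("queries", shares=800)
    compaction = sched.add_queue("compaction", shares=200)
    queries.tasks.extend([1.0] * 1000)
    compaction.tasks.extend([1.0] * 1000)

    picks = [sched.run_next() for _ in range(100)]
    print("queries ran:", picks.count("queries"))
    print("compaction ran:", picks.count("compaction"))
```

With both queues backlogged, the latency-sensitive queue gets roughly four out of every five slots while background work still makes progress. A production scheduler would also need to handle preemption and queues that go idle and later resume, which this sketch omits.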
