Microsecond Latency at Terabyte Scale: Inside Valkey Search


May 27-29, 2026 • Computer History Museum, CaliforniaDate, time, and room will be announced soon.
Valkey Search enables low-latency search and aggregation on large operational datasets for use cases such as real-time personalization and anomaly detection. In this talk, we dive into the architecture behind Valkey Search and examine how search workloads behave differently from traditional Valkey access patterns. We will walk through how queries execute across shards and within shards, and how the module’s multithreading and coordination model balances throughput, latency, and freshness. Attendees will leave with practical knowledge on cluster topology, query patterns, and the performance and consistency tradeoffs that matter when running search on hot data in production.
Speakers

Allen Samuels is a Principal Engineer at AWS. He is passionate about distributed, performant systems. When not travelling the world for pleasure or playing duplicate bridge, Allen can be found in San Jose, California.

Karthik Subbarao is a Senior Engineer at AWS who is passionate about distributed systems, databases, Rust, and, in general, innovating through software development / technology.

