Client-Side Load Balancing at a Million Requests Per Second
How we built an in-process client-side load balancer for a million requests per second of internal fan-out traffic,... Read more...
2026
Introducing Lightstep UQL to PromQL Translator
Automate telemetry query migration from Lightstep UQL to PromQL with this open-source Go SDK and Web UI. Read more...
2026
Rejecting Invalid Ingress Routes at Apply Time
How Zalando used Skipper as a validating admission webhook to reject invalid filters and predicates at apply time,... Read more...
2026
The Day Our Own Queries DoSβed Us: Inside Zalando Search
Once upon a time, during a normal Sunday, our team ran into an unexpected challenge: an Elasticsearch cluster that... Read more...
2025
Dead Ends or Data Goldmines? Investment Insights from Two Years of AI-Powered Postmortem Analysis
Your incidents hold the blueprint to your most strategic infrastructure wins β if you're listening correctly. Read more...
2025
Introducing Lightstep Receiver for OpenTelemetry Collector
OpenTelemetry Lightstep Receiver helps you ingest traces generated by legacy Lightstep tracers in a simple way. Read more...
2025
OpenTelemetry for JavaScript Observability at Zalando
How Zalando improved observability for Node.js and web applications using OpenTelemetry Read more...
2024
Node.js and the tale of worker threads
Join me on a Friday night on-call investigation into a rogue Node.js service. Read more...
2024
End-to-end test probes with Playwright
Learn how we set up reliable automated end-to-end test probes for our Zalando website using Playwright Read more...
2024
Failing to Auto Scale Elasticsearch in Kubernetes
A story of operational failure in large scale Elastisearch installation including the root cause analysis and... Read more...
2024









