Revision 5
Articles
🔥 Why and How eBay Pivoted to OpenTelemetry (link)
🔥 A guide to making on-call holidays suck less (link)
Toil: Still Plaguing Engineering Teams - by PagerDuty (link)
Improving Application Availability with Pod Readiness Gates (link)
awesome-scalability is a collection of patterns for scalable, reliable, and performant large-scale systems
How to handle Kubernetes health probes (to avoid a Black Friday outage) - by DoorDash (link)
The principles of chaos engineering (link)
How Twilio replaced their data pipeline with zero downtime (link)
Bouncer: Simple AWS Auto Scaling Rollovers - by Palantir (link)
Other updates and projects
GitHub Availability Report: November 2022 (link)
Vaulty is a tool that helps you securely share passwords - free and paid versions available (link)
Nova helps you find outdated or deprecated Helm charts running in your cluster (GitHub)