Articles
🔥 How DoorDash Ensures Velocity and Reliability through Policy Automation (link)
How we diagnosed and resolved Redis latency spikes with BPF and other tools (link)
Awesome SLI/SLO list
The secret to reducing on-call engineering team stress (link)
Hardening Palantir’s Kubernetes Infrastructure with Cilium (link)
Dev to SRE handover checklist - a reddit post
GraphQL, meet LiveGraph: a real-time data system at scale - by Figma (link)
Projects
kubeshark, think TCPDump and Wireshark re-invented for Kubernetes
awesome-containerized-security, a collection of tools to improve your containerized apps security posture