Revision 10
New year, new ideas and experimentation!
One of the ideas I had was to interview SREs from various backgrounds and companies as part of this newsletter.
I would publish their responses here to spread their knowledge and show how others do SRE!
What do you think?
If you like the idea, are there specific people you would recommend for an interview?
Articles and updates
🔥 Developing a data-driven tool to estimate the cost of incidents (link)
5 Practices for Kubernetes Operations with Amazon EKS (link)
GitHub Availability Report: December 2022 (link)
Why Your Monitoring Dashboard May Be Feeding You Phantom Metrics (link)
Improving your monitoring setup by integrating Cloudflare’s analytics data into Prometheus and Grafana (link)
Aptakube is a new GUI tool for managing multiple Kubernetes clusters (reddit)
Effective Site Reliability Engineering (SRE) requires an observability strategy - by Deloitte (link)
90 days of AWS EKS in Production (link)
Understanding gRPC Concepts, Use Cases & Best Practices (link)