Revision 37
Articles and updates
The Pyramid of Alerting (link)
How Uber Optimized Cassandra Operations At Scale (link)
How we combined OpenTelemetry traces with Prometheus metrics to build a powerful alerting mechanism (link)
Monitoring of AWS EKS using AWS Distro for OpenTelemetry (ADOT) and Amazon Managed Service for Prometheus (AMP) (link)
Incident Review for Site-wide Outage for GitLab.com - Stale Terraform Pipeline (link)
CDN Observability: Why You Must Monitor Your Extended Infrastructure (link)
The Future of VMs on Kubernetes: Building on KubeVirt (link)
Plaid: pain-free deployments at global scale (link)
Crossplane vs. Terraform (link)
Harnessing Komiser and Grafana for Custom Cloud Dashboards (link)
Rethinking infrastructure as code from scratch (link)