Revision 171
Articles and updates:
What Is OpenTelemetry and Why It Matters (link)
The Human Infrastructure: How Netflix Built the Operations Layer Behind Live at Scale (link)
K3s on On-Prem Infrastructures the GitOps Way: Writing a Custom k0rdent Template from Scratch (link)
The On-Call Problem AI Can Actually Solve (link)
Introducing Pyroscope 2.0: faster, more cost-effective continuous profiling at scale (link)
Auto-diagnosing Kubernetes alerts with HolmesGPT and CNCF tools (link)
How Skyscanner scales OpenTelemetry: managing collectors across 24 production clusters (link)
Simplifying Prometheus metrics collection across your AWS infrastructure (link)
From Ingress NGINX to Higress: migrating 60+ resources in 30 minutes with AI (link)
Introducing o11y-bench: an open benchmark for AI agents running observability workflows (link)
Deprecating OpenTracing compatibility requirements (link)
K8 Sidecars: Gotta Drop ‘em All! (link)
Why AI agents burn tokens on every reliability query (link)



