Revision 128
Articles and updates:
What I Really Mean When I Say “Good Communication” in Incident Response (link)
Load testing: Prepare for the growth you dream of! (link)
ClickStack: A High-Performance OSS Observability Stack on ClickHouse (link)
Logs, Metrics, Traces… Leaks? The Case for Auditable Observability (link)
Understanding and optimizing resource consumption in Prometheus (link)
Open Data Standards: Postgres, OTel, and Iceberg (link)
Building a Distributed Cache for S3 (link)
So You Wanna Be a Startup SRE? Read This First. (link)
ELK alternative: Modern log management setup with OpenTelemetry and OpenSearch (link)
Mastering the OpenTelemetry Transformation Language (OTTL) (link)
We're open sourcing CRE and preq to make it easier for humans and agents to find and fix reliability problems (link)
Kubernetes Debug Profiles (link)
Kubernetes CPU Metrics in the kubeletstats Receiver: Transition from .cpu.utilization to .cpu.usage (link)
Securing Kubernetes Traffic with Calico Ingress Gateway (link)
Mastering Kubernetes Migrations From Planning to Execution (link)
Create rich, up-to-date visualizations of your AWS infrastructure with Cloudcraft in Datadog (link)
Database observability: How OpenTelemetry semantic conventions improve consistency across signals (link)
Exposing OTel Collector in Kubernetes with Gateway API & mTLS (link)
Kubernetes Infrastructure Design Assessment: Optimizing Your Cloud-Native Foundation (link)
Simplify Kubernetes Security With Kyverno and OPA Gatekeeper (link)
Concrete Applications of Purposeful Instrumentation (link)
Your Collector, Your Rules: Introducing BYOC and the OpenTelemetry Distribution Builder (link)
Monitor OpenTelemetry-native metrics with Datadog (link)